Help
Table of Contents
- Searching
- Navigating articles and pages
- Optical Character Recognition
- How to correct OCR text
- User accounts
- Printing
- Technical requirements
Searching
Types of searchable text in this collection
This collection contains various kinds of searchable Hawaiian newspaper text:
- Article text: produced most recently, this text is produced as via automatic OCR, including segmenting each newspaper into articles, and each article headline has been corrected by humans. This text is available for all ~69,000 pages in this collection.
- Page text: produced at an earlier time, some of this text was produced via automatic OCR (optical character recognition) and some of it was produced by human transcribers directly. This text is divided into pages and is available for ~21,000 pages out of the total of ~69,000 pages in this collection.
- Article titles: includes only article titles, and not article text.
- Article translations: some articles have been translated, this is the text of those translations.
- Dedication text: This text was added by people who transcribed Page text as a dedication for their transcription work.
Simple search
Perform a simple search by typing keywords into the search box and clicking . The search engine will return results that include all of your search terms.
Punctuation and case will not affect your search results. For example, using either of the two terms below will give you identical results:
Mr. K. Kealawaa
Mr k kealawaa
Exact Phrase Search
You can search for an exact phrase by placing quotation marks around your search terms. Typing [ "ʻike kuʻu maka" ] for example will return only those results which contain the words ʻike kuʻu maka together in that exact order.
Boolean Operators
Boolean operators AND, OR and NOT can be used to refine your search results. They must be written as capital (uppercase) letters. AND will return only results that contain all of the terms or phrases; NOT will exclude results containing the second term or phrase; and OR will return results that contain either of the terms or phrases.
For example, [ hālawa AND molokaʻi ] will return results containing both the words molokaʻi and hālawa in the same file. This is the same as just typing the two words as a basic search; for example [ hālawa molokaʻi ] or [ molokaʻi hālawa ].
Searching for [ hālawa NOT molokaʻi ] will return only those files which contain the word hālawa where the word molokaʻi does not also occur.
Finally, searching for [ hālawa OR molokaʻi ] will return files which contain either the word hālawa or the word molokaʻi.
Boolean searches also work with quoted phrases. For example [ “molokaʻi nui a hina” AND “maui o kama” ] will return the expected results showing only files which contain both of those phrases.
Advanced search
Click to show the "Advanced search" popup. This allows you to limit your search results by:
- A date range
- One or more publications
It also allows you to search within:
- Article text, Page text, Article titles and Article translations
- Page text
- Article titles
- Article translations
- Dedication text
Advanced query syntax
Query terms can be boosted to increase their importance in the search, thus changing the order of the search results. This is done by adding "^" and a boost factor at the end of the term. For example:
hanalei river^2 treats "river" as more important than "hanalei" when ranking the search results returned.
Wildcard searches can be performed by including "?" for a single character wildcard in the query term. For example:
i?i - will match all three letter words beginning and ending with ‘i’ such as ili, ihi, iki, iwi
Include "*" for a multiple character wildcard in the query term. For example:
hoʻo* - will match all words starting with "hoʻo" such as hoʻomalu, hoʻolaha and hoʻopapa
A fuzzy search will retrieve terms that are similar to a specified term without necessarily being an exact match. It can help to compensate for errors in the text due to the Optical Character Recognition process. Add "~1" to the end of individual terms, for example:
pekuna~1 will find terms like "peruna" and "pekina" as well as "pekana"
Proximity searching allows you to search for words that appear close together in the text. For example:
"Keoni Kamika"~3 will find results containing both the words "Keoni" and "Kamika" where they are no more than 3 words apart. As well as finding "Keoni Kamika" it will also find "Keoni L. Kamika", "Keoni Kalani Kamika", "Keoni Keawe-Kamika" and even "Kamika, Keoni".
Search filters
Any filters applied to the search are shown below the search box. You can remove these by clicking them, or remove them all by clicking "Clear all".
On the search results page, the area at the left of the page shows the most common values occurring in various categories in the search results. Selecting one of these facets applies it as a search filter.
Using Glottal Stops and Macrons
In all default searches on Kaniʻāina, the kahakō (macron) and ʻokina (glottal stop) are not significant, meaning that a search for [ pololei ] will find all occurances of pololei as well as pololeʻi. To enable the inclusion of kahakō and ʻokina, use the Search Settings area, described below.
Understanding Newspaper Search Result Differences
Newspaper results from the main Papakilo search may differ from this special Veridian newspaper search because the specific nature of the underlying datasets are different. If you are unable to find something via this search you may like to try a similar search from the main Papakilo homepage. Technical reasons for these differences include that this special Veridian newspaper search index tokenizes based on word coordinates on a page, so that words can be highlighted on e.g. newspaper pages, which is different to the tokenization of the main Papakilo search, which is more generalized because it needs to cater for a wide range of disparate datasets.
Navigating articles and pages
The document viewer is split into two parts: the right side shows the page images that make up the document, and the left side is used for displaying metadata and text.
Common navigation tasks:
Previous/next issue - use to go back to the previous issue in the publication, browse all issues of the publication, or advance to the next issue in the publication.
Previous/next search result - use to go back to the previous search result, return to the search results page, or advance to the next search result.
Maximise the viewing area - use in the upper right hand corner.
Zoom in/out - use the magnifying glass symbols in the upper right hand corner or use the wheel on your mouse.
Pan (move the image around the screen) - left-click and hold to drag the image around.
Previous/next page - use to go back to the previous page or show the next page in the issue.
Optical Character Recognition
Optical Character Recognition, or OCR, is a process by which software reads a page image and translates it into a text file by recognising the shapes of the letters (The NINCH Guide to Good Practice in the Digital Representation and Management of Cultural Heritage Materials).
OCR enables searching of large quantities of full-text data, but it is never 100% accurate. The level of accuracy depends on the print quality of the original issue, its condition at the time of microfilming, the level of detail captured by the microfilm scanner, and the quality of the OCR software. Issues with poor quality paper, small print, mixed fonts, multiple column layouts, or damaged pages may have poor OCR accuracy.
The searchable text and titles in this collection have been automatically generated using OCR software. They may not have been manually reviewed or corrected.
How to correct OCR text
OCR text corrections are saved to the database and will improve the service for all users by increasing the accuracy of search results.
There are two ways you can begin to correct text. From the document viewer:
- Select the article or page you want to correct. This will display the text in the left pane of the document viewer. Click on the "Correct this text" link that appears above this text.
- Right-click on the article or page image and select "Correct article text" or "Correct page text" from the options pop-up window.
The text correction interface is split into two parts: the right side shows the page images that make up the document, and the left side is used for editing the lines of text.
When you move your mouse over the page images in the right pane, the blocks making up the pages will highlight. You can scroll this view by dragging with the mouse, or zoom in/out using the buttons above the viewer. Clicking a highlighted block will select it and load a form for editing that block into the left pane.
Correct the text line by line. A red box is displayed in the right pane to help you determine what text should be included in the line. Once you have finished correcting text, click "Save". The changes you make will take effect immediately.
You can then make further corrections to the same block, move onto the next block by clicking the "Save & next" or "Next" button, select another block in the right pane, or exit the text correction view by clicking the "Return to viewing mode" link.
Clicking "Save & exit" instead of "Save" will save the changes and then return you to the normal viewing mode automatically.
Hint: Many web browsers include spell checking functionality and this can assist with your text correction by identifying misspelt words. If your web browser does not have this functionality, it's likely there is a spell checking add-on available (see your web browser's help for information on how to install add-ons).
For recommendations about topics such as punctuation, misspellings and illegible text, see General guidelines for text correction.
User accounts
You don’t have to create an account to use many of the features of this website. However if you choose to create an account you will gain access to additional features (e.g. clippings sets, comments, tags, OCR text corrections). You may also select to store a history of your searches and viewed items.
To create an account click register and enter your email address, first and last name and display name. Once submitted, you will receive an email containing a link to activate your user account and complete the registration process.
Any contributions (e.g. comments, tags, OCR text corrections) you make are "public" (i.e. all other users of the website can see them), and they are associated with your user account. For detailed information about the management of personal data in user accounts, see our Privacy policy.
Clippings sets
Clippings sets allow you to save articles or pages to uniquely named lists for later review. This feature is available to logged-in users only.
To create a clippings set, right-click on an article or page in the document viewer to reveal a pop-up pane. Select "Add to clippings set". From here you can choose to create a new list or add to an existing one, and make an optional note. Your clippings sets may be deleted, emailed, renamed or annotated within the "Clippings sets" tab of the My account page.
Comments
Logged-in users can add public comments to articles.
To add a comment, select an article in the document viewer by clicking on it. To the left of your screen is a pane displaying the article text. Scroll to the bottom of this pane until you reach the "Comments" section. Add your comments into the text box and click "Add comment".
To view your recent comments visit the My account page and choose the "Recently added comments" section from the "Recent activity" tab.
Tags
Logged-in users can add public tags to articles. Tags can be browsed and used to narrow down searches.
To add a tag, select an article in the document viewer by clicking on it. To the left of your screen is a pane displaying the article text. Scroll to the bottom of this pane until you reach the "Tags" section. Enter your tag into the text box and click "Add tag".
To view your recent tags visit the My account page and choose the "Recently added tags" section from the "Recent activity" tab.
Printing
Articles can be printed directly from your web browser.
If available, PDF versions of issues and pages can be downloaded for printing.
Technical requirements
In general, you only need a common web browser like Chrome, Firefox, Safari or Microsoft Edge to search and browse this collection.
To view or print PDFs, you will also need a PDF viewer like Adobe Acrobat Reader.