How to report accessibility and content quality issues in HathiTrust
Answer
Most books on HathiTrust were digitized by scanning print volumes into image files. The text was then generated through automated OCR processes, which in most cases have not been reviewed by a human for accuracy.
To view the text from the book viewer, select Text-Only View.
Users can view the OCR text one page at a time.
PDF files downloaded from the HathiTrust Digital Library include the OCR text embedded within the document. This text can be extracted using PDF reader software such as Adobe Reader.
If a user encounters poor-quality or confusing OCR, they can request a correction by clicking Get Help in the upper-right corner of the book viewer and selecting Report a Problem.
HathiTrust can determine whether it’s possible to improve the quality by re-running OCR for a text.
See HathiTrust Accessibility for additional information.