Symbolbild mit einem Rollregal und Binärcode

Automated Text Recognition – Extracting Data via OCR/HTR

Automated or optical text recognition (OCR) is used to automatically capture text from digital images and thus generate searchable and analyzable data. The Mannheim University Library has many years of experience in digitization and with the use of various text recognition software.

The Research Data Center is happy to support researchers at the University of Mannheim along the entire workflow from digitization to layout and text recognition as well as training specialized models and structuring of the data.

Services

  • Consulting on automated text recognition (OCR) for research projects
  • OCR Recommender
  • Open OCR consultation hour: every 2nd Thursday of the month, from 3 to 4 p.m., without registration (link to Zoom meeting: ocr-bw.bib.uni-mannheim.de/sprechstunde, meeting ID: 682 8185 1819, ID code: 443071).

In our FAQs you will find answers to the most frequently asked questions about automated text recognition and the software used in the OCR-BW.

If the answer you are looking for is not listed, simply contact us by e-mail.

Projects and cooperations

If we can support you or if you have any questions, please do not hesitate to contact us.

Contact

Forschungsdatenzentrum (FDZ)

Forschungsdatenzentrum (FDZ)

Team: Irene Schumm, Phil Kolbe, David Morgan, Thomas Schmidt, Renat Shigapov, Christos Sidiropoulos, Larissa Will
University of Mannheim
Universitätsbibliothek Mannheim
Schloss Schneckenhof West
68161 Mannheim