Building an efficient OCR system for historical documents with little training data
Jiří Martínek and Ladislav Lenc and Pavel KrálNeural Computing and Applications (2020)
Neural Networks | Image Processing
The main goal of this project is to make accessible archival resources from the Czech-Bavarian border region using state-of-the-art information technologies. It will be possible to search information based on geolocation. We also focus on a clear presentation and an effective search of the documents in a form of raster images. We further realize an intelligent full-text access to the printed documents in both Czech and German languages. The information will be available through an existing portal Porta Fontium.
More information at Modern Access to Historical Sources
Please, cite our article if you use any of the available resources.
This project has been supported by Cross-border Cooperation Program Czech Republic - Free State of Bavaria ETS Objective, 2014-2020.