Historical Map Toponym Extraction for Efficient Information Retrieval


Ladislav Lenc and Jiří Martínek and Josef Baloun and Martin Prantl and Pavel Král
15th IAPR International Workshop on Document Analysis Systems (2022)

PDF

Abstract

The paper deals with detection, classification and recognition of toponyms in hand-drawn historical cadastral maps. Toponyms are local names of towns, villages and landscape features such as rivers, forests etc. The detected and recognized toponyms are utilized as keywords in an information retrieval system that allows intelligent and efficient searching in historical map collections. We create a novel annotated dataset that is freely available for research and educational purposes. Then, we propose a novel approach for toponym classification based on KAZE descriptor. Next we compare and evaluate several state-of-the-art methods for text and object detection on our toponym detection task. We further show the results of toponym text recognition using popular Tesseract engine.

Authors

BibTex

@InProceedings{10.1007/978-3-031-06555-2_12, author="Lenc, Ladislav and Martinek, Jiri and Baloun, Josef and Prantl, Martin and Kral, Pavel", editor="Uchida, Seiichi and Barney, Elisa and Eglin, V{\'e}ronique", title="Historical Map Toponym Extraction for Efficient Information Retrieval", booktitle="Document Analysis Systems", year="2022", publisher="Springer International Publishing", address="Cham", pages="171--183", abstract="The paper deals with detection, classification and recognition of toponyms in hand-drawn historical cadastral maps. Toponyms are local names of towns, villages and landscape features such as rivers, forests etc. The detected and recognized toponyms are utilized as keywords in an information retrieval system that allows intelligent and efficient searching in historical map collections. We create a novel annotated dataset that is freely available for research and educational purposes. Then, we propose a novel approach for toponym classification based on KAZE descriptor. Next we compare and evaluate several state-of-the-art methods for text and object detection on our toponym detection task. We further show the results of toponym text recognition using popular Tesseract engine.", isbn="978-3-031-06555-2" }
Back to Top