Latent semantics in Named Entity Recognition

Michal Konkol and Tomáš Brychcín and Miloslav Konopík
Expert Systems with Applications (2015)

Research topics

Semantic analysis | Named entity recognition


Abstract: In this paper, we propose new features for Named Entity Recognition (NER) based on latent semantics. Furthermore, we explore the effect of unsupervised morphological information on these methods and on the NER system in general. The newly created NER system is fully language-independent thanks to the unsupervised nature of the proposed features. We evaluate the system on English, Spanish, Dutch and Czech corpora and study the difference between weakly and highly inflectional languages. Our system achieves the same or even better results than state-of-the-art language-dependent systems. The proposed features proved to be very useful and are the main reason for our promising results.

