Publications

Scientific publications

Крижановский А.А., Крижановская Н.Б., Родионова А.П.
Архитектура корпусного менеджера и разметка текстов корпуса ВепКар
// Электронная письменность народов Российской Федерации: опыт, проблемы и перспективы. Материалы межд. науч. конференции (Уфа, 27-29 ноября 2019 г.). 2019. C. 19-23
Keywords: Vepsian language, Karelian language, corpus linguistics, word-sense disambiguation, text tagging
The Open corpus of Veps and Karelian languages (VepKar) is useful both for linguists and dialectologists, since dialect texts are given with a detailed description: informant's name, year and place of his birth, place of recording, dialect, etc. The article discusses the features of the smallest Ludian subcorpus. The architecture of the developed corpus manager dictorpus (used in the VepKar) is presented in the article. Semantic and morphological tags links texts with VepKar dictionary entries. Semantic categorization of the meanings of lemmas is planned to be based on categories from the comparative onomasiological dictionary. As part of future work, the morphological analyzer will be developed on the basis of finite state machines.
Indexed at RSCI, Google Scholar

Preprint (544 Kb, total downloads: 108)

Last modified: August 24, 2021