General and Professional Education
2/2012 pp. 3-12



This paper addresses lexicon compilation in four modes: manual, computer-assisted, automated, and automatic. It considers the task of filling in the morphological zone of a lexical base. Using this task as an example, the paper presents the different modes of compiling the lexicon and assesses their suitability based on the speed and quality achieved. The automated mode is found to be the best fit for lexical-morphological knowledge acquisition. Compiling a lexicon in the different modes and assessing the results is proposed as a practical exercise for prospective linguists.


lexical knowledge acquisition, lexicon compilation, automatic mode, automated mode, computer-assisted mode, manual mode, practical studies

