User’s Guide to INEL Enets Corpus
Keywords:
Enets languages, Tundra Enets, Forest Enets, corpus linguistics, INEL, Samoyedic languages
Synopsis
The present paper documents the structure and scope of the INEL Enets corpus and can serve as a user guide. It provides the details of the metadata and describes the layout of the annotation tiers as well as the annotation schemes used in the corpus.
The present corpus of the Enets language(s) has been developed as part of the long-term INEL research project (“Grammatical Descriptions, Corpora and Language Technology for Indigenous Northern Eurasian Languages”). It is based on published and unpublished data from both Enets lects and aims to bring together most of the transcribed Enets texts available to date. While a large number of Forest Enets texts are not yet included in the current version of the corpus, for Tundra Enets only a handful of texts are missing. The corpus enables typologically oriented, corpus-based research on Enets and expands the documentation of the lesser-described indigenous languages of Northern Eurasia.
