The database is composed of 156 texts. These texts contain about 225 000 tokens/words (110 000 recorded data and 115 000 written texts).
All the texts have already been edited in FLEx and the community members are currently correcting the first version of the dictionary. The dictionary contains more than 12 000 entries.
143 texts have been inserted into the lexical database up to February 29. These texts contain about 207 000 tokens/words (94 000 recorded data and 113 000 written texts).
The text are gradually edited in FLEx. The lexicon section contains currently 8 000 entries.
139 texts have been inserted into the lexical database up to October 15. These texts contain about 201 000 tokens/words (88 000 recorded data and 113 000 written texts).
The text are gradually edited in FLEx. The lexicon section contains currently 4 400 entries.
Up to now (June 30, 2019) 130 texts have been inserted into the lexical database. These texts contain about
175 000 tokens/words (80 000 recorded data and 95 000 written texts).
The texts section of the FLEx database is now gradually edited, text by text, and the individual words are inserted into Lexicon section with information about grammar category, translation into Spanish, examples, variety information, usage and others. The lexicon section contains currently 1 800 entries.
Up to now (May 1, 2019) the lexical database contains 110 texts with about 148 000 tokens (70 000 recorded data, 78 000 written texts).
The recordings are made with community members from the three villages. The participants range from 20 to 90 years of age. In some of the recordings younger speakers have the role of interviewers, asking questions to their family members or neighbours.
Transcriptions in the programme ELAN
The transcription in ELAN is not phonological but simply orthographical, however the orthography used in the transcriptions represents all the phonemes of A Fala in a distinctive way, so that there is no doubt about the original pronunciation of individual words. The transcriptions are as detailed as possible. In the transcription process they are double-checked and they are also consulted with the speakers themselves to clarify possible douts and to explain difficult and unusual words.
Consulting transcribed interview with the participant
The orthography was originally established by the Ascociación Cultural a Nosa Fala in 2017. It is not an “oficial” orthography as there is no institution that would be in charge to do that. Nevertheless the orthography was discussed with the community members from the three villages and the result is a compromise that takes into accont many different opinions and aspects (practical use, tradition, etc.). This orthography is also accepted by the Asociación Cultural U Lagartu Verdi.
Editing the annotations in the programme FLEx
Once transcribed the recordings, they are exported to FLEx (Fieldworks Language Explorer) using the ELAN – FLEx – ELAN workflow. In FLEx individual words/tokens are editted adscribing each Headword/Lexem various categories: translation to Spanish, grammatical information, dialect label (as all three varieties are carefully marked), definition and exaples if necessary and other descriptions depending on individual words. The FLEx database also enables to link images, audio and video files. All these features are exploited to make the database the most complex possible.
Dictionary editing in FLEx
The final editing will inted to give the dictionary user-friendly and attractive format. It is expected that the first edition of the dictionary will contain at least 12 000 entries.