Jul 9, 2025
Darģis, Roberts; Auziņa, Ilze, 2023, "Ilvars - Latvian Male VITS Text-to-Speech Model (vers. 2023)", https://hdl.handle.net/20.500.12574/89, AiLab IMCS UL
A neural model for text-to-speech (TTS) synthesis in Latvian. Trained using VITS on a 25-hour speech corpus of audiobooks read in a male voice. Available for academic and non-commercial purposes via an API. To get access to the API, please send a request to info@ailab.lv. This dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
May 16, 2025
Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds; Znotiņš, Artūrs, 2024, "LVTB - Latvian Treebank v2.15 (2024-11-15)", https://hdl.handle.net/20.500.12574/112, AiLab IMCS UL
The Latvian Treebank (LVTB) has been under development since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains the data used to derive the corresponding version of the Latvian UD Treebank (UDLV-LVTB).
Apr 22, 2025
Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs, 2024, "Dictionary of Contemporary Latvian Language (MLVV) (2024-12-21)", https://hdl.handle.net/20.500.12574/120, Latvian Language Institute of the University of Latvia
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as on the last decade’s encyclopaedias and...
Apr 22, 2025
Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2024, "Tēzaurs.lv 2025 (Winter Edition)", https://hdl.handle.net/20.500.12574/119, AiLab IMCS UL
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 405,000 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is availab...
Apr 22, 2025
Ceplītis, Laimdots; Spektors, Andrejs, 2024, "Dictionary of Latvian Literary Language (LLVV) (2024-02)", https://hdl.handle.net/20.500.12574/100, AiLab IMCS UL
In the 20th century, the UL Latvian Language Institute (formerly the Language and Literature Institute of the Academy of Sciences) produced the largest lexicographic source of the Latvian language, which was digitized (2001–2022) by the UL Institute of Mathematics and Computer Sciences. The dictionary contains words of standard Latvian used since 19th cen...
Mar 27, 2025
Baklāne, Anda; Saulespurēns, Valdis; Ozols, Artis; Krasovska, Marlēna; Vēveris, Viesturs; Eglāja-Kristsone, Eva; Rožkalne, Anita; Skaistkalne, Evija, 2025, "Corpus of Latvian Early Novels (2025-03-11)", https://hdl.handle.net/20.500.12574/125, National Library of Latvia
Corpus of Latvian novels first published before 1940.
Mar 26, 2025
Baklāne, Anda; Saulespurēns, Valdis; Ozols, Artis; Krasovska, Marlēna, 2021, "Corpus of Latvian Early Novels", https://hdl.handle.net/20.500.12574/78, National Library of Latvia
Corpus of Latvian novels first published before 1940.
Mar 26, 2025
Bethere, Dina; Barone, Lelde; Immure, Inese; Intsone, Agija; Liniņa, Ilona; Ozola, Elza; Romanovska, Agnese; Straupeniece, Daiga; Darģis, Roberts, 2025, "Latvian Sign Language Corpus", https://hdl.handle.net/20.500.12574/121, RTU Liepaja
The corpus contains video news produced by the Latvian Deaf Union and news from Latvian public media with sign language interpretation. Video recordings of Latvian sign language utterances are segmented and arranged in three levels: SIGN, CONCEPT and SENTENCE. The corpus comprises 12,500 signs over 150 minutes. Data is browsable with ELAN software.
Mar 26, 2025
Auziņa, Ilze; Darģis, Roberts; Levāne-Petrova, Kristīne; Auziņa, Arta; Saulīte, Baiba; Ļaksa-Timinska, Ilze; Gailīte, Elīna; Nešpore-Bērzkalne, Gunta; Rābante-Buša, Guna; Pokratniece, Kristīne; Klints, Agute, 2024, "LATE Media Speech Corpus V1 (LATE-mediji)", https://hdl.handle.net/20.500.12574/114, AiLab IMCS UL
The corpus contains audio recordings of media broadcasts and their transcripts in orthographic transcription. The data are transcribed in the orthography of Standard Latvian, observing also the principles of punctuation.
Mar 26, 2025
Auziņa, Ilze; Rābante-Buša, Guna; Darģis, Roberts, 2024, "LATE Phonetically Annotated Speech Corpus V1 (fonLATE)", https://hdl.handle.net/20.500.12574/115, AiLab IMCS UL
A small subset of phonetically annotated data has been derived from the LATE-sarunas and LATE-media corpora. The phonetic annotation is available at two levels: (1) the dictionary or standard pronunciation of a word or segment, regardless of how the particular speaker actually pronounced it, and (2) the actual pronunciation of a word or segment.
