21 to 30 of 116 Results
Nov 20, 2025
Andronova, Everita; Baltiņa, Maija; Frīdenberga, Anna; Grūzītis, Normunds; Ķauķīte, Sintija; Pokratniece, Kristīne; Pretkalniņa, Lauma; Siliņa-Piņķe, Renāte; Skrūzmane, Elga; Spektors, Andrejs; Spektors, Mārtiņš; Štrausa, Ilze; Trumpa, Anta; Trumpa, Edmunds; Vanags, Pēteris, 2025, "The Corpus of Early Written Latvian (2025)", https://hdl.handle.net/20.500.12574/141, AiLab IMCS UL
The Corpus of early written Latvian 'SENIE' provides access to the texts and facsimiles of written Latvian of the 16th–18th century. Its aim is to facilitate studies of early Latvian in general and to serve as the basis for 'The Historical dictionary of Latvian (16th–17th cc.)'. Corpus serves as a unique digital repository of early Latvian texts, w...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 6, 2025
Pretkalniņa, Lauma; Andronova, Everita; Frīdenberga, Anna; Skrūzmane, Elga; Siliņa-Piņķe, Renāte; Trumpa, Anta; Vanags, Pēteris, 2025, "Spelling normalization tool for Latvian 18th century texts", https://hdl.handle.net/20.500.12574/140, AiLab IMCS UL
The spelling normalization tool (pilot converter) is meant for converting any 18th century Latvian Unicode-encoded text into a more modern spelling. This version of the tool takes care of normalizing the roots of the words, thus, it is meant for for facillitating user-friendly corpora search in tools like Sketch Engine. The tool consists of 134 uni...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 5, 2025
Neimane, Liene Krista, 2025, "Latvian Sign Language Landmark Corpus", https://hdl.handle.net/20.500.12574/139, Liene Krista Neimane
The corpus contains MediaPipe-extracted landmark data representing 45 Latvian Sign Language signs. It includes 33 alphabet letters (a-z), 11 numbers (0-10), and a pause, which were captured from videos featuring several different signers. The collection covers both isolated signs and sign combinations forming complete words or short sentences. For...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Oct 6, 2025
Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs, 2025, "Dictionary of Contemporary Latvian Language (MLVV) (2025-06-21)", https://hdl.handle.net/20.500.12574/133, Latvian Language Institute of the University of Latvia
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language institute of University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Oct 6, 2025
Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2025, "Tēzaurs.lv 2025 (Summer Edition)", https://hdl.handle.net/20.500.12574/132, AiLab IMCS UL
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 405,000 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is availab...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Sep 16, 2025
Deksne, Daiga, 2025, "Embedding Model Fine-Tuning Dataset", https://hdl.handle.net/20.500.12574/136, University of Latvia
Dataset for Embedding Model Fine-Tuning has been created within the framework of the National Research Program project "Analysis of the applicability of artificial intelligence methods in the field of EU fund projects". For the purposes of this project, we fine-tuned the bge-m3 model developed by BAAI (Chen et al., 2024). For fine-tuning, we collec...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Sep 16, 2025
Deksne, Daiga; Skadiņš, Raivis; Hohbergs, Andris; Jaunzars, Rūdolfs; Petrovs, Andrejs; Rūdule, Justīne; Pinnis, Mārcis, 2025, "Procurement Validation Dataset", https://hdl.handle.net/20.500.12574/135, University of Latvia
The Procurement Validation Dataset was created within the framework of the State Research Programme project "Analysis of the Applicability of Artificial Intelligence Methods in the Field of European Union Fund Projects". The dataset consists of 30 procurement documents evaluated by CFCA experts. The procurement checklists prepared by the experts ha...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Jul 31, 2025
Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs, 2025, "Dictionary of Contemporary Latvian Language (MLVV) (2025-03-20)", https://hdl.handle.net/20.500.12574/128, Latvian Language Institute of the University of Latvia
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language institute of University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Jul 31, 2025
Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2025, "Tēzaurs.lv 2025 (Spring Edition)", https://hdl.handle.net/20.500.12574/127, AiLab IMCS UL
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 405,000 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is availab...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Jul 21, 2025
Pauniņš, Artis, 2025, "LV portāls e-consultations (2020-2024)", https://hdl.handle.net/20.500.12574/131, University of Latvia
This dataset contains articles from e-consultations about the legislation of the Republic of Latvia. The articles are stored in JSON files that contain the HTML of questions and answers as well as other metadata, such as source URL, title and authors to get citations. The citations to all the articles are available here: https://html-preview.github...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
