CLARIN-LV

CLARIN (Common Language Resources and Technology Infrastructure) is a research infrastructure that was initiated from the vision that all digital language resources and tools from all over Europe and beyond are accessible through a single sign-on online environment for the support of researchers in the humanities and social sciences.

Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

1 to 10 of 124 Results

Dictionary of Contemporary Latvian Language (MLVV) (2026-04-08) Jun 26, 2026 Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs, 2026, "Dictionary of Contemporary Latvian Language (MLVV) (2026-04-08)", https://hdl.handle.net/20.500.12574/157, Latvian Language Institute, Faculty of Humanities, University of Latvia “Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dictionary of Contemporary Latvian Language (MLVV) (2026-06-21) Jun 26, 2026 Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs, 2026, "Dictionary of Contemporary Latvian Language (MLVV) (2026-06-21)", https://hdl.handle.net/20.500.12574/161, Latvian Language Institute, Faculty of Humanities, University of Latvia “Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Tēzaurs.lv 2026 (Spring Edition) Jun 26, 2026 Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2026, "Tēzaurs.lv 2026 (Spring Edition)", https://hdl.handle.net/20.500.12574/156, AiLab IMCS UL Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is availab... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Tēzaurs.lv 2026 (Summer Edition) Jun 26, 2026 Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2026, "Tēzaurs.lv 2026 (Summer Edition)", https://hdl.handle.net/20.500.12574/160, AiLab IMCS UL Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is availab... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
LVTB - Latvian Treebank v2.18 May 28, 2026 Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds; Znotiņš, Artūrs, 2026, "LVTB - Latvian Treebank v2.18", https://hdl.handle.net/20.500.12574/159, AiLab IMCS UL Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB). This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
LVTB - Latvian Treebank v2.17 May 28, 2026 Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds; Znotiņš, Artūrs, 2025, "LVTB - Latvian Treebank v2.17", https://hdl.handle.net/20.500.12574/142, AiLab IMCS UL Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB). This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities May 12, 2026 Štekeļs, Jorens, 2026, "ConLoan-LV: A Contrastive Dataset for Latvian Language Loanwords, Code-switching, and Named Entities", https://hdl.handle.net/20.500.12574/158, University of Latvia ConLoan-LV is a multi-purpose contrastive dataset designed for the classification and analysis of Latvian language loanwords, code-switching, and named entities. Replicating and extending the ConLoan methodology, the dataset contains 353 manually validated sentences in the baseline version and 676 in the extended version, with all sentences sourced... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dictionary of Contemporary Latvian Language (MLVV) (2025-12-21) Apr 20, 2026 Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs, 2025, "Dictionary of Contemporary Latvian Language (MLVV) (2025-12-21)", https://hdl.handle.net/20.500.12574/150, Latvian Language Institute, Faculty of Humanities, University of Latvia “Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Tēzaurs.lv 2026 (Winter Edition) Apr 20, 2026 Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2025, "Tēzaurs.lv 2026 (Winter Edition)", https://hdl.handle.net/20.500.12574/151, AiLab IMCS UL Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 410,000 entries based on 350 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and integrated with the Latvian WordNet data. This dataset is availab... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
The Index of Aggressive Communication in Internet Portal Comments Apr 8, 2026 Darģis, Roberts, 2022, "The Index of Aggressive Communication in Internet Portal Comments", https://hdl.handle.net/20.500.12574/45, AiLab IMCS UL A corpus containing comments from Internet news sites tvnet.lv, delfi.lv, apollo.lv. The specialized corpus platform and its tools are designed to study aggression in the comments of news portals. The toolkit allows to identify news items from Internet portals that are most aggressively commented, as well as to study aggressive communication trends... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.

Dictionary of Contemporary Latvian Language (MLVV) (2026-04-08)

Jun 26, 2026

Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs, 2026, "Dictionary of Contemporary Latvian Language (MLVV) (2026-04-08)", https://hdl.handle.net/20.500.12574/157, Latvian Language Institute, Faculty of Humanities, University of Latvia

“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language Institute of the Faculty of Humanities at the University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on...