CLARIN-LV

CLARIN (Common Language Resources and Technology Infrastructure) is a research infrastructure that was initiated from the vision that all digital language resources and tools from all over Europe and beyond are accessible through a single sign-on online environment for the support of researchers in the humanities and social sciences.

Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

71 to 80 of 98 Results

LVTB - Latvian Treebank v2.11 Nov 15, 2023 Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2022, "LVTB - Latvian Treebank v2.11", https://hdl.handle.net/20.500.12574/75, AiLab IMCS UL Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB). This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
LVTB - Latvian Treebank v2.10 Nov 15, 2023 Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2022, "LVTB - Latvian Treebank v2.10", https://hdl.handle.net/20.500.12574/63, AiLab IMCS UL Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB). This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Latvian Treebank v2.8 Nov 15, 2023 Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2021, "Latvian Treebank v2.8", https://hdl.handle.net/20.500.12574/55, AiLab IMCS UL Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB). This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Latvian Treebank v2.6 Nov 15, 2023 Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2020, "Latvian Treebank v2.6", https://hdl.handle.net/20.500.12574/54, AiLab IMCS UL Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB). This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Latvian Treebank v2.9 Nov 15, 2023 Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2021, "Latvian Treebank v2.9", https://hdl.handle.net/20.500.12574/56, AiLab IMCS UL This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Tēzaurs.lv 2023 (Summer Edition) Nov 14, 2023 Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Strankale, Laine; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2023, "Tēzaurs.lv 2023 (Summer Edition)", https://hdl.handle.net/20.500.12574/87, AiLab IMCS UL Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 393,500 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, semantic and other annotations, and it is integrated with the Latvian WordNet data. This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dictionary of Contemporary Latvian Language (MLVV) Jul 17, 2023 Jērāne, Santa; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Zuicena, Ieva; Pretkalniņa, Lauma, 2019, "Dictionary of Contemporary Latvian Language (MLVV)", https://hdl.handle.net/20.500.12574/57, Latvian Language Institute of the University of Latvia “Contemporary dictionary of Latvian language” (MLVV), which is developed by the UL Latvian Language institute, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and dictionaries... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Tēzaurs.lv 2022 Jul 17, 2023 Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Strankale, Laine, 2022, "Tēzaurs.lv 2022", https://hdl.handle.net/20.500.12574/66, AiLab IMCS UL Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains nearly 390,000 entries compiled from more than 330 sources. The dictionary is enriched with phonetic, morphological, semantic and other annotations, and it is integrated with the Latvian WordNet data. This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
LVMED: Dataset of Latvian text normalisation samples for the medical domain May 29, 2023 Lasmanis, Viesturs Jūlijs; Grūzītis, Normunds, 2023, "LVMED: Dataset of Latvian text normalisation samples for the medical domain", https://hdl.handle.net/20.500.12574/85, AiLab IMCS UL The CSV dataset contains sentence pairs for a text-to-text transformation task: given a sentence that contains 0..n abbreviations, rewrite (normalize) the sentence in full words (word forms). Training dataset: 64,665 sentence pairs Validation dataset: 7,185 sentence pairs. Testing dataset: 7,984 sentence pairs. All sentences are extracted from a pu... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
"Early dictionaries for Latvian, 1777-1910" Mar 8, 2023 Lange, J.; Stender, G. F.; Valdemārs, K.; Ulmann, C. C.; Dravnieks, J.; Brasche, G., 2023, ""Early dictionaries for Latvian, 1777-1910"", https://hdl.handle.net/20.500.12574/82, AiLab IMCS UL Collection of early Latvian dictionaries: Lange J. Vollständiges deutschlettisches und lettischdeutsches Lexicon, 1777 Stender G. F. Lettisches Lexikon, Mitau, 1789 Valdemārs Kr. Krievu - latviešu - vācu vārdnīca, Maskava, 1872 Ulmann C. C. Lettisch - deutsches Wörterbuch, Riga, 1872 Ulmann C. C., Brasche G. Deutch - lettisches Wörterbuch, Riga u.... This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.

LVTB - Latvian Treebank v2.11

Nov 15, 2023

Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2022, "LVTB - Latvian Treebank v2.11", https://hdl.handle.net/20.500.12574/75, AiLab IMCS UL

Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB).