CLARIN (Common Language Resources and Technology Infrastructure) is a research infrastructure that was initiated from the vision that all digital language resources and tools from all over Europe and beyond are accessible through a single sign-on online environment for the support of researchers in the humanities and social sciences.
Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

71 to 80 of 98 Results
Nov 15, 2023
Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2022, "LVTB - Latvian Treebank v2.11", https://hdl.handle.net/20.500.12574/75, AiLab IMCS UL
Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB).
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Nov 15, 2023
Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2022, "LVTB - Latvian Treebank v2.10", https://hdl.handle.net/20.500.12574/63, AiLab IMCS UL
Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB).
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Nov 15, 2023
Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2021, "Latvian Treebank v2.8", https://hdl.handle.net/20.500.12574/55, AiLab IMCS UL
Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB).
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Nov 15, 2023
Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2020, "Latvian Treebank v2.6", https://hdl.handle.net/20.500.12574/54, AiLab IMCS UL
Latvian Treebank (LVTB) is being developed since 2010. It is manually annotated according to a hybrid dependency-constituency grammar model. This version of LVTB contains data used for deriving the corresponding version of Latvian UD Treebank (UDLV-LVTB).
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Nov 15, 2023
Rituma, Laura; Pretkalniņa, Lauma; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Grūzītis, Normunds, 2021, "Latvian Treebank v2.9", https://hdl.handle.net/20.500.12574/56, AiLab IMCS UL
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Nov 14, 2023
Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Strankale, Laine; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2023, "Tēzaurs.lv 2023 (Summer Edition)", https://hdl.handle.net/20.500.12574/87, AiLab IMCS UL
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 393,500 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, semantic and other annotations, and it is integrated with the Latvian WordNet data.
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Jul 17, 2023
Jērāne, Santa; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Zuicena, Ieva; Pretkalniņa, Lauma, 2019, "Dictionary of Contemporary Latvian Language (MLVV)", https://hdl.handle.net/20.500.12574/57, Latvian Language Institute of the University of Latvia
“Contemporary dictionary of Latvian language” (MLVV), which is developed by the UL Latvian Language institute, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and dictionaries...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Jul 17, 2023
Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Strankale, Laine, 2022, "Tēzaurs.lv 2022", https://hdl.handle.net/20.500.12574/66, AiLab IMCS UL
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains nearly 390,000 entries compiled from more than 330 sources. The dictionary is enriched with phonetic, morphological, semantic and other annotations, and it is integrated with the Latvian WordNet data.
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
May 29, 2023
Lasmanis, Viesturs Jūlijs; Grūzītis, Normunds, 2023, "LVMED: Dataset of Latvian text normalisation samples for the medical domain", https://hdl.handle.net/20.500.12574/85, AiLab IMCS UL
The CSV dataset contains sentence pairs for a text-to-text transformation task: given a sentence that contains 0..n abbreviations, rewrite (normalize) the sentence in full words (word forms). Training dataset: 64,665 sentence pairs Validation dataset: 7,185 sentence pairs. Testing dataset: 7,984 sentence pairs. All sentences are extracted from a pu...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Mar 8, 2023
Lange, J.; Stender, G. F.; Valdemārs, K.; Ulmann, C. C.; Dravnieks, J.; Brasche, G., 2023, ""Early dictionaries for Latvian, 1777-1910"", https://hdl.handle.net/20.500.12574/82, AiLab IMCS UL
Collection of early Latvian dictionaries: Lange J. Vollständiges deutschlettisches und lettischdeutsches Lexicon, 1777 Stender G. F. Lettisches Lexikon, Mitau, 1789 Valdemārs Kr. Krievu - latviešu - vācu vārdnīca, Maskava, 1872 Ulmann C. C. Lettisch - deutsches Wörterbuch, Riga, 1872 Ulmann C. C., Brasche G. Deutch - lettisches Wörterbuch, Riga u....
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.