Metrics
280 Downloads
Featured Dataverses

In order to use this feature you must have at least one published or linked dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

791 to 800 of 862 Results
Jan 16, 2025 - CLARIN-LV
Zuicena, Ieva; Auziņa, Ieva; Briede, Santa; Jansone, Irēna Ilga; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Rapa, Sanda; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Timuška, Agris; Grasmanis, Mikus; Pretkalniņa, Lauma; Znotiņš, Artūrs, 2024, "Dictionary of Contemporary Latvian Language (MLVV) (2024-09-22)", https://hdl.handle.net/20.500.12574/109, Latvian Language Institute of the University of Latvia
“Contemporary dictionary of Latvian language” (MLVV), developed by the Latvian Language institute of University of Latvia, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Jan 16, 2025 - CLARIN-LV
Spektors, Andrejs; Pretkalniņa, Lauma; Grūzītis, Normunds; Paikens, Pēteris; Rituma, Laura; Saulīte, Baiba; Nešpore-Bērzkalne, Gunta; Lokmane, Ilze; Klints, Agute; Stāde, Madara; Grasmanis, Mikus; Auziņa, Ilze; Znotiņš, Artūrs; Darģis, Roberts; Bārzdiņš, Guntis, 2024, "Tēzaurs.lv 2024 (Autumn Edition)", https://hdl.handle.net/20.500.12574/110, AiLab IMCS UL
Tezaurs.lv is the largest open machine-readable dictionary for Latvian. This version contains more than 405,000 entries based on 345 sources. The dictionary is enriched with phonetic, morphological, derivational, semantic and other annotations, inflection tables, corpus examples, and it is integrated with the Latvian WordNet data. This dataset is a...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Jan 7, 2025 - CLARIN-LV
Martena, Sanita; Nau, Nicole; Kļavinska, Antra; Juško-Štekele, Angelika; Kociņš-Kūceņš, Armands; Sprukte, Ausma; Briška, Anna; Gusāns, Ingars; Mazure, Laura, 2024, "Corpus of Contemporary Latgalian Speech", https://hdl.handle.net/20.500.12574/105, Rēzekne Academy of Technologies
The corpus consists of audio recordings and their transcripts. It documents natural, spontaneous speech, including field research recordings, interviews, TV and radio broadcasts.
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dec 20, 2024 - CLARIN-LV
Kļavinska, Antra; Martena, Sanita; Nau, Nicole; Šuplinska, Ilga; Anna, Briška, 2024, "Latgalian Tezaurs 2025 (Winter Edition)", https://hdl.handle.net/20.500.12574/116, Rēzekne Academy of Technologies
Latgalian Tezaurs (LTG T) is a lexical database and online dictionary of Latgalian (ISO 639-3 ltg). The pilot version of December 2024 contains more than 450 entries, including many idioms and other multi-word units. Entries include spelling variants and dialect forms and name the sources where the lexical unit has been documented. Audio recordings...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dec 14, 2024 - CLARIN-LV
Baklāne, Anda; Saulespurēns, Valdis; Ozols, Artis, 2022, ""Karogs" corpus", https://hdl.handle.net/20.500.12574/83, National Library of Latvia
Corpus contains texts of the magazine "Karogs" from 1940 to 1994.
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dec 14, 2024 - CLARIN-LV
Darģis, Roberts, 2022, "Corpus of Latvian PhD Theses (Disertācijas)", https://hdl.handle.net/20.500.12574/93, AiLab IMCS UL
The corpus consists of PhD theses and summaries published in the University of Latvia, Riga Technical University, Riga Stradins University and Liepaja University until 2020.
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dec 14, 2024 - CLARIN-LV
Levāne-Petrova, Kristīne; Darģis, Roberts; Pokratniece, Kristīne; Lasmanis, Viesturs Jūlijs, 2023, "Balanced Corpus of Modern Latvian (LVK2022)", https://hdl.handle.net/20.500.12574/84, AiLab IMCS UL
The Balanced Corpus of Modern Latvian, which contains unique texts not yet included in other so far developed balanced corpora (LVK2013 and LVK2018). The corpus is primarily based on the design principles of previous balanced corpora. It contains authentic contemporary texts (mostly created after 2000) of various genres with metadata. Unlike its pr...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dec 14, 2024 - CLARIN-LV
Levāne-Petrova, Kristīne; Darģis, Roberts, 2018, "Balanced Corpus of Modern Latvian (LVK2018)", https://hdl.handle.net/20.500.12574/11, AiLab IMCS UL
LVK2018 is a balanced and representative 10 million word text corpus of modern Latvian. It represents five different genres: journalism (60%), fiction (20%), scientific (10%), legal (8%), transcriptions (2%). LVK2018 is an extended version of LVK2013.
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dec 14, 2024 - CLARIN-LV
Levāne-Petrova, Kristīne; Pokratniece, Kristīne; Vēvere, Daira; Poikāns, Ilmārs; Andronova, Everita, 2013, "Balanced Corpus of Modern Latvian (LVK2013) Līdzsvarots latviešu valodas korpuss (LVK2013)", https://hdl.handle.net/20.500.12574/44, AiLab IMCS UL
LVK2013 is the 4.5 million representative corpus of contemporary Latvian. LVK2013 is designed as a general language, representative and balanced corpus that aims to cover the variety of existing texts in some estimated proportions. The corpus contains six different sections: journalism (55%), fiction (20%), scientific (10%), legal (8%), other texts...
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Dec 14, 2024 - CLARIN-LV
Spektors, Andrejs; Grūzītis, Normunds; Darģis, Roberts; Auziņa, Ilze; Saulīte, Baiba; Levāne-Petrova, Kristīne, 2018, "Rainis", https://hdl.handle.net/20.500.12574/41, AiLab IMCS UL
This specialised text corpus contains all of Rainis work: plays, poetry, prose, journalism, translations, letters,etc.
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.