
171 to 180 of 269 Results
Mar 26, 2025 - CLARIN-LV
Auziņa, Ilze; Darģis, Roberts; Levāne-Petrova, Kristīne; Auziņa, Arta; Saulīte, Baiba; Ļaksa-Timinska, Ilze; Gailīte, Elīna; Nešpore-Bērzkalne, Gunta; Rābante-Buša, Guna; Pokratniece, Kristīne; Klints, Agute, 2024, "LATE Media Speech Corpus V1 (LATE-mediji)", https://hdl.handle.net/20.500.12574/114, AiLab IMCS UL
The corpus contains audio recordings of media broadcasts and their transcripts in orthographic transcription. The data are transcribed in the orthography of Standard Latvian, observing also the principles of punctuation.
This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data.
Mar 26, 2025 - CLARIN-LV
Auziņa, Ilze; Rābante-Buša, Guna; Darģis, Roberts, 2024, "LATE Phonetically Annotated Speech Corpus V1 (fonLATE)", https://hdl.handle.net/20.500.12574/115, AiLab IMCS UL
A small subset of phonetically annotated data has been derived from the LATE-sarunas and LATE-media. The phonetic annotation is available at two levels: (1) the dictionary or standard pronunciation of a word or segment, regardless of its actual pronunciation made by the particular speaker, and (2) the actual pronunciation of a word or segment.
Mar 26, 2025 - CLARIN-LV
Znotiņš, Artūrs; Auziņa, Ilze; Saulīte, Baiba; Darģis, Roberts; Grūzītis, Normunds, 2024, "LVMED: Test Set for Latvian ASR in the Radiology Domain", https://hdl.handle.net/20.500.12574/117, AiLab IMCS UL
A Latvian speech corpus for testing and comparing ASR models in the radiology domain. It consists of authentic dictations of CT, XR, MR, MG, and US examination reports.
Mar 26, 2025 - CLARIN-LV
Darģis, Roberts; Auziņa, Ilze, 2022, "Ilvars - Latvian Male VITS Text-to-Speech Model (vers. 2022)", https://hdl.handle.net/20.500.12574/71, AiLab IMCS UL
A neural model for text-to-speech synthesis in Latvian. Trained using VITS on a 20-hour speech corpus of audiobooks read in a male voice. Currently released for research purposes only.
Mar 26, 2025 - CLARIN-LV
Rābante-Buša, Guna; Grūzītis, Normunds; Bārzdiņš, Guntis; Mendes, Afonso, 2022, "SELMA Latvian NER Dataset", https://hdl.handle.net/20.500.12574/98, AiLab IMCS UL
A dataset of hierarchically annotated named entities in Latvian news articles (provided by the Latvian Information Agency LETA) for the development and evaluation of transition-based parsers for named entity recognition (NER).
Mar 26, 2025 - CLARIN-LV
Darģis, Roberts; Znotiņš, Artūrs; Auziņa, Ilze; Rābante-Buša, Guna, 2024, "LATE Dev&Test Set V1 for Latvian ASR", https://hdl.handle.net/20.500.12574/99, AiLab IMCS UL
A Latvian speech corpus for the development (validation), testing, and comparison of ASR models. The audio data is segmented and aligned with the corresponding orthographic transcriptions, which are human-verified. The LATE-media subset contains both verbatim (raw) and formatted transcriptions (with punctuation, capitalisation, numbers, abbreviations...
Mar 26, 2025 - CLARIN-LV
Nešpore, Gunta; Rituma, Laura, 2023, "Word sense annotated "The Little Prince" fragments in Latvian 1.0", https://hdl.handle.net/20.500.12574/80, AiLab IMCS UL
Annotation of word senses for a running text corpus of 1200 tokens (beginning of The Little Prince by Antoine de Saint-Exupéry) as an evaluation corpus for Latvian WSD systems. Data is provided in a tab-separated format similar to CoNLL, indexing senses to the Tēzaurs.lv word sense IDs as of Tēzaurs.lv 2022 (http://hdl.handle.net/20.500.12574/66) d...
Mar 26, 2025 - CLARIN-LV
Laizāns, Mārtiņš; Pretkalniņa, Lauma, 2015, "Latvian Blog Corpus 2015", https://hdl.handle.net/20.500.12574/79, AiLab IMCS UL
Automatically harvested Latvian blog corpus.
Feb 5, 2025 - Latvian Academy of Culture
Ance Kristāla, 2025, "FLPP GigWork DPP", https://doi.org/10.5281/ZENODO.14810827, Zenodo
Over the past twenty years, digital platforms (IT companies that build and maintain digital infrastructures for human interaction and the provision of various services) have become an integral part of the economy and everyday life worldwide. The impact of these companies is not only economic but also socio-structural. The study “Digitālajās platformās nodarb...
Harvested from the Latvian Academy of Culture community on Zenodo.
Jan 30, 2025 - Latvian Academy of Culture
Ance Kristāla, 2025, "FLPP GigWork DPP", https://doi.org/10.5281/ZENODO.14729247, Zenodo
Only a decade old, platforms like Bolt and Wolt have become convenient intermediaries between service providers and those seeking a service. The convenience and low prices of platform services often make one forget the work behind them. The project seeks to examine gig-work, to deepen understanding of how it is practiced, perceiv...
Harvested from the Latvian Academy of Culture community on Zenodo.