201 to 210 of 240 Results
Dec 29, 2023 - CLARIN-LV
Levāne-Petrova, Kristīne; Pokratniece, Kristīne; Darģis, Roberts, 2021, "Corpus of Students' Essays", https://hdl.handle.net/20.500.12574/51, AiLab IMCS UL
A specialized corpus containing 468 students' essays for the 12th grade Latvian language exam.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 27, 2023 - CLARIN-LV
Reinsone, Sanita; Ļaksa-Timinska, Ilze; Jaudzema, Justīne, 2021, "Corpus of Latvian Pandemic Diaries 2020–2021", https://hdl.handle.net/20.500.12574/48, Institute of Literature, Folklore and Art of the University of Latvia
The Archives of Latvian Folklore invited anyone document their life during pandemic and contribute to the collection "Diaries in the Time of Pandemic 2020-2021". The corpora consists of diary entries. Each file = 1 author. Dates of entries are marked in a format dd/mm/yyyy.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 27, 2023 - CLARIN-LV
Kārkla, Zita; Matulis, Haralds, 2022, "Corpus of Latvian Women Writers’ Short Fiction", https://hdl.handle.net/20.500.12574/69, Institute of Literature, Folklore and Art of the University of Latvia
The corpus consists of short fiction by Latvian women writers published from 1893 to 2002.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 27, 2023 - CLARIN-LV
Sperga, Ilze; Pokratniece, Kristīne; Briška, Anna, 2013, "Latgalian Corpus (MuLa)", https://hdl.handle.net/20.500.12574/8, AiLab IMCS UL
The Special Latgalian Corpus (MuLa) is formed from the special written texts types from the time of national awakening (1987-1989) until 2013. The corpus includes three types of texts : literary texts, technical texts, and information texts. The textual sources selected in defined proportions, based on the chronological principle and text genres th...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 27, 2023 - CLARIN-LV
Auziņa, Ilze; Saulīte, Baiba; Akmane, Agate; Millere, Elīna; Naļivaiko, Inga; Stepanovs, Kaspars; Darģis, Roberts; Grūzītis, Normunds, 2021, "LVMED: Latvian Speech Transcripts of the Medical Domain", https://hdl.handle.net/20.500.12574/67, AiLab IMCS UL
A text corpus of orthographic transcription of a Latvian medical speech corpus. It consists of 900 transcripts (documents) of a ~35 hour radiology speech corpus. Modalities covered: CT, MR, MG, CR, US.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 21, 2023 - CLARIN-LV
Darģis, Roberts, 2022, "Latvian Wikipedia", https://hdl.handle.net/20.500.12574/64, AiLab IMCS UL
The corpus consists of all information published on Latvian Wikipedia until February 2022.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 21, 2023 - CLARIN-LV
Utka, Andrius; Levāne-Petrova, Kristīne; Vēvere, Daira; Rābante-Buša, Guna; Kovalevskaitė, Jolanta; Rimkutė, Erika, 2013, "Lithuanian-Latvian-Lithuanian Parallel Corpus (LILA)", https://hdl.handle.net/20.500.12574/6, Vytautas Magnus University
The Latvian-Lithuanian parallel corpus LILA represents the language of the Independence period (starts in 1990), includes fictional and non-fictional texts, periodicals and documents.The corpus contains 8 million tokens and is aligned at sentence level.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 21, 2023 - CLARIN-LV
Džeriņš, Jānis; Džonsons, Kristaps, 2007, "Latvian Web Corpus 2007", https://hdl.handle.net/20.500.12574/46, AiLab IMCS UL
The Latvian Web Corpus 2007 contains 700,000 Latvian webpages published before 2005. The corpus is automatically annotated. Repetitions are not included.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 21, 2023 - CLARIN-LV
Auziņa, Ilze; Darģis, Roberts; Bojārs, Uldis; Paikens, Pēteris; Znotiņš, Artūrs; Rābante-Buša, Guna, 2018, "Corpus of the Saeima (the Parliament of Latvia)", https://hdl.handle.net/20.500.12574/50, AiLab IMCS UL
The Corpus of the Saeima contains information about parliamentary debates from seven parliamentary terms (5th–12th Saeima) covering years 1993–2017. The available metadata for each utterance includes the date and type of the parliamentary session and speakers’ names and affiliations.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 22, 2023 - CLARIN-LV
Jērāne, Santa; Kuplā, Ieva; Lejniece, Gunta; Migla, Ilga; Oldere, Laimdota; Ozola, Ārija; Požarnova, Vija; Roze, Anitra; Šmidebergs, Imants; Šnē, Dorisa; Šnē, Māra; Zuicena, Ieva; Pretkalniņa, Lauma; Auziņa, Ieva; Briede, Santa; Šmidebergs, Imants; Timuška, Agris, 2023, "Dictionary of Contemporary Latvian Language (MLVV) ( 2023-07-07 )", https://hdl.handle.net/20.500.12574/88, Latvian Language Institute of the University of Latvia
“Contemporary dictionary of Latvian language” (MLVV), which is developed by the UL Latvian Language institute, is a new explanatory dictionary based on Latvian language materials obtained during the last decade. The analysis of the word stock is based on MLVV card files, internet sources, as well as, on last decade’s encyclopaedias and dictionaries...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |