81 to 90 of 98 Results
Jan 11, 2023
Znotiņš, Artūrs; Paikens, Pēteris; Goško, Didzis; Bārzdiņš, Guntis; Grūzītis, Normunds, 2018, "NLP-PIPE: Latvian NLP Tool Pipeline", https://hdl.handle.net/20.500.12574/4, AiLab IMCS UL
NLP-PIPE is a modular toolchain that allows researchers to combine multiple natural language processing tools in a unified framework. It supports a wide range of annotation services for Latvian, including tokenization, morphological tagging, lemmatisation, universal dependency parsing, and named entity recognition. In the web based interface, a use...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Dec 20, 2022
Pinnis, Mārcis; Pole, Megija; Kapočiūtė-Dzikienė, Jurgita; Nicmanis, Dāvis; Salimbajevs, Askars; Skadiņš, Raivis; Miķelsons, Mārtiņš; Lizanders, Kristaps; Bērziņš, Aivars; Vasiļevskis, Artūrs; Rozis, Roberts; Kornikaite, Nida, 2022, "Evaluation and development data sets for speech translation for meetings", https://hdl.handle.net/20.500.12574/74, Tilde
The evaluation and development data sets for speech translation for meetings were created within the microproject "Multi-layer evaluation sets for speech translation of web-based meetings" of the project "HumanE AI Network". The data sets feature recordings of various public domain (public administration organised, publicly disseminated) meetings i...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 28, 2022
Znotiņš, Artūrs; Darģis, Roberts; Grūzītis, Normunds; Bārzdiņš, Guntis; Goško, Didzis, 2022, "RUTA:MED – Dual Workflow Medical Speech Transcription Pipeline and Editor", https://hdl.handle.net/20.500.12574/70, AiLab IMCS UL
In the medical domain, various approaches are used to produce examination reports and other medical records. Depending on the language-specific technology support, the type of examination, the size of the hospital or clinic, and other aspects, the reporting workflow can range from completely manual to (semi-)automated. A manual workflow may complet...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 23, 2022
Ernštreits, Valts; Muzikante, Marika; Grīnberga, Maima, 2015, "Estonian-Latvian dictionary", https://hdl.handle.net/20.500.12574/59, Latvian Language Agency
This dictionary was created totally anew and today serves as an up-to-date and richly illustrated assistant to translators from the Estonian language and to learners of both the languages. Dictionary was created by using samples from contemporary language, as well as the corpus of Estonian language currently under construction by the Tartu Universi...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 23, 2022
---, ---, 2017, "Language consultations: electronic database", https://hdl.handle.net/20.500.12574/62, Latvian Language Agency
The database „Language consultations: an electronic database” gives the opportunity to search for both questions and sources by typing the query in the searching field, or also to choose from the searching options either only questions, or only authors. The user could also choose a relevant filter near the menu „Meklēt” („Search”). On the first pag...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 23, 2022
Andronova, Everita; Frīdenberga, Anna; Siliņa-Piņķe, Renāte; Trumpa, Anta; Vanags, Pēteris, 2016, "Historical Dictionary of Latvian", https://hdl.handle.net/20.500.12574/3, Latvian Language Institute of the University of Latvia
Interactive electronic dictionary of the 16th and 17th century (around 1200 entries) with a reference to the “Historical Corpus of the Latvian Language”. The dictionary can be searched by the meaning, by the specific donor language in the etymology, and by other features.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 23, 2022
Grūzītis, Normunds; Pretkalniņa, Lauma; Saulīte, Baiba; Rituma, Laura; Nešpore-Bērzkalne, Gunta; Paikens, Pēteris; Auziņa, Ilze; Znotiņš, Artūrs; Levāne-Petrova, Kristīne; Darģis, Roberts, 2019, "Full Stack of Latvian Language Resources for NLU", https://hdl.handle.net/20.500.12574/5, AiLab IMCS UL
This repository contains a multilayer text corpus of Latvian. The multilayer corpus is anchored in cross-lingual state-of-the-art representations: Universal Dependencies (UD), FrameNet, PropBank and Abstract Meaning Representation (AMR).This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 23, 2022
Leikuma, Lidija; Bernāne, Līga; Cibuļs, Juris; Butkus, Alvydas; Butkienė, Violeta; Vaisvalavičienė, Kristina; Sperga, Ilze, 2013, "The Lithuanian-Latvian-Latgalian Dictionary", https://hdl.handle.net/20.500.12574/52, Rēzekne Academy of Technologies
"The Lithuanian-Latvian-Latgalian Dictionary" (hereinafter — "the LLL dictionary") has been compiled on the basis of "Lthe Lithuanian Language Written Sources Frequency Dictionary" ("Dažninis rašytinės lietuvių kalbų žodynas" , hereinafter — "the Frequency dictionary;" comp. by Utka A., Kaunas, VDU, 2009;)). It consists of 42,061 headwords based on...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Nov 23, 2022
Auziņa, Ilze; Darģis, Roberts; Rābante-Buša, Guna; Levāne-Petrova, Kristīne; Saulīte, Baiba, 2017, "Annotated longitudinal corpus of Latvian children's language", https://hdl.handle.net/20.500.12574/7, AiLab IMCS UL
The collection contains three longitudinal corpora of monolingual Latvian speaking children, and one longitudinal corpus of simultaneous Latvian-Russian bilingual child. Participants were recorded for 30 minutes each week for 16 months, resulting in 134 hours of speech. 34 hours of obtained speech samples are orthographically transcribed.This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |
Oct 26, 2022
Darģis, Roberts; Akmane, Agate; Naļivaiko, Inga; Grūzītis, Normunds; Auziņa, Ilze; Saulīte, Baiba; Stepanovs, Kaspars, 2021, "LVMED: Latvian Pronunciation Dictionary of the Medical Domain", https://hdl.handle.net/20.500.12574/68, AiLab IMCS UL
A machine-readable pronunciation dictionary of the medical domain derived from a large text corpus of historical medical records. Consists of 109k entries in the CSV format: first column - a wordform; second column - its pronunciation in the IPA encoding. The dictionary contains Latvian words and terms used in the medical domain, as well as abbrevi...This Dataset is harvested from our partners. Clicking the link will take you directly to the archival source of the data. |