82 Works

Estonian Wordnet (kb71-LAST)

Heili Orav
The atom of a wordnet-type thesaurus is a synonym set (also called a synset), which is a set containing all the synonymous words or multi-word units that express the same concept. All words in a synset belong to the same part of speech. This is the newest version.

Estonian WordNet (kb65a-4)

Heili Orav
Compiled manually, following the EuroWordNet project.

Pindsüntaktiliselt analüüsitud korpus

Kaili Müürisep
A monolingual corpus with Constraint Grammar-style shallow syntactic annotations.

Kõnele

Tanel Alumäe
Kõnele is a speech recognition service for Android applications that makes it possible to send e-mails, search for information, write notes, give commands, and so on by voice.

Eesti puudepanga korpus

Kaili Müürisep
The Estonian Treebank is available in both the VISL and TigerXML formats. Esttre consists of about 1400 manually annotated sentences (10600 tokens). The text classes represented in the corpus are fiction (both translated and original), newspaper texts, and 20 sentences of transcribed spoken language.

Tartu Ülikooli eesti murrete ja sugulaskeelte arhiiv

Tuuli Tuisk
This collection of linguistic material is managed by the University of Tartu, Institute of Estonian and General Linguistics. The archives contain four types of materials: 1) Sound recordings of Estonian dialects and Finno-Ugric languages, 2) Unpublished manuscripts, including - student papers and theses written at the Institute of Estonian and General Linguistics, - transcriptions and written notes on Estonian and Finno-Ugric languages, 3) Photos from fieldwork expeditions and linguistic events, 4) Video recordings.

Eesti ajakirjanduse korpus

Kadri Muischnek
The corpus contains Estonian newspapers, 182 million words. TEI P5 XML markup, UTF-8 encoding.

The database of Estonian multi-word expressions

Kadri Muischnek
This database contains a subtype of multi-word expressions, namely those consisting of a verb and a particle or a verb and its complements.

Eesti-prantsuse paralleelkorpus

Madis Jürviste
Aligned parallel translation corpus containing 65 million words (including both languages).

Morphological analyzer for Estonian ESTMORF

Heiki-Jaan Kaalep
ESTMORF is a computer program for analysing unrestricted Estonian text. ESTMORF is implemented in a straightforward way: it compares word forms of the running text with combinations of lexemes from its lexicon. For every input word form, ESTMORF determines the structure of the word (e.g. stem, derivational suffix, inflectional affix), the word class, and the inflectional categories (e.g. number and case).

Corpus of Adolescent Speech

Einar Meister
Includes speech recordings of 300 native Estonian subjects aged 9 to 18 years.

Estonian Wordnet (kb71-STABLE)

Heili Orav
As of February 2015, Estonian Wordnet (version 71) contains about 73,000 concepts, with about 98,700 words, more than 230,000 semantic relations, and more than 105,000 relations to English. Every concept has at least one language-internal relation and at least one ILI relation.

Arvutaja

Tanel Alumäe
Speech-based calculator/assistant for Android

Eestikeelne kõnesüntees

Meelis Mihkla
The speech synthesis environment brings together the various variants, applications, interfaces, and other materials created for Estonian text-to-speech synthesis. Various Estonian synthetic voices can be listened to and also downloaded. Applications that use the synthesis can likewise be downloaded for both smartphones and computers.

Estonian Wordnet (kb69a)

Heili Orav
The atom of a wordnet-type thesaurus is a synonym set (also called a synset), which is a set containing all the synonymous words or multi-word units that express the same concept. All words in a synset belong to the same part of speech.

Aktsendikorpus

Einar Meister
Ten native speakers of Lithuanian were recorded, and sentences containing words with quantity contrasts, spoken by 6 Japanese, 7 Swedish, and 20 Latvian informants, were segmented manually; 1566 sentences in total.

Suur eesti-prantsuse sõnaraamat

Madis Jürviste
A comprehensive bilingual general language translation dictionary.

Registration Year

  • 2015
    82

Data Centers

  • Keeleressursid. The Center of Estonian Language Resources
    82