20 Language Resources

Order by:

 Arboretum treebank    
  • Danish

ID: ELRA-W0084

ISLRN: 025-729-182-451-2

The Arboretum treebank is a morphologically and syntactically annotated repository of Danish sentences, taken from Korpus 90 and Korpus 2000, both compiled by the Society for Danish Language and Literature (http://ordnet.dk/korpusdk/fakta), and containing samples of written Danish from the 90'ies...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1500.00 € submit
7000.00 € submit
Licence: Commercial Use - ELRA VAR
7000.00 € submit
7000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2200.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
 AUDIO Human Voice Pronunciations - Danish    
  • Danish

ID: ELRA-S0490-05

ISLRN: 787-681-475-552-1

Human voice recordings of single-word lemmas and multiword expressions, besides IPA (International Phonetic Alphabet) and alternative scripts (Japanese – Romaji/Kanji/Hiragana; Chinese – Pinyin; Arabic and Hebrew – w/out diacritics), distributed as distinct sets (from ELRA-S0490-01 to ELRA-S0490-...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
887.80 € submit
887.80 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
932.19 € submit
932.19 € submit

Special offers are also available. Check here for details.

 Danish EUROM1    
  • Danish

ID: ELRA-S0292

ISLRN: 775-921-898-933-2

EUROM1 is the first really multilingual speech database produced in Europe. Equivalent corpora for each of the European languages were collected with the same number of speakers selected in the same way, and recorded in the same conditions with common file formats. Initially eight European countr...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
800.00 € submit
Licence: Commercial Use - ELRA VAR
800.00 € submit
800.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1600.00 € submit
1600.00 € submit
Licence: Commercial Use - ELRA VAR
1600.00 € submit
1600.00 € submit
 Danish Propbank    
  • Danish

ID: ELRA-W0117

ISLRN: 213-212-351-142-5

The Danish Propbank (DPB) is a multi-layer treebank, annotated not only with morphosyntactic, but also with semantic information, in particular propositions/frames with VerbNet classes and semantic roles for both arguments and satellites. In addition, the corpus has been annotated with 20 Named E...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
150.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
7000.00 € submit
Licence: Commercial Use - ELRA VAR
7000.00 € submit
7000.00 € submit
 Danish SpeechDat-Car - Full database    
  • Danish

ID: ELRA-S0132-01

ISLRN: 392-627-715-651-4

The Danish SpeechDat-Car comprises the recordings of 300 Danish speakers from 5 different regions (162 males, 138 females), recorded over the GSM telephone network, and in a car. This database is partitioned into 15 DVDs (53 GB), plus 1 CD-ROM for e.g. non-signal files and documentation. The spee...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
40000.00 € submit
50000.00 € submit
Licence: Commercial Use - ELRA VAR
50000.00 € submit
50000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
48000.00 € submit
60000.00 € submit
Licence: Commercial Use - ELRA VAR
60000.00 € submit
60000.00 € submit
 Danish SpeechDat-Car - GSM recordings - GSM recordings only    
  • Danish

ID: ELRA-S0132-02

ISLRN: 246-078-501-252-4

The Danish SpeechDat-Car contains the recordings of 300 Danish speakers from 5 different regions (162 males, 138 females), recorded over the GSM telephone network, and in a car. This database is partitioned into 15 DVDs (53 GB), plus 1 CD-ROM for e.g. non-signal files and documentation. The speec...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
20000.00 € submit
25000.00 € submit
Licence: Commercial Use - ELRA VAR
25000.00 € submit
25000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
24000.00 € submit
30000.00 € submit
Licence: Commercial Use - ELRA VAR
30000.00 € submit
30000.00 € submit
 Danish SpeechDat-Car - In-car recordings    
  • Danish

ID: ELRA-S0132-03

ISLRN: 554-348-203-545-0

The Danish SpeechDat-Car contains the recordings of 300 Danish speakers from 5 different regions (162 males, 138 females), recorded over the GSM telephone network, and in a car. This database is partitioned into 15 DVDs (53 GB), plus 1 CD-ROM for e.g. non-signal files and documentation. The speec...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
28000.00 € submit
35000.00 € submit
Licence: Commercial Use - ELRA VAR
35000.00 € submit
35000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
33600.00 € submit
42000.00 € submit
Licence: Commercial Use - ELRA VAR
42000.00 € submit
42000.00 € submit
 Danish SpeechDat(II) FDB-1000    
  • Danish

ID: ELRA-S0072

ISLRN: 887-530-701-059-9

The Danish SpeechDat(II) FDB-1000 contains the recordings of 1,000 Danish speakers (1940 males, 2060 females) recorded over the Danish fixed telephone network. This speech database was validated by SPEX (the Netherlands) to assess its compliance with the SpeechDat format and content specificati...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
9000.00 € submit
18000.00 € submit
Licence: Commercial Use - ELRA VAR
18000.00 € submit
18000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
22000.00 € submit
25000.00 € submit
Licence: Commercial Use - ELRA VAR
25000.00 € submit
25000.00 € submit
 Danish SpeechDat(II) FDB-4000    
  • Danish

ID: ELRA-S0073

ISLRN: 158-802-352-546-0

The Danish SpeechDat(II) FDB-4000 comprises 4,000 Danish speakers (1,940 males, 2,060 females) recorded over the Danish fixed telephone network. This database is partitioned into 14 CDs. The first 13 CDs comprise 300 speakers sessions each, the 14th comprises 100 speakers. This speech database w...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
28000.00 € submit
40000.00 € submit
Licence: Commercial Use - ELRA VAR
40000.00 € submit
40000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
48000.00 € submit
56000.00 € submit
Licence: Commercial Use - ELRA VAR
56000.00 € submit
56000.00 € submit
 Danish SpeechDat(M) database - DB1    
  • Danish

ID: ELRA-S0040

ISLRN: 696-873-170-497-2

The Danish SpeechDat(M) database is the speech database collected within the SpeechDat(M) project. It consists ofpolyphone-like data recorded by 1,523 speakers. The speech files are stored as sequences of 8 bit 8 kHz A-law samples. Each prompted utterance is stored within a separatefile and the a...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
11000.00 € submit
14000.00 € submit
Licence: Commercial Use - ELRA VAR
14000.00 € submit
14000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
20000.00 € submit
20000.00 € submit
Licence: Commercial Use - ELRA VAR
20000.00 € submit
20000.00 € submit
 Danish SpeechDat(M) database - DB2    
  • Danish

ID: ELRA-S0041

ISLRN: 450-161-024-148-1

The (polyphone-like) Danish SpeechDat(M) database contains the recordings of 1,523 Danish speakers from 11 regions. Speech samples are stored as sequences of 8 bit 8 kHz A-law. Each prompted utterance is stored in a separate file, and the associated label files are stored in SAM file format. Ea...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6000.00 € submit
12000.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
12000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
10000.00 € submit
20000.00 € submit
Licence: Commercial Use - ELRA VAR
20000.00 € submit
20000.00 € submit
 GEOLINGUAL Multilingual Geographical Entity Tables    
  • Arabic
  • Chinese
  • Danish
  • Dutch; Flemish
  • English
  • French
  • German
  • Hebrew
  • Japanese
  • Korean
  • Modern Greek (1453-)
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Turkish

ID: ELRA-L0205

ISLRN: 816-648-322-249-9

A table of over 200 countries and other major geographical names worldwide – including their adjectives, persons, and main languages – in the following languages: Arabic, Chinese Simplified, Danish, Dutch, English, French, German, Greek, Hebrew, Japanese, Korean, Polish, Portuguese, Russian, Span...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
1050.00 € submit
1050.00 € submit
 GLOBAL Multilingual Lexical Data - Bilingual - Level 1    
  • Arabic
  • Chinese
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • French
  • German
  • Hebrew
  • Hindi
  • Italian
  • Japanese
  • Korean
  • Latin
  • Modern Greek (1453-)
  • Norwegian
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish

ID: ELRA-M0111-04

ISLRN: 255-971-767-096-3

The GLOBAL Multilingual Lexical Data (references ELRA-M0111-01 to ELRA-M0111-06 in the ELRA Catalogue) consists of a network of lexicographic cores for major world languages, comprising diverse monolingual, bilingual and multilingual combinations, in different sizes, originally built for language...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
6800.00 € submit
6800.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
7140.00 € submit
7140.00 € submit

Special offers are also available. Check here for details.

 GLOBAL Multilingual Lexical Data - Bilingual - Level 2    
  • Danish
  • Dutch; Flemish
  • French
  • German
  • Hebrew
  • Italian
  • Norwegian
  • Portuguese
  • Spanish; Castilian
  • Swedish

ID: ELRA-M0111-05

ISLRN: 642-267-621-639-3

The GLOBAL Multilingual Lexical Data (references ELRA-M0111-01 to ELRA-M0111-06 in the ELRA Catalogue) consists of a network of lexicographic cores for major world languages, comprising diverse monolingual, bilingual and multilingual combinations, in different sizes, originally built for language...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
13690.00 € submit
13690.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
14374.50 € submit
14374.50 € submit

Special offers are also available. Check here for details.

 GLOBAL Multilingual Lexical Data - Monolingual - Level 1    
  • Arabic
  • Chinese
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • French
  • German
  • Hebrew
  • Hindi
  • Italian
  • Japanese
  • Korean
  • Latin
  • Modern Greek (1453-)
  • Norwegian
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish

ID: ELRA-M0111-01

ISLRN: 604-974-454-390-3

The GLOBAL Multilingual Lexical Data (references ELRA-M0111-01 to ELRA-M0111-06 in the ELRA Catalogue) consists of a network of lexicographic cores for major world languages, comprising diverse monolingual, bilingual and multilingual combinations, in different sizes, originally built for language...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
4250.00 € submit
4250.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
4462.50 € submit
4462.50 € submit

Special offers are also available. Check here for details.

 GLOBAL Multilingual Lexical Data - Monolingual - Level 2    
  • Danish
  • Dutch; Flemish
  • French
  • German
  • Hebrew
  • Italian
  • Norwegian
  • Portuguese
  • Spanish; Castilian
  • Swedish

ID: ELRA-M0111-02

ISLRN: 282-033-962-912-2

The GLOBAL Multilingual Lexical Data (references ELRA-M0111-01 to ELRA-M0111-06 in the ELRA Catalogue) consists of a network of lexicographic cores for major world languages, comprising diverse monolingual, bilingual and multilingual combinations, in different sizes, originally built for language...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
8510.00 € submit
8510.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
8935.50 € submit
8935.50 € submit

Special offers are also available. Check here for details.

 MULTIGLOSS Multilingual Glossaries - L1-English pair    
  • Afrikaans
  • Arabic
  • Azerbaijani
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Icelandic
  • Indonesian
  • Italian
  • Japanese
  • Korean
  • Latin
  • Latvian
  • Lithuanian
  • Malay (macrolanguage)
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • Western Frisian

ID: ELRA-M0112-01

ISLRN: 098-079-939-987-5

A series of innovative multilingual word-to-sense glossaries, based on a human-edited word-to-sense bilingual index of each language to English, which is linked automatically to the translation equivalents in 45 target languages. Each word and expression in every language is translated via its...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
2500.00 € submit
2500.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
2625.00 € submit
2625.00 € submit

Special offers are also available. Check here for details.

 MULTIGLOSS Multilingual Glossaries - L1-English pair + 1 language    
  • Afrikaans
  • Arabic
  • Azerbaijani
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Icelandic
  • Indonesian
  • Italian
  • Japanese
  • Korean
  • Latin
  • Latvian
  • Lithuanian
  • Malay (macrolanguage)
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • Western Frisian

ID: ELRA-M0112-02

ISLRN: 610-290-284-705-6

A series of innovative multilingual word-to-sense glossaries, based on a human-edited word-to-sense bilingual index of each language to English, which is linked automatically to the translation equivalents in 45 target languages. Each word and expression in every language is translated via its...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
3750.00 € submit
3750.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
3937.50 € submit
3937.50 € submit

Special offers are also available. Check here for details.

 Parallel Corpora & Domains (bilingual and multilingual)    
  • Arabic
  • Chinese
  • Danish
  • Dutch; Flemish
  • English
  • Finnish
  • French
  • German
  • Hebrew
  • Italian
  • Japanese
  • Korean
  • Modern Greek (1453-)
  • Northern Sami
  • Norwegian
  • Polish
  • Portuguese
  • Russian
  • Spanish; Castilian
  • Swedish
  • Turkish

ID: ELRA-W0336

ISLRN: 471-919-856-164-1

Parallel corpora for nearly 400 language pairs and numerous multilingual combinations, including 10 million bilingual segments and 90 million tokens in 20 languages: Arabic, Chinese (Simplified), Danish, Dutch, English, Finnish, French, German, Greek, Hebrew, Italian, Japanese, Korean, North Sami...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
0.10 € submit
0.10 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
0.11 € submit
0.11 € submit

Special offers are also available. Check here for details.

 STO SprogTeknologisk Ordbase (Danish Lexicon for NLP/HLT Applications)    
  • Danish

ID: ELRA-L0056

ISLRN: 050-677-531-676-8

The STO Lexicon is the most comprehensive computational lexicon of Danish comprising approx. 81,530 entry words, and it is well integrated with the European activities in the field of lexicon development building on experience obtained from the PAROLE and SIMPLE projects. The model and descriptiv...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2000.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
21000.00 € submit
21000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2500.00 € submit
6250.00 € submit
Licence: Commercial Use - ELRA VAR
26250.00 € submit
26250.00 € submit