11 Language Resources

 2006 CoNLL Shared Task - Ten Languages    
  • Bulgarian
  • Danish
  • Dutch; Flemish
  • German
  • Japanese
  • Portuguese
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Turkish

ID: ELRA-W0086

ISLRN: 578-227-532-044-0

2006 CoNLL Shared Task - Ten Languages consists of dependency treebanks in ten languages used as part of the CoNLL 2006 shared task on multi-lingual dependency parsing. The languages covered in this release are: Bulgarian, Danish, Dutch, German, Japanese, Portuguese, Slovene, Spanish, Swedish and...

MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 0.00 € | commercial: 0.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 0.00 € | commercial: 0.00 €
 EnToSSLNE - a Lexicon of Parallel Named Entities from English to South Slavic Languages    
  • Bosnian
  • Bulgarian
  • Croatian
  • English
  • Macedonian
  • Serbian
  • Slovenian

ID: ELRA-M0051

ISLRN: 690-348-503-270-1

This lexicon contains multiword entries which are not strictly named entities, but contain a word which is. For example, German shepherd is not strictly a named entity, since many dogs of this breed exist, but the adjective German makes it a named entity in a broader sense, so it is an entry in this lexicon. Accordingly, there are m...

MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 300.00 € | commercial: 1000.00 €
Licence: Commercial Use - ELRA VAR | academic: 1000.00 € | commercial: 1000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 600.00 € | commercial: 2000.00 €
Licence: Commercial Use - ELRA VAR | academic: 2000.00 € | commercial: 2000.00 €
 LC-STAR English-Slovenian Bilingual Aligned Phrasal lexicon      
  • English
  • Slovenian

ID: ELRA-S0274

ISLRN: 336-577-115-310-7

The LC-STAR English-Slovenian Bilingual Aligned Phrasal lexicon was created within the scope of the LC-STAR project (IST 2001-32216) which was sponsored by the European Commission. It was designed for SST (Speech-to-Speech Translation). The lexicon comprises 12,722 phrases from the tourist domai...

MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 3750.00 € | commercial: 5500.00 €
Licence: Commercial Use - ELRA VAR | academic: 5500.00 € | commercial: 5500.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 4875.00 € | commercial: 7150.00 €
Licence: Commercial Use - ELRA VAR | academic: 7150.00 € | commercial: 7150.00 €
 LC-STAR Slovenian Phonetic lexicon      
  • Slovenian

ID: ELRA-S0273

ISLRN: 038-045-048-122-2

The LC-STAR Slovenian Phonetic lexicon was created within the scope of the LC-STAR project (IST 2001-32216) which was sponsored by the European Commission. The lexicon comprises 110,900 entries, distributed over three categories: - a set of 64,521 common word entries. This set is extracted from...

MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 15250.00 € | commercial: 23000.00 €
Licence: Commercial Use - ELRA VAR | academic: 23000.00 € | commercial: 23000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 23250.00 € | commercial: 31500.00 €
Licence: Commercial Use - ELRA VAR | academic: 31500.00 € | commercial: 31500.00 €
 MULTIGLOSS Multilingual Glossaries - L1-English pair    
  • Afrikaans
  • Arabic
  • Azerbaijani
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Icelandic
  • Indonesian
  • Italian
  • Japanese
  • Korean
  • Latin
  • Latvian
  • Lithuanian
  • Malay (macrolanguage)
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • Western Frisian

ID: ELRA-M0112-01

ISLRN: 098-079-939-987-5

A series of innovative multilingual word-to-sense glossaries, based on a human-edited word-to-sense bilingual index of each language to English, which is linked automatically to the translation equivalents in 45 target languages. Each word and expression in every language is translated via its...

MEMBER
Licence: Commercial Use - ELRA VAR | academic: 2500.00 € | commercial: 2500.00 €
NON MEMBER
Licence: Commercial Use - ELRA VAR | academic: 2625.00 € | commercial: 2625.00 €

Special offers are also available.

 MULTIGLOSS Multilingual Glossaries - L1-English pair + 1 language    
  • Afrikaans
  • Arabic
  • Azerbaijani
  • Bulgarian
  • Catalan; Valencian
  • Chinese
  • Croatian
  • Czech
  • Danish
  • Dutch; Flemish
  • English
  • Estonian
  • Finnish
  • French
  • German
  • Hebrew
  • Hindi
  • Hungarian
  • Icelandic
  • Indonesian
  • Italian
  • Japanese
  • Korean
  • Latin
  • Latvian
  • Lithuanian
  • Malay (macrolanguage)
  • Modern Greek (1453-)
  • Norwegian
  • Persian
  • Polish
  • Portuguese
  • Romanian; Moldavian; Moldovan
  • Russian
  • Serbian
  • Slovak
  • Slovenian
  • Spanish; Castilian
  • Swedish
  • Thai
  • Turkish
  • Ukrainian
  • Urdu
  • Vietnamese
  • Western Frisian

ID: ELRA-M0112-02

ISLRN: 610-290-284-705-6

A series of innovative multilingual word-to-sense glossaries, based on a human-edited word-to-sense bilingual index of each language to English, which is linked automatically to the translation equivalents in 45 target languages. Each word and expression in every language is translated via its...

MEMBER
Licence: Commercial Use - ELRA VAR | academic: 3750.00 € | commercial: 3750.00 €
NON MEMBER
Licence: Commercial Use - ELRA VAR | academic: 3937.50 € | commercial: 3937.50 €

Special offers are also available.

 ONOMASTICA-COPERNICUS DATABASE      
  • Czech
  • Estonian
  • Latvian
  • Polish
  • Slovak
  • Slovenian
  • Ukrainian

ID: ELRA-S0043

ISLRN: 246-224-540-110-4

The ONOMASTICA project was a European-wide research initiative within the scope of the Linguistic Research and Engineering Programme, the aim of which was the construction of a multi-language pronunciation lexicon of proper names. That project covered eleven European languages: Danish, Dutch, Eng...

MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 400.00 € | commercial: 3000.00 €
Licence: Commercial Use - ELRA VAR | academic: 3000.00 € | commercial: 3000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 800.00 € | commercial: 6000.00 €
Licence: Commercial Use - ELRA VAR | academic: 6000.00 € | commercial: 6000.00 €
 Secretariat-General parallel corpus SL-EN and EN-SL (part 1) (Processed)    
  • English
  • Slovenian

ID: ELRA-W0190

ISLRN: 271-870-307-699-4

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English-Slovenian parallel corpus in TMX format from the...

MEMBER
Licence: Other - Public Domain | academic: 0.00 € | commercial: 0.00 €
NON MEMBER
Licence: Other - Public Domain | academic: 0.00 € | commercial: 0.00 €
 Secretariat-General parallel corpus SL-EN and EN-SL (part 2) (Processed)    
  • English
  • Slovenian

ID: ELRA-W0191

ISLRN: 963-471-195-725-8

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English-Slovenian parallel corpus in TMX format from the...

MEMBER
Licence: Other - Public Domain | academic: 0.00 € | commercial: 0.00 €
NON MEMBER
Licence: Other - Public Domain | academic: 0.00 € | commercial: 0.00 €
 Slovenian BNSI Broadcast News Speech Corpus    
  • Slovenian

ID: ELRA-S0275

ISLRN: 502-280-144-938-4

This speech database consists of TV news shows (both the evening news, “TV Dnevnik”, and the late-night news, “Odmevi”) from the archive of the Slovenian national broadcaster RTV Slovenia. The recordings were made between June 1999 and May 2003. The database comprises a total of 36 hours of recordings (...

MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 6000.00 € | commercial: 19000.00 €
Licence: Commercial Use - ELRA VAR | academic: 19000.00 € | commercial: 19000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER | academic: 10000.00 € | commercial: 33000.00 €
Licence: Commercial Use - ELRA VAR | academic: 33000.00 € | commercial: 33000.00 €
 Slovenian-English corpus with statistical reports from the Statistical Office of the Republic of Slovenia website (Processed)    
  • English
  • Slovenian

ID: ELRA-W0267

ISLRN: 169-569-336-630-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Slovenian-English corpus with statistical reports from t...

MEMBER
Licence: Other - Open Under-PSI | academic: 0.00 € | commercial: 0.00 €
NON MEMBER
Licence: Other - Open Under-PSI | academic: 0.00 € | commercial: 0.00 €