1685 Language Resources (Page 43 of 85)


- English
- French
- German
- Spanish; Castilian
ID: ELRA-T0098
ISLRN: 759-928-762-639-3
Cards available: 2210. Languages: German, English, French, Spanish. Card description: Each card in this terminological database contains a definition, relations between concepts, graphics, abbreviations, notes, sub-domains, sources, and grammatical labels.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2718.30 € | 2718.30 € |
Licence: Commercial Use - ELRA VAR | 2718.30 € | 2718.30 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 4530.50 € | 4530.50 € |
Licence: Commercial Use - ELRA VAR | 4530.50 € | 4530.50 € |


- Arabic
- English
ID: ELRA-E0040
ISLRN: 631-407-723-040-2
The MEDAR Evaluation Package was produced within the project MEDAR (MEDiterranean ARabic language and speech technology), which was supported by the European Commission's ICT programme and ran from February 1st 2008 until July 31st 2010. The project addressed International Cooperation be...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 0.00 € | 0.00 € |


- French
ID: ELRA-E0024
ISLRN: 699-856-029-354-6
The MEDIA Evaluation Package was produced within the French national project MEDIA (Automatic evaluation of man-machine dialogue systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The MEDIA project made it possible to carry out a campaign...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 5000.00 € |
Licence: Evaluation Use - ELRA EVALUATION | 1000.00 € | |
Licence: Commercial Use - ELRA VAR | 20000.00 € | 20000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 7500.00 € |
Licence: Evaluation Use - ELRA EVALUATION | 6500.00 € | |
Licence: Commercial Use - ELRA VAR | 25000.00 € | 25000.00 € |
This resource is also available in a bundle. Check here for bundled pricing.


- French
ID: ELRA-S0272
ISLRN: 195-971-767-455-9
The MEDIA speech database for French was produced by ELDA within the French national project MEDIA (Automatic evaluation of man-machine dialogue systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). It contains 1,258 transcribed ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 5000.00 € |
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 10000.00 € |
Licence: Commercial Use - ELRA VAR | 10000.00 € | 10000.00 € |


- English
- Modern Greek (1453-)
ID: ELRA-W0210
ISLRN: 043-737-892-695-4
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Memorandum of Understanding for a three-year European St...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain | 0.00 € | 0.00 € |


- English
ID: ELRA-S0394
ISLRN: 217-906-813-531-9
INTRODUCTION Metalogue Multi-Issue Bargaining Dialogue was developed by the Metalogue Consortium (http://cordis.europa.eu/project/rcn/110655_en.html) under the European Community's Seventh Framework Programme for Research and Technological Development (https://ec.europa.eu/research/fp7/index_e...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 0.00 € |
Licence: Commercial Use - ELRA VAR | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 250.00 € | 250.00 € |
Licence: Commercial Use - ELRA VAR | 250.00 € | 250.00 € |


- English
- Modern Greek (1453-)
ID: ELRA-W0208
ISLRN: 462-928-711-185-4
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Methodological Reconciliation Table Council Directive 20...
MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain | 0.00 € | 0.00 € |


- Spanish; Castilian
ID: ELRA-S0228-94
ISLRN: 217-568-306-452-3
This corpus comprises 19,156 entries uttered by 30 speakers (16 males and 14 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as 16-bit, 44.1 kHz audio, for a total of 5 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5400.00 € | 5400.00 € |
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5400.00 € | 5400.00 € |
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 € |


- Spanish; Castilian
ID: ELRA-S0228-104
ISLRN: 866-276-372-885-9
This corpus was recorded in a quiet office environment over 3 channels and collected from a total of 826 speakers, including 408 males and 418 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news. Spee...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 81000.00 € | 81000.00 € |
Licence: Commercial Use - ELRA VAR | 81000.00 € | 81000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 81000.00 € | 81000.00 € |
Licence: Commercial Use - ELRA VAR | 81000.00 € | 81000.00 € |


- Arabic
ID: ELRA-S0404
ISLRN: 938-639-614-524-5
The MGB-5 Moroccan Dialect comprises 14 hours of Moroccan Arabic speech extracted from 93 YouTube videos distributed across seven genres: comedy, cooking, family/children, fashion, drama, sports, and science clips. Given that dialectal Arabic does not have a clearly defined orthography, differ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 1500.00 € |
Licence: Commercial Use - ELRA VAR | 1500.00 € | 1500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 2000.00 € |
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 € |



- French
ID: ELRA-S0100
ISLRN: 740-149-502-864-8
MHATLex is a new enhanced lexical resource for the automatic processing of written and spoken French. It is derived from BDLex (see ELRA-S0004). It contains three levels of representation:
- Syntactic level: S
- Phonological word level: W
- Phonetic level: P

At the W level, a word has two repr...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 1500.00 € | 5000.00 € |
Licence: Commercial Use - ELRA VAR | 5000.00 € | 5000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2500.00 € | 7500.00 € |
Licence: Commercial Use - ELRA VAR | 7500.00 € | 7500.00 € |


- Spanish; Castilian
ID: ELRA-S0165
ISLRN: 313-534-255-935-8
The ATLAS Spanish Microphone Database (MICROAES) has been collected in Spain by Applied Technologies on Language and Speech, S.L. (ATLAS). This database comprises microphone recordings from 300 different speakers, who have been selected from five different dialectal areas. Sex and age distributio...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 18000.00 € | 28000.00 € |
Licence: Commercial Use - ELRA VAR | 28000.00 € | 28000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 22000.00 € | 32000.00 € |
Licence: Commercial Use - ELRA VAR | 32000.00 € | 32000.00 € |


- Chinese
- Finnish
- French
- German
- Persian
- Russian
- Somali
- Swahili (macrolanguage)
ID: ELRA-E0047
ISLRN: 200-586-423-805-2
MiLQ is a benchmark of mixed-language (code-switched) search queries created by bilingual speakers for evaluating Information Retrieval with mixed-language queries. It provides query versions where English expressions are embedded within native-language structures. This work is derived from The C...
MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 0.00 € | 0.00 € |


- Dutch; Flemish
- English
- French
- German
ID: ELRA-S0238
ISLRN: 189-835-264-931-4
In 1996, some 75 Dutch people participated in recording a multi-purpose continuous speech database. Most of them were recruited from the TNO Human Factors Research Institute, where the recordings were made. The main part of the database consisted of Dutch sentences. However, most speakers partici...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 500.00 € |


- Chinese
- English
ID: ELRA-S0457
ISLRN: 451-966-049-653-3
The data was recorded by 3,972 native Chinese speakers with accents covering seven major dialect areas. The recorded text is a mixture of Chinese and English sentences, covering general scenes and human-computer interaction scenes. It is rich in content and accurate in transcription. It can be used...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 145825.00 € | 145825.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 145825.00 € | 145825.00 € |
Special offers are also available. Check here for details.


- Danish
- Dutch; Flemish
- English
- French
- German
- Italian
- Modern Greek (1453-)
- Portuguese
- Spanish; Castilian
ID: ELRA-W0023
ISLRN: 963-635-729-341-8
The MLCC text corpus has two main components: one set to allow comparable studies to be carried out in different languages, and one set as the basis for translation studies. The first set is referred to as the Polylingual Document Collection, a collection of newspaper articles from financial new...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 1600.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3600.00 € |


- French
ID: ELRA-W0032
ISLRN: 488-420-763-510-8
The corpus that includes the tagging of the anaphors was created by the CRISTAL-GRESEC (Stendhal-Grenoble 3 University, France) team and XRCE (Xerox Research Centre Europe, France) in the framework of the call launched by the DGLF-LF (national institution for the French language and the languages...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 250.00 € | 250.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 1000.00 € | 1000.00 € |


- Lithuanian
ID: ELRA-W0299
ISLRN: 268-109-862-136-1
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Monolingual documents received from the Government of th...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution - CC-BY-4.0 | 0.00 € | 0.00 € |


- Modern Greek (1453-)
ID: ELRA-W0014
ISLRN: 546-958-429-693-4
Monolingual Greek corpus of 1 million words. The corpus consists of articles written in 1996 from the Greek daily newspaper ELEFTHEROTIPIA. Each file contains annotated text with SGML mark-up accompanied by a text header.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 360.00 € | 360.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 600.00 € | 600.00 € |


- Vietnamese
ID: ELRA-W0310
ISLRN: 004-081-406-421-7
The Monolingual Vietnamese Annotated Corpus consists of 100,000 sentences, manually annotated with word boundaries, POS tags, and named entities. Sentences average 20 words in length. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 500.00 € | 900.00 € |
Licence: Commercial Use - ELRA VAR | 1800.00 € | 1800.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 800.00 € | 1300.00 € |
Licence: Commercial Use - ELRA VAR | 2500.00 € | 2500.00 € |