Text (1054)
Audio (681)
Video (23)
True (226)
TEI (10)
TMX (6)

Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

1685 Language Resources (Page 43 of 85)

« Previous | Next »Order by:

 Mechanical Engineering    
  • English
  • French
  • German
  • Spanish; Castilian

ID: ELRA-T0098

ISLRN: 759-928-762-639-3

Cards available: 2210 Languages: German, English, French,Spanish Card Description: Each card in this terminological database contains a definition, relation between concepts, graphics, abbreviations, notes, sub-domains, sources, grammatical labels.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2718.30 € submit
2718.30 € submit
Licence: Commercial Use - ELRA VAR
2718.30 € submit
2718.30 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4530.50 € submit
4530.50 € submit
Licence: Commercial Use - ELRA VAR
4530.50 € submit
4530.50 € submit
 MEDAR Evaluation Package    
  • Arabic
  • English

ID: ELRA-E0040

ISLRN: 631-407-723-040-2

The MEDAR Evaluation Package was produced within the project MEDAR (MEDiterranean ARabic language and speech technology), supported by the European Commission's ICT programme and which has been running from February 1st 2008 until July 31st 2010. The project addressed International Cooperation be...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
0.00 € submit
0.00 € submit
 MEDIA Evaluation Package    
  • French

ID: ELRA-E0024

ISLRN: 699-856-029-354-6

The MEDIA Evaluation Package was produced within the French national project MEDIA (Automatic evaluation of man-machine dialogue systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). The MEDIA project enabled to carry out a campaign...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
5000.00 € submit
Licence: Evaluation Use - ELRA EVALUATION
1000.00 € submit
Licence: Commercial Use - ELRA VAR
20000.00 € submit
20000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
7500.00 € submit
Licence: Evaluation Use - ELRA EVALUATION
6500.00 € submit
Licence: Commercial Use - ELRA VAR
25000.00 € submit
25000.00 € submit

This resource is also available in a bundle. Check here for bundled pricing.

 MEDIA speech database for French    
  • French

ID: ELRA-S0272

ISLRN: 195-971-767-455-9

The MEDIA speech database for French was produced by ELDA within the French national project MEDIA (Automatic evaluation of man-machine dialogue systems), as part of the Technolangue programme funded by the French Ministry of Research and New Technologies (MRNT). It contains 1,258 transcribed ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
 Memorandum for a ESM programme (Processed)    
  • English
  • Modern Greek (1453-)

ID: ELRA-W0210

ISLRN: 043-737-892-695-4

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Memorandum of Understanding for a three-year European St...

MEMBERacademiccommercial
Licence: Other - Public Domain
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Public Domain
0.00 € submit
0.00 € submit
 Metalogue Multi-Issue Bargaining Dialogue    
  • English

ID: ELRA-S0394

ISLRN: 217-906-813-531-9

INTRODUCTION Metalogue Multi-Issue Bargaining Dialogue was developed by the Metalogue Consortium (http://cordis.europa.eu/project/rcn/110655_en.html) under the European Community's Seventh Framework Programme for Research and Technological Development (https://ec.europa.eu/research/fp7/index_e...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
0.00 € submit
Licence: Commercial Use - ELRA VAR
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
250.00 € submit
250.00 € submit
Licence: Commercial Use - ELRA VAR
250.00 € submit
250.00 € submit
 Methodological Reconciliation (Processed)    
  • English
  • Modern Greek (1453-)

ID: ELRA-W0208

ISLRN: 462-928-711-185-4

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Methodological Reconciliation Table Council Directive 20...

MEMBERacademiccommercial
Licence: Other - Public Domain
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Public Domain
0.00 € submit
0.00 € submit
 Mexican Spanish Kids Speech Recognition Corpus (Desktop)    
  • Spanish; Castilian

ID: ELRA-S0228-94

ISLRN: 217-568-306-452-3

This corpus comprises 19,156 entries uttered by 30 speakers (16 males and 14 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 5 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
 Mexican Spanish Speech Recognition Corpus (Mobile)    
  • Spanish; Castilian

ID: ELRA-S0228-104

ISLRN: 866-276-372-885-9

This corpus was recorded in a quiet office environment over 3 channels and collected from a total of 826 speakers, including 408 males and 418 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news. Spee...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
81000.00 € submit
81000.00 € submit
Licence: Commercial Use - ELRA VAR
81000.00 € submit
81000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
81000.00 € submit
81000.00 € submit
Licence: Commercial Use - ELRA VAR
81000.00 € submit
81000.00 € submit
 MGB-5 Moroccan Dialect    
  • Arabic

ID: ELRA-S0404

ISLRN: 938-639-614-524-5

The MGB-5 Moroccan Dialect comprises 14 hours of Moroccan Arabic speech extracted from 93 YouTube videos distributed across seven genres: comedy, cooking, family/children, fashion, drama, sports, and science clips. Given that dialectal Arabic does not have a clearly defined orthography, differ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
1500.00 € submit
Licence: Commercial Use - ELRA VAR
1500.00 € submit
1500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
2000.00 € submit
Licence: Commercial Use - ELRA VAR
2000.00 € submit
2000.00 € submit
 MHATLex      
  • French

ID: ELRA-S0100

ISLRN: 740-149-502-864-8

MHATLex is a new enhanced lexical resource for written and speech automatic processing for French. It is derived from BDLex (see ELRA-S0004). It contains three levels of representation: - Syntactic level: S - Phonological word level: W - Phonetic level: P At the W level, a word has two repr...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1500.00 € submit
5000.00 € submit
Licence: Commercial Use - ELRA VAR
5000.00 € submit
5000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2500.00 € submit
7500.00 € submit
Licence: Commercial Use - ELRA VAR
7500.00 € submit
7500.00 € submit
 MICROAES    
  • Spanish; Castilian

ID: ELRA-S0165

ISLRN: 313-534-255-935-8

The ATLAS Spanish Microphone Database (MICROAES) has been collected in Spain by Applied Technologies on Language and Speech, S.L. (ATLAS). This database comprises microphone recordings from 300 different speakers, who have been selected from five different dialectal areas. Sex and age distributio...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
18000.00 € submit
28000.00 € submit
Licence: Commercial Use - ELRA VAR
28000.00 € submit
28000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
22000.00 € submit
32000.00 € submit
Licence: Commercial Use - ELRA VAR
32000.00 € submit
32000.00 € submit
 MiLQ: Mixed-Language Query Test Set for Bilingual Web Search – Evaluation Package    
  • Chinese
  • Finnish
  • French
  • German
  • Persian
  • Russian
  • Somali
  • Swahili (macrolanguage)

ID: ELRA-E0047

ISLRN: 200-586-423-805-2

MiLQ is a benchmark of mixed-language (code-switched) search queries created by bilingual speakers for evaluating Information Retrieval with mixed-language queries. It provides query versions where English expressions are embedded within native-language structures. This work is derived from The C...

MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Evaluation Use - ELRA EVALUATION
0.00 € submit
0.00 € submit
 MIST Multi-lingual Interoperability in Speech Technology database    
  • Dutch; Flemish
  • English
  • French
  • German

ID: ELRA-S0238

ISLRN: 189-835-264-931-4

In 1996, some 75 Dutch people participated in recording a multi-purpose continuous speech database. Most of them were recruited from the TNO Human Factors Research Institute, where the recordings were made. The main part of the database consisted of Dutch sentences. However, most speakers partici...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
500.00 € submit
 Mixed Speech with Chinese and English Data by Mobile Phone - 1,535 Hours    
  • Chinese
  • English

ID: ELRA-S0457

ISLRN: 451-966-049-653-3

The data is recorded by 3972 Chinese native speakers with accents covering seven major dialect areas. The recorded text is a mixture of Chinese and English sentences, covering general scenes and human-computer interaction scenes. It is rich in content and accurate in transcription. It can be used...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
145825.00 € submit
145825.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
145825.00 € submit
145825.00 € submit

Special offers are also available. Check here for details.

 MLCC Multilingual and Parallel Corpora    
  • Danish
  • Dutch; Flemish
  • English
  • French
  • German
  • Italian
  • Modern Greek (1453-)
  • Portuguese
  • Spanish; Castilian

ID: ELRA-W0023

ISLRN: 963-635-729-341-8

The MLCC text corpus has two main components - one set to allow comparable studies to be carried out in different languages and one set as the basis for translation studies. The first set is referred as the Polylingual Document Collection, a collection of newspaper articles from financial new...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
1600.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
0.00 € submit
3600.00 € submit
 Modern French Corpus including Anaphors Tagging    
  • French

ID: ELRA-W0032

ISLRN: 488-420-763-510-8

The corpus that includes the tagging of the anaphors was created by the CRISTAL-GRESEC (Stendhal-Grenoble 3 University, France) team and XRCE (Xerox Research Centre Europe, France) in the framework of the call launched by the DGLF-LF (national institution for the French language and the languages...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
250.00 € submit
250.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
1000.00 € submit
 Monolingual documents from the Government of Lithuania (Processed)    
  • Lithuanian

ID: ELRA-W0299

ISLRN: 268-109-862-136-1

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Monolingual documents received from the Government of th...

MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Attribution - CC-BY-4.0
0.00 € submit
0.00 € submit
 Monolingual Greek corpus    
  • Modern Greek (1453-)

ID: ELRA-W0014

ISLRN: 546-958-429-693-4

Monolingual Greek corpus of 1 million words. The corpus consists of articles written in 1996 from the Greek daily newspaper ELEFTHEROTIPIA. Each file contains annotated text with SGML mark-up accompanied by a text header.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
360.00 € submit
360.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
600.00 € submit
600.00 € submit
 Monolingual Vietnamese Annotated Corpus    
  • Vietnamese

ID: ELRA-W0310

ISLRN: 004-081-406-421-7

The Monolingual Vietnamese Annotated Corpus consists of 100,000 sentences, manually annotated with word boundaries, POS, named entities, with an average length of 20 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
500.00 € submit
900.00 € submit
Licence: Commercial Use - ELRA VAR
1800.00 € submit
1800.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
1300.00 € submit
Licence: Commercial Use - ELRA VAR
2500.00 € submit
2500.00 € submit

« Previous | Next »