Text (1052)
Audio (679)
Video (23)
True (226)
TEI (10)
TMX (6)

Resource Type:

Corpus:
Lexical/Conceptual:
Tool/Service:
Language Description:

Media Type:

Text:
Audio:
Image:
Video:
Text Numerical:
Text N-Gram:

1681 Language Resources (Page 11 of 85)

« Previous | Next »Order by:

 Bizkaifon (Bizkaieraren Fonoteka)    
  • Basque

ID: ELRA-S0153

ISLRN: 941-344-942-204-3

Bizkaifon contains sound archives and associated information of dialectal varieties of spoken Basque. The database was collected by the Department of Electronics and Telecommunications, University of the Basque Country, with the financial help of the Diputación Foral de Bizkaia. It consists of 21...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
1000.00 € submit
1000.00 € submit
Licence: Commercial Use - ELRA VAR
1000.00 € submit
1000.00 € submit
 BMI Brochures 2011-2015 (Processed)    
  • English
  • German

ID: ELRA-W0200

ISLRN: 886-938-216-393-3

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. English translations of German BMI brochures from the la...

MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
 BMI Brochures and Website 2016 (Processed)    
  • English
  • German

ID: ELRA-W0199

ISLRN: 416-672-686-637-0

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. Bilingual tmx file of German to English translations of ...

MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
 BMVI Publications (Processed)    
  • English
  • German

ID: ELRA-W0197

ISLRN: 492-102-548-814-7

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. TMX file with 11555 TUs, bilingual German/English, publi...

MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
 BMVI Website (Processed)    
  • English
  • German

ID: ELRA-W0198

ISLRN: 391-726-618-848-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. tmx file, 2718 TUs, bilingual German/English, texts from...

MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
NON MEMBERacademiccommercial
Licence: Other - Open Under-PSI
0.00 € submit
0.00 € submit
 BrasiLEX Brazilian Portuguese lexicon    
  • Portuguese

ID: ELRA-L0034

ISLRN: 654-505-941-943-8

BrasiLEX is a multifunctional monolingual lexicon of the Brazilian variety of Portuguese, developed by the Natural Language Group of INESC. It has about 65,000 entries (lemmas) and 1,600 correspondent inflexion paradigms. The set of entries includes compound words and the inflexion paradigms incl...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3000.00 € submit
25000.00 € submit
Licence: Commercial Use - ELRA VAR
25000.00 € submit
25000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5000.00 € submit
30000.00 € submit
Licence: Commercial Use - ELRA VAR
30000.00 € submit
30000.00 € submit

This resource is also available in a bundle. Check here for bundled pricing.

 Brazilian Portuguese Speech Data by Mobile Phone - 1,044 Hours    
  • Portuguese

ID: ELRA-S0445

ISLRN: 767-329-448-534-2

The data volumn is 1044 hours and is recorded by 2038 Brazilian native speakers. The recording text is designed by linguistic experts, which covers general interactive, in-car and home category. The texts are manually proofread with high accuracy. Recording devices are mainstream Android phones a...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
247950.00 € submit
247950.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
247950.00 € submit
247950.00 € submit

Special offers are also available. Check here for details.

 Brazilian Portuguese Speech Recognition Corpus (Desktop)    
  • Portuguese

ID: ELRA-S0228-74

ISLRN: 403-396-918-176-7

This corpus comprises 99,804 entries uttered by 50 speakers (25 males and 25 females), recorded over 4 channels (desktop in quiet office/home). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 37.3 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
 BREF-120 - A large corpus of French read speech    
  • French

ID: ELRA-S0067

ISLRN: 843-228-642-422-1

BREF-120 resulted from the efforts of LIMSI-CNRS researchers under sponsorship from the GDR-PRC CHM, the ACCT (OFIL), the EEC (ESPRIT Polyglot project), and the Aupelf-Uref. A sub-set of BREF-120 is BREF-80 (ELRA-S0006), which consists of about 50-60 sentences per speaker and recordings conducted...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
2500.00 € submit
10000.00 € submit
Licence: Commercial Use - ELRA VAR
10000.00 € submit
10000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4000.00 € submit
15000.00 € submit
Licence: Commercial Use - ELRA VAR
15000.00 € submit
15000.00 € submit
 BREF-80    
  • French

ID: ELRA-S0006

ISLRN: 310-036-258-354-7

The BREF corpus was designed to provide enough read speech data for the development and evaluation of continuous speech recognition systems (both speaker-dependent and speaker-independent), and to provide a large corpus of continuous speech for the acquisition of acoustic-phonetic knowledge of sp...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
400.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
 BREF-POLYGLOT    
  • French

ID: ELRA-S0007

ISLRN: 382-431-956-363-1

The BREF-Polyglot is a sub-corpus of the BREF corpus (1 ISO9660 CDROM); it contains speaker-dependent training data from 6 speakers. There are a total of 3193 sentences (2 signal files for each sentence), on average 530 per speaker. While this data represents only a small portion of the entire BR...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
400.00 € submit
3000.00 € submit
Licence: Commercial Use - ELRA VAR
3000.00 € submit
3000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
800.00 € submit
6000.00 € submit
Licence: Commercial Use - ELRA VAR
6000.00 € submit
6000.00 € submit
 British Children Speech Data by Microphone - 55 Hours    
  • English

ID: ELRA-S0474

ISLRN: 604-288-560-387-1

It collects 201 British children. The recordings are mainly children textbooks, storybooks. The average sentence length is 4.68 words and the average sentence repetition rate is 6.6 times. This data is recorded by high fidelity microphone. The text is manually transcribed with high accuracy. ...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
39187.50 € submit
39187.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
39187.50 € submit
39187.50 € submit

Special offers are also available. Check here for details.

 British English Kids Speech Recognition Corpus (Desktop)    
  • English

ID: ELRA-S0228-96

ISLRN: 732-482-893-782-4

This corpus comprises 19,196 entries uttered by 30 speakers (15 males and 15 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 3.65 hours of speech per channel.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5400.00 € submit
5400.00 € submit
Licence: Commercial Use - ELRA VAR
5400.00 € submit
5400.00 € submit
 British English Source Lexicon (BESL) version 2.2    
  • English

ID: ELRA-L0058

ISLRN: 875-872-158-794-8

BESL is a complete database of the English lexicon. It consists of over 230,000 lemmas, over 350,000 word forms, 60,000 proper nouns, 3,000 abbreviations, and 58,000 multi-word compound nouns. Each headword is provided with a full listing of all inflected forms and other morphological variation. ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
7000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
10000.00 € submit
 British English Speech Data by Mobile Phone - 831 Hours    
  • English

ID: ELRA-S0448

ISLRN: 542-952-231-001-2

831 Hours–Mobile Telephony British English Speech Data, which is recorded by 1651 native British speakers. The recording contents cover many categories such as generic, interactive, in-car and smart home. The texts are manually proofreaded to ensure a high accuracy rate. The database matchs the A...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
213151.50 € submit
213151.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
213151.50 € submit
213151.50 € submit

Special offers are also available. Check here for details.

 British English Speech Data by Mobile Phone_Reading - 199 Hours    
  • English

ID: ELRA-S0466

ISLRN: 825-851-392-960-9

The data set contains 346 British English speakers' speech data, all of whom are English locals. Around 392 sentences of each speaker. The valid data is 199 hours. Recording environment is quiet. Recording contents contain various categories like economics, news, entertainment, commonly used spok...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
47262.50 € submit
47262.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
47262.50 € submit
47262.50 € submit

Special offers are also available. Check here for details.

 British-English SpeechDat-Car    
  • English

ID: ELRA-S0131

ISLRN: 804-196-753-996-4

The British English SpeechDat-Car database contains the recordings of 300 British English speakers from 6 different regions (170 males, 130 females), recorded over the GSM telephone network, in a car. This database is partitioned into 115 CDs (DVDs are also available). The speech data files are ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
90000.00 € submit
90000.00 € submit
Licence: Commercial Use - ELRA VAR
90000.00 € submit
90000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
120000.00 € submit
120000.00 € submit
Licence: Commercial Use - ELRA VAR
120000.00 € submit
120000.00 € submit
 British English SpeechDat(II) FDB-4000    
  • English

ID: ELRA-S0097

ISLRN: 575-262-304-348-7

The British English SpeechDat(II) FDB-4000 database contains the recordings of 4,000 British English speakers (1,968 males, 2,032 females) recorded over the British fixed telephone network. This database is partitioned into 20 CDs. Speech samples are stored as sequences of 8-bit 8 kHz A-law. Eac...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
35000.00 € submit
45000.00 € submit
Licence: Commercial Use - ELRA VAR
45000.00 € submit
45000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
45000.00 € submit
55000.00 € submit
Licence: Commercial Use - ELRA VAR
55000.00 € submit
55000.00 € submit
 British English SpeechDat(II) MDB-1000    
  • English

ID: ELRA-S0074

ISLRN: 424-526-381-046-7

The British English SpeechDat(II) MDB-1000 database contains the recordings of 1,000 British speakers recorded over the GSM digital mobile network. The MDB-1000 database is partitioned into 5 CDs in ISO 9660 format. Speech samples are stored as sequences of 8-bit 8 kHz A-law. Each prompted utter...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
10000.00 € submit
19000.00 € submit
Licence: Commercial Use - ELRA VAR
19000.00 € submit
19000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
12000.00 € submit
24000.00 € submit
Licence: Commercial Use - ELRA VAR
24000.00 € submit
24000.00 € submit
 British English SpeechDat(II) SDB-2400    
  • English

ID: ELRA-S0098

ISLRN: 007-575-120-102-1

The British English SpeechDat(II) SDB-2400 database is designed for development and assessment of speaker verification and identification systems. It contains the recordings of 120 speakers who uttered 22 items 20 times, and was collected over the fixed and mobile telephone networks in quiet and ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
32000.00 € submit
39000.00 € submit
Licence: Commercial Use - ELRA VAR
39000.00 € submit
39000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
39000.00 € submit
47000.00 € submit
Licence: Commercial Use - ELRA VAR
47000.00 € submit
47000.00 € submit

« Previous | Next »