991 Language Resources (Page 6 of 50)


- English
ID: ELRA-S0228-78
ISLRN: 040-245-794-542-7
This corpus comprises 50,858 entries uttered by 51 speakers (28 males and 23 females), recorded over 2 channels (desktop in quiet office/home). Speech samples are stored as 16-bit samples at 48 kHz, for a total of 29.7 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5400.00 € | 5400.00 € |
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5400.00 € | 5400.00 € |
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 € |


- English
ID: ELRA-S0228-111
ISLRN: 920-976-101-187-7
This corpus was recorded in a quiet office/home environment over 3 channels and collected from a total of 302 speakers, including 149 males and 153 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts come from news and tweets. Spee...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 32400.00 € | 32400.00 € |
Licence: Commercial Use - ELRA VAR | 32400.00 € | 32400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 32400.00 € | 32400.00 € |
Licence: Commercial Use - ELRA VAR | 32400.00 € | 32400.00 € |


- English
ID: ELRA-S0228-101
ISLRN: 535-041-750-483-4
This corpus comprises 63,495 entries uttered by 54 speakers (27 males and 27 females), recorded over 3 channels (mobile in noisy café/restaurant/street). Speech samples are stored as 16-bit samples at 16 kHz, for a total of 22.3 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5400.00 € | 5400.00 € |
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5400.00 € | 5400.00 € |
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 € |


- Bulgarian
ID: ELRA-L0075
ISLRN: 450-247-052-039-5
This database contains 81,647 entries in Bulgarian with a linguistic environment tool (for Windows XP). The data may be used for morphological analysis and synthesis, syntactic agreement checking, and phonetic stress determination. Structure of entries: Local linguistic variant File format: MS ACCESS ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2000.00 € | 10000.00 € |
Licence: Commercial Use - ELRA VAR | 10000.00 € | 10000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 4000.00 € | 16000.00 € |
Licence: Commercial Use - ELRA VAR | 16000.00 € | 16000.00 € |


- Bulgarian
ID: ELRA-L0030
ISLRN: 611-552-122-892-7
This dictionary contains 67,500 entries divided into 242 inflectional types (including proper nouns), morphosyntactic information for each entry, and a morphological engine (MS-DOS and Windows 95/NT) for morphological analysis and generation. The data may be used for morphological analysis and syn...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 45.00 € | 6000.00 € |
Licence: Commercial Use - ELRA VAR | 6000.00 € | 6000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 100.00 € | 12000.00 € |
Licence: Commercial Use - ELRA VAR | 12000.00 € | 12000.00 € |


- Bulgarian
- English
ID: ELRA-M0041
ISLRN: 941-120-951-927-7
The Bulgarian WordNet is a network of lexical-semantic relations, an electronic thesaurus with a structure modelled on that of the Princeton WordNet and those constructed in the EuroWordNet and BalkaNet projects. Bulgarian WordNet describes the meaning of a lexical unit by placing it within a network ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 300.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 4500.00 € | 4500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 600.00 € | 6000.00 € |
Licence: Commercial Use - ELRA VAR | 9000.00 € | 9000.00 € |


- English
ID: ELRA-S0228-85
ISLRN: 942-019-580-826-2
This corpus comprises 6,976 entries uttered by 150 speakers (80 males and 70 females), recorded over 4 channels (desktop in quiet office). Speech samples are stored as 16-bit samples at 48 kHz, for a total of 3.86 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 4050.00 € | 4050.00 € |
Licence: Commercial Use - ELRA VAR | 4050.00 € | 4050.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 4050.00 € | 4050.00 € |
Licence: Commercial Use - ELRA VAR | 4050.00 € | 4050.00 € |


- English
ID: ELRA-S0228-89
ISLRN: 836-335-444-460-7
This corpus comprises 2,250 entries uttered by 150 speakers (106 males and 44 females), recorded over the telephone network. Speech samples are stored as 16-bit samples at 8 kHz, for a total of 2.83 hours of speech.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3240.00 € | 3240.00 € |
Licence: Commercial Use - ELRA VAR | 3240.00 € | 3240.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3240.00 € | 3240.00 € |
Licence: Commercial Use - ELRA VAR | 3240.00 € | 3240.00 € |


- English
ID: ELRA-S0228-90
ISLRN: 229-685-009-012-2
This corpus comprises 2,400 entries uttered by 150 speakers (106 males and 44 females), recorded over the telephone network. Speech samples are stored as 16-bit samples at 8 kHz, for a total of 2.24 hours of speech.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3240.00 € | 3240.00 € |
Licence: Commercial Use - ELRA VAR | 3240.00 € | 3240.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3240.00 € | 3240.00 € |
Licence: Commercial Use - ELRA VAR | 3240.00 € | 3240.00 € |


- English
ID: ELRA-S0228-87
ISLRN: 668-176-572-368-0
This corpus comprises 1,500 entries uttered by 150 speakers (106 males and 44 females), recorded over the telephone network. Speech samples are stored as 16-bit samples at 8 kHz, for a total of 2.09 hours of speech.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3240.00 € | 3240.00 € |
Licence: Commercial Use - ELRA VAR | 3240.00 € | 3240.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3240.00 € | 3240.00 € |
Licence: Commercial Use - ELRA VAR | 3240.00 € | 3240.00 € |


- English
ID: ELRA-S0228-88
ISLRN: 616-328-968-271-5
This corpus comprises 1,500 entries uttered by 150 speakers (106 males and 44 females), recorded over the telephone network. Speech samples are stored as 16-bit samples at 8 kHz, for a total of 3.6 hours of speech.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3240.00 € | 3240.00 € |
Licence: Commercial Use - ELRA VAR | 3240.00 € | 3240.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3240.00 € | 3240.00 € |
Licence: Commercial Use - ELRA VAR | 3240.00 € | 3240.00 € |


- French
ID: ELRA-S0228-72
ISLRN: 360-129-212-036-3
This corpus comprises 75,147 entries uttered by 50 speakers (25 males and 25 females), recorded over 3 channels (mobile in quiet office). Speech samples are stored as 16-bit samples at 16 kHz, for a total of 25.67 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5400.00 € | 5400.00 € |
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5400.00 € | 5400.00 € |
Licence: Commercial Use - ELRA VAR | 5400.00 € | 5400.00 € |


- Chinese
ID: ELRA-L0101
ISLRN: 634-690-317-631-5
This database is not only comprehensive but also linguistically accurate. It is based on solid principles of Cantonese phonology and semantics, and takes into account the phenomena of polyphony as well as tone change, which is unpredictable and requires manual proofreading. It covers 300,000 entr...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 9000.00 € | 15000.00 € |
Licence: Commercial Use - ELRA VAR | 18000.00 € | 30000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 11250.00 € | 18750.00 € |
Licence: Commercial Use - ELRA VAR | 22500.00 € | 37500.00 € |


- Chinese
ID: ELRA-S0287
ISLRN: 537-563-219-913-3
The Cantonese Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 550 adult Cantonese speakers (273 males, 277 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place), for a total of ca. 213 hours of ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 50000.00 € | 67000.00 € |
Licence: Commercial Use - ELRA VAR | 67000.00 € | 67000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 60000.00 € | 75000.00 € |
Licence: Commercial Use - ELRA VAR | 75000.00 € | 75000.00 € |


- Dutch; Flemish
- English
- Finnish
ID: ELRA-S0410
ISLRN: 072-357-063-759-1
CAREGIVER, a multilingual speech corpus for modeling language acquisition, was designed and recorded within the framework of the EU-funded Acquisition of Communication and Recognition Skills (ACORNS) project. The motivation behind the corpus and its design relies on current knowle...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 0.00 € |


- Catalan; Valencian
ID: ELRA-W0047
ISLRN: 000-089-517-382-8
The Catalan Corpus of News Articles comprises articles in Catalan from 1 January 1999 to 31 March 2007. The articles are grouped by trimester and are not in chronological order within each trimester. The DVD contains one folder per year. Each folder has been divided into subfolders, containing the archives per tri...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2975.00 € | 14855.00 € |
Licence: Commercial Use - ELRA VAR | 14855.00 € | 14855.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3930.00 € | 19315.00 € |
Licence: Commercial Use - ELRA VAR | 19315.00 € | 19315.00 € |


- Catalan; Valencian
- Spanish; Castilian
ID: ELRA-W0053
ISLRN: 124-613-721-890-1
This corpus contains more than 100 million words from 10 years of bilingual articles from “El Periódico de Catalunya”. The two language versions are rather close, as the Catalan text is a translation of the Spanish one, partly achieved by means of machine translation and then post-edited. The...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 2000.00 € | 20000.00 € |
Licence: Commercial Use - ELRA VAR | 20000.00 € | 20000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3000.00 € | 24000.00 € |
Licence: Commercial Use - ELRA VAR | 24000.00 € | 24000.00 € |


- Catalan; Valencian
ID: ELRA-S0326
ISLRN: 532-758-322-989-7
The Catalan SpeechDat-Car database contains the in-car recordings of 300 speakers who uttered around 120 read and spontaneous items. Each speaker recorded two sessions. Recordings have been made through 4 different channels, via in-car microphones (1 close-talk microphone, 3 far-talk microph...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 2000.00 € |
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 4000.00 € |
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 € |


- Catalan; Valencian
ID: ELRA-S0324
ISLRN: 829-350-109-825-6
This speech database contains the recordings of 2000 Catalan speakers who called from fixed telephones and were recorded over the fixed PSTN using an ISDN-BRI interface. Each speaker uttered around 50 read and spontaneous items. The speech database follows the specifications made within the S...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 2000.00 € |
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 4000.00 € |
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 € |


- Catalan; Valencian
ID: ELRA-S0325
ISLRN: 241-541-350-834-7
This speech database contains the recordings of 2000 Catalan speakers who called from GSM telephones and were recorded over the fixed PSTN using an ISDN-BRI interface. Each speaker uttered around 50 read and spontaneous items. The speech database follows the specifications made within the Spe...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 2000.00 € |
Licence: Commercial Use - ELRA VAR | 2000.00 € | 2000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 4000.00 € |
Licence: Commercial Use - ELRA VAR | 4000.00 € | 4000.00 € |