Resource Type:
Corpus: | ![]() |
Lexical/Conceptual: | ![]() |
Tool/Service: | ![]() |
Language Description: | ![]() |
Media Type:
Text: | ![]() |
Audio: | ![]() |
Image: | ![]() |
Video: | ![]() |
Text Numerical: | ![]() |
Text N-Gram: | ![]() |
1685 Language Resources (Page 33 of 85)
« Previous | Next »Order by:


- Italian
ID: ELRA-L0006
ISLRN: 965-829-467-456-4The ILC Italian Morphological Lexicon consists of a set of lemmas/lexical entries (about 60,000) with the corresponding inflected word-forms, and a morphological engine for morphological analysis and generation. Lemmas and word-forms are encoded with grammatical codes compatible with the EAGLES r...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
4000.00 €
![]() |
12000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
12000.00 €
![]() |
12000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
8000.00 €
![]() |
20000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
20000.00 €
![]() |
20000.00 €
![]() |



- Italian
ID: ELRA-S0059
ISLRN: 052-156-999-928-3ILE is a 588,000 entries Italian lexicon transcribed with SAMPA notation. It was generated, mainly for speech recognition purposes, by means of a morphological analyzer handling more than 100,000 morphemes, each of them transcribed and manually checked. Each stem was combined with all its possibl...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
3000.00 €
![]() |
12000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
12000.00 €
![]() |
12000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
6000.00 €
![]() |
18000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
18000.00 €
![]() |
18000.00 €
![]() |



- French
ID: ELRA-S0163
ISLRN: 779-878-863-649-8The ILPho database is a phonetic lexicon which contains 39,000 lemmas (319,318 entries). It is distributed in two formats. The first format is compact and corresponds to an easy extension of the text format in which the Multext lexicons (réf. ELRA-L0010) (Ide et Veronis, 1994) are distributed, by...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
100.00 €
![]() |
2500.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
2500.00 €
![]() |
2500.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
100.00 €
![]() |
2500.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
2500.00 €
![]() |
2500.00 €
![]() |


- Modern Greek (1453-)
ID: ELRA-W0022
ISLRN: 002-552-644-443-1The ILSP/ELEFTHEROTYPIA Corpus contains approximately 3 million words classified and annotated according to the common core PAROLE encoding standard. Thus, each file is classified according to the parameters of Medium, Topic and Genre, and structurally annotated at paragraph level (CES Level 1). ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
850.00 €
![]() |
850.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1275.00 €
![]() |
1275.00 €
![]() |


- English
ID: ELRA-S0456
ISLRN: 001-453-575-915-4Indian English audio data captured by mobile phones, 1,012 hours in total, recorded by 2,100 Indian native speakers. The recorded text is designed by linguistic experts, covering generic, interactive, on-board, home and other categories. The text has been proofread manually with high accuracy; ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
153824.00 €
![]() |
153824.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
153824.00 €
![]() |
153824.00 €
![]() |
Special offers are also available. Check here for details.


- Indonesian
ID: ELRA-S0439
ISLRN: 394-545-170-456-21285 Indonesian native speakers participated in the recording with authentic accent. The recorded script is designed by linguists and cover a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Andro...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
103198.50 €
![]() |
103198.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
103198.50 €
![]() |
103198.50 €
![]() |
Special offers are also available. Check here for details.


- Indonesian
ID: ELRA-S0470
ISLRN: 311-413-414-907-0Indonesia speech data (reading) is collected from 496 Indonesian native speakers and is recorded in quiet environment. The recording is rich in content, covering multiple categories such as econimics, entertainment, news, figure, letter, and oral. Around 400 sentences for each speaker. The valid ...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
57978.50 €
![]() |
57978.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
57978.50 €
![]() |
57978.50 €
![]() |
Special offers are also available. Check here for details.


- Indonesian
ID: ELRA-S0228-115
ISLRN: 238-085-521-885-2This corpus was recorded in a quiet office environment over 4 channels and collected from a total of 200 speakers, including 97 males and 103 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news and da...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
21600.00 €
![]() |
21600.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
21600.00 €
![]() |
21600.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
21600.00 €
![]() |
21600.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
21600.00 €
![]() |
21600.00 €
![]() |


- Catalan; Valencian
- English
- Spanish; Castilian
ID: ELRA-T0094
ISLRN: 723-632-688-733-6Insurance contracts, private and public insurance, resource terminology used within European Union institutions. Cards available: 1000 Languages: Catalan, Spanish, English Format: ASCII Medium: floppy disk Card Description: Each card in this terminological database contains a definition, abb...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | ||
Licence: Commercial Use - ELRA VAR |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | ||
Licence: Commercial Use - ELRA VAR |


- English
- Latvian
ID: ELRA-W0158
ISLRN: 810-722-062-476-6This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. International Agreements have been translated into natio...
MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
![]() |
0.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Attribution, Share Alike - CC-BY-SA-4.0 |
0.00 €
![]() |
0.00 €
![]() |


- Yoruba
ID: ELRA-S0492
ISLRN: 012-405-700-001-6A modern, high-fidelity, multi-speaker, Yorùbá read speech corpus suitable for Speech Synthesis, Automatic Speech Recognition and Computational Linguistics research. The subject matter is drawn from the Broadcast News domain as well as fictional texts, delivering a multi-purpose, contemporary spe...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
11200.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
11200.00 €
![]() |
11200.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
12000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
12000.00 €
![]() |
12000.00 €
![]() |


- English
ID: ELRA-S0083
ISLRN: 723-960-059-948-7Approx. 20 minutes of speech (per speaker) from 23 German and 23 Italian intermediate learners of English. Each speaker recorded sentences from several blocks of differing types (reading simple sentences, using minimal pairs, giving answers to multiple choice questions). The prompts were of varyi...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
500.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
0.00 €
![]() |
1500.00 €
![]() |


- English
ID: ELRA-S0228-108
ISLRN: 554-977-743-197-5This corpus was recorded in a quiet office/home environment over 3 channels and collected from a total of 213 speakers, including 103 males and 110 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news ...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
25200.00 €
![]() |
25200.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
25200.00 €
![]() |
25200.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
25200.00 €
![]() |
25200.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
25200.00 €
![]() |
25200.00 €
![]() |


- Italian
ID: ELRA-S0228-98
ISLRN: 501-616-216-038-6This corpus comprises 19,788 entries uttered by 31 speakers (15 males and 16 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit 44.1kHz for a total of 4.9 hours of speech per channel.
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5400.00 €
![]() |
5400.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
5400.00 €
![]() |
5400.00 €
![]() |


- Italian
ID: ELRA-L0069
ISLRN: 840-625-201-574-7This Italian lexicon is made up of 862,500 inflected forms corresponding to 112,000 simple word lemmas. It contains: - 66,340 nouns, with type, gender, number and inflected forms (including irregular forms) - 12,030 verbs, with mood, tense, person, gender, number and inflected forms (including ir...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
5500.00 €
![]() |
6500.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
8000.00 €
![]() |
8000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
7000.00 €
![]() |
8500.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
10000.00 €
![]() |
10000.00 €
![]() |


- Italian
ID: ELRA-L0070
ISLRN: 565-957-248-233-5This Italian lexicon is the same as the one described in ELRA-L0069, but with the addition of clitic verbs, which increases the number of inflected forms to 1,800,000 (still corresponding to 112,000 simple words lemmas). Half the lexicon is made up of clitic verbs. It contains: - 66,340 nouns, wi...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
6500.00 €
![]() |
8000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
10000.00 €
![]() |
10000.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
8500.00 €
![]() |
10000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
12500.00 €
![]() |
12500.00 €
![]() |


- English
ID: ELRA-S0429
ISLRN: 703-740-233-998-1497 Italians recorded in a relatively quiet environment in authentic English. The recorded script is designed by linguists and covers a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Android an...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
53912.50 €
![]() |
53912.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
53912.50 €
![]() |
53912.50 €
![]() |
Special offers are also available. Check here for details.


- Italian
ID: ELRA-S0147
ISLRN: 458-657-455-735-5The Italian Speech Corpus 1 contains the recordings of 202 native Italian speakers (112 males, 90 females) recorded in an office and a closed public place, over 4 channels, in a range of low to medium background noise environments (Plantronics Audio 10 (computer/desk mic), Shure SM58 (desk mounte...
MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1200.00 €
![]() |
9500.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
9500.00 €
![]() |
9500.00 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER |
1500.00 €
![]() |
15000.00 €
![]() |
Licence: Commercial Use - ELRA VAR |
15000.00 €
![]() |
15000.00 €
![]() |


- Italian
ID: ELRA-S0450
ISLRN: 217-750-727-467-7The data were recorded by 3,109 native Italian speakers with authentic Italian accents. The recorded content covers a wide range of categories such as general purpose, interactive, in car commands, home commands, etc. The recorded text is designed by a language expert, and the text is manually pr...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
342237.50 €
![]() |
342237.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
342237.50 €
![]() |
342237.50 €
![]() |
Special offers are also available. Check here for details.


- Italian
ID: ELRA-S0472
ISLRN: 341-812-724-006-1Italian speech data (reading) is collected from 325 Italian native speakers and is recorded in quiet environment. The recording is rich in content, covering multiple categories such as econimics, entertainment, news, and oral. Each sentence contains 9.2 words in average. Each sentence is repeated...
MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
38807.50 €
![]() |
38807.50 €
![]() |
NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR |
38807.50 €
![]() |
38807.50 €
![]() |
Special offers are also available. Check here for details.