1685 Language Resources (Page 33 of 85)


 ILC Italian Morphological Lexicon    
  • Italian

ID: ELRA-L0006

ISLRN: 965-829-467-456-4

The ILC Italian Morphological Lexicon consists of a set of lemmas/lexical entries (about 60,000) with the corresponding inflected word-forms, and a morphological engine for morphological analysis and generation. Lemmas and word-forms are encoded with grammatical codes compatible with the EAGLES r...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 4000.00 €
commercial: 12000.00 €
Licence: Commercial Use - ELRA VAR
academic: 12000.00 €
commercial: 12000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 8000.00 €
commercial: 20000.00 €
Licence: Commercial Use - ELRA VAR
academic: 20000.00 €
commercial: 20000.00 €
 ILE: Italian LExicon      
  • Italian

ID: ELRA-S0059

ISLRN: 052-156-999-928-3

ILE is a 588,000-entry Italian lexicon transcribed with SAMPA notation. It was generated, mainly for speech recognition purposes, by means of a morphological analyzer handling more than 100,000 morphemes, each of them transcribed and manually checked. Each stem was combined with all its possibl...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 3000.00 €
commercial: 12000.00 €
Licence: Commercial Use - ELRA VAR
academic: 12000.00 €
commercial: 12000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 6000.00 €
commercial: 18000.00 €
Licence: Commercial Use - ELRA VAR
academic: 18000.00 €
commercial: 18000.00 €
 ILPho phonetic lexicon      
  • French

ID: ELRA-S0163

ISLRN: 779-878-863-649-8

The ILPho database is a phonetic lexicon which contains 39,000 lemmas (319,318 entries). It is distributed in two formats. The first format is compact and corresponds to an easy extension of the text format in which the Multext lexicons (ref. ELRA-L0010) (Ide and Veronis, 1994) are distributed, by...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 100.00 €
commercial: 2500.00 €
Licence: Commercial Use - ELRA VAR
academic: 2500.00 €
commercial: 2500.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 100.00 €
commercial: 2500.00 €
Licence: Commercial Use - ELRA VAR
academic: 2500.00 €
commercial: 2500.00 €
 ILSP/ELEFTHEROTYPIA Corpus (Greek corpus)    
  • Modern Greek (1453-)

ID: ELRA-W0022

ISLRN: 002-552-644-443-1

The ILSP/ELEFTHEROTYPIA Corpus contains approximately 3 million words classified and annotated according to the common core PAROLE encoding standard. Thus, each file is classified according to the parameters of Medium, Topic and Genre, and structurally annotated at paragraph level (CES Level 1). ...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 850.00 €
commercial: 850.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 1275.00 €
commercial: 1275.00 €
 Indian English Speech Data by Mobile Phone - 1,012 Hours    
  • English

ID: ELRA-S0456

ISLRN: 001-453-575-915-4

Indian English audio data captured by mobile phones, 1,012 hours in total, recorded by 2,100 Indian native speakers. The recorded text is designed by linguistic experts, covering generic, interactive, on-board, home and other categories. The text has been proofread manually with high accuracy; ...

MEMBER
Licence: Commercial Use - ELRA VAR
academic: 153824.00 €
commercial: 153824.00 €
NON MEMBER
Licence: Commercial Use - ELRA VAR
academic: 153824.00 €
commercial: 153824.00 €

Special offers are also available. Check here for details.

 Indonesian Speech Data by Mobile Phone - 639 Hours    
  • Indonesian

ID: ELRA-S0439

ISLRN: 394-545-170-456-2

1285 Indonesian native speakers with authentic accents participated in the recording. The recorded script is designed by linguists and covers a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Andro...

MEMBER
Licence: Commercial Use - ELRA VAR
academic: 103198.50 €
commercial: 103198.50 €
NON MEMBER
Licence: Commercial Use - ELRA VAR
academic: 103198.50 €
commercial: 103198.50 €

Special offers are also available. Check here for details.

 Indonesian Speech Data by Mobile Phone_R - 359 Hours    
  • Indonesian

ID: ELRA-S0470

ISLRN: 311-413-414-907-0

Indonesian speech data (reading) is collected from 496 Indonesian native speakers and is recorded in a quiet environment. The recording is rich in content, covering multiple categories such as economics, entertainment, news, figure, letter, and oral. Around 400 sentences were recorded for each speaker. The valid ...

MEMBER
Licence: Commercial Use - ELRA VAR
academic: 57978.50 €
commercial: 57978.50 €
NON MEMBER
Licence: Commercial Use - ELRA VAR
academic: 57978.50 €
commercial: 57978.50 €

Special offers are also available. Check here for details.

 Indonesian Speech Recognition Corpus (Desktop)    
  • Indonesian

ID: ELRA-S0228-115

ISLRN: 238-085-521-885-2

This corpus was recorded in a quiet office environment over 4 channels and collected from a total of 200 speakers, including 97 males and 103 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news and da...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 21600.00 €
commercial: 21600.00 €
Licence: Commercial Use - ELRA VAR
academic: 21600.00 €
commercial: 21600.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 21600.00 €
commercial: 21600.00 €
Licence: Commercial Use - ELRA VAR
academic: 21600.00 €
commercial: 21600.00 €
 Insurance (Termcat)    
  • Catalan; Valencian
  • English
  • Spanish; Castilian

ID: ELRA-T0094

ISLRN: 723-632-688-733-6

Insurance contracts, private and public insurance, resource terminology used within European Union institutions.
Cards available: 1000
Languages: Catalan, Spanish, English
Format: ASCII
Medium: floppy disk
Card Description: Each card in this terminological database contains a definition, abb...

MEMBER
Licence: Non Commercial Use - ELRA END USER
Licence: Commercial Use - ELRA VAR
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
Licence: Commercial Use - ELRA VAR
 International Agreements (Processed)    
  • English
  • Latvian

ID: ELRA-W0158

ISLRN: 810-722-062-476-6

This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. International Agreements have been translated into natio...

MEMBER
Licence: Attribution, Share Alike - CC-BY-SA-4.0
academic: 0.00 €
commercial: 0.00 €
NON MEMBER
Licence: Attribution, Share Alike - CC-BY-SA-4.0
academic: 0.00 €
commercial: 0.00 €
 ÌròyìnSpeech    
  • Yoruba

ID: ELRA-S0492

ISLRN: 012-405-700-001-6

A modern, high-fidelity, multi-speaker, Yorùbá read speech corpus suitable for Speech Synthesis, Automatic Speech Recognition and Computational Linguistics research. The subject matter is drawn from the Broadcast News domain as well as fictional texts, delivering a multi-purpose, contemporary spe...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 0.00 €
commercial: 11200.00 €
Licence: Commercial Use - ELRA VAR
academic: 11200.00 €
commercial: 11200.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 0.00 €
commercial: 12000.00 €
Licence: Commercial Use - ELRA VAR
academic: 12000.00 €
commercial: 12000.00 €
 ISLE Speech Corpus    
  • English

ID: ELRA-S0083

ISLRN: 723-960-059-948-7

Approx. 20 minutes of speech (per speaker) from 23 German and 23 Italian intermediate learners of English. Each speaker recorded sentences from several blocks of differing types (reading simple sentences, using minimal pairs, giving answers to multiple choice questions). The prompts were of varyi...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 0.00 €
commercial: 500.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 0.00 €
commercial: 1500.00 €
 Italian English Speech Recognition Corpus (Mobile)    
  • English

ID: ELRA-S0228-108

ISLRN: 554-977-743-197-5

This corpus was recorded in a quiet office/home environment over 3 channels and collected from a total of 213 speakers, including 103 males and 110 females, all of whom have been carefully screened to ensure their standard and clear pronunciation. The audio scripts cover information such as news ...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 25200.00 €
commercial: 25200.00 €
Licence: Commercial Use - ELRA VAR
academic: 25200.00 €
commercial: 25200.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 25200.00 €
commercial: 25200.00 €
Licence: Commercial Use - ELRA VAR
academic: 25200.00 €
commercial: 25200.00 €
 Italian Kids Speech Recognition Corpus (Desktop)    
  • Italian

ID: ELRA-S0228-98

ISLRN: 501-616-216-038-6

This corpus comprises 19,788 entries uttered by 31 speakers (15 males and 16 females), recorded over 2 channels (desktop in quiet office). Speech samples are stored as a sequence of 16-bit samples at 44.1 kHz, for a total of 4.9 hours of speech per channel.

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 5400.00 €
commercial: 5400.00 €
Licence: Commercial Use - ELRA VAR
academic: 5400.00 €
commercial: 5400.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 5400.00 €
commercial: 5400.00 €
Licence: Commercial Use - ELRA VAR
academic: 5400.00 €
commercial: 5400.00 €
 Italian lexicon with morphological information    
  • Italian

ID: ELRA-L0069

ISLRN: 840-625-201-574-7

This Italian lexicon is made up of 862,500 inflected forms corresponding to 112,000 simple word lemmas. It contains: - 66,340 nouns, with type, gender, number and inflected forms (including irregular forms) - 12,030 verbs, with mood, tense, person, gender, number and inflected forms (including ir...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 5500.00 €
commercial: 6500.00 €
Licence: Commercial Use - ELRA VAR
academic: 8000.00 €
commercial: 8000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 7000.00 €
commercial: 8500.00 €
Licence: Commercial Use - ELRA VAR
academic: 10000.00 €
commercial: 10000.00 €
 Italian lexicon with morphological information and clitic verbs    
  • Italian

ID: ELRA-L0070

ISLRN: 565-957-248-233-5

This Italian lexicon is the same as the one described in ELRA-L0069, but with the addition of clitic verbs, which increases the number of inflected forms to 1,800,000 (still corresponding to 112,000 simple word lemmas). Half the lexicon is made up of clitic verbs. It contains: - 66,340 nouns, wi...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 6500.00 €
commercial: 8000.00 €
Licence: Commercial Use - ELRA VAR
academic: 10000.00 €
commercial: 10000.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 8500.00 €
commercial: 10000.00 €
Licence: Commercial Use - ELRA VAR
academic: 12500.00 €
commercial: 12500.00 €
 Italian Speaking English Speech Data by Mobile Phone - 227 Hours    
  • English

ID: ELRA-S0429

ISLRN: 703-740-233-998-1

497 Italian speakers were recorded speaking English in a relatively quiet environment. The recorded script is designed by linguists and covers a wide range of topics including generic, interactive, on-board and home. The text is manually proofread with high accuracy. It matches with mainstream Android an...

MEMBER
Licence: Commercial Use - ELRA VAR
academic: 53912.50 €
commercial: 53912.50 €
NON MEMBER
Licence: Commercial Use - ELRA VAR
academic: 53912.50 €
commercial: 53912.50 €

Special offers are also available. Check here for details.

 Italian Speech Corpus 1 (Appen)    
  • Italian

ID: ELRA-S0147

ISLRN: 458-657-455-735-5

The Italian Speech Corpus 1 contains the recordings of 202 native Italian speakers (112 males, 90 females) recorded in an office and a closed public place, over 4 channels, in a range of low to medium background noise environments (Plantronics Audio 10 (computer/desk mic), Shure SM58 (desk mounte...

MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 1200.00 €
commercial: 9500.00 €
Licence: Commercial Use - ELRA VAR
academic: 9500.00 €
commercial: 9500.00 €
NON MEMBER
Licence: Non Commercial Use - ELRA END USER
academic: 1500.00 €
commercial: 15000.00 €
Licence: Commercial Use - ELRA VAR
academic: 15000.00 €
commercial: 15000.00 €
 Italian Speech Data by Mobile Phone - 1,441 Hours    
  • Italian

ID: ELRA-S0450

ISLRN: 217-750-727-467-7

The data were recorded by 3,109 native Italian speakers with authentic Italian accents. The recorded content covers a wide range of categories such as general purpose, interactive, in car commands, home commands, etc. The recorded text is designed by a language expert, and the text is manually pr...

MEMBER
Licence: Commercial Use - ELRA VAR
academic: 342237.50 €
commercial: 342237.50 €
NON MEMBER
Licence: Commercial Use - ELRA VAR
academic: 342237.50 €
commercial: 342237.50 €

Special offers are also available. Check here for details.

 Italian Speech Data by Mobile Phone_Reading - 215 Hours    
  • Italian

ID: ELRA-S0472

ISLRN: 341-812-724-006-1

Italian speech data (reading) is collected from 325 Italian native speakers and is recorded in a quiet environment. The recording is rich in content, covering multiple categories such as economics, entertainment, news, and oral. Each sentence contains 9.2 words on average. Each sentence is repeated...

MEMBER
Licence: Commercial Use - ELRA VAR
academic: 38807.50 €
commercial: 38807.50 €
NON MEMBER
Licence: Commercial Use - ELRA VAR
academic: 38807.50 €
commercial: 38807.50 €

Special offers are also available. Check here for details.
