1681 Language Resources (Page 15 of 85)


- Chinese
ID: ELRA-L0107
ISLRN: 500-068-723-953-8
A comprehensive monolingual lexical database of Chinese consisting of Simplified and Traditional Chinese modules, covering general vocabulary and important technical terms. Each entry is accompanied by various attributes, such as phonological, grammatical, and morphological information, as well a...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 4500.00 € | 7500.00 € |
Licence: Commercial Use - ELRA VAR | 9000.00 € | 15000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5625.00 € | 9375.00 € |
Licence: Commercial Use - ELRA VAR | 11250.00 € | 18750.00 € |


- Chinese
ID: ELRA-S0398
ISLRN: 353-548-770-894-7
This database contains the recordings of 500 Chinese Mandarin speakers from Northern China (250 males and 250 females), from 18 to 60 years old, recorded in quiet studios located in Shenzhen and in Hong Kong Special Administrative Region, People’s Republic of China. Demographics of native sp...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5200.00 € | 7400.00 € |
Licence: Commercial Use - ELRA VAR | 7400.00 € | 7400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 7400.00 € | 7400.00 € |
Licence: Commercial Use - ELRA VAR | 7400.00 € | 7400.00 € |


- Chinese
ID: ELRA-S0397
ISLRN: 503-886-852-083-2
This database contains the recordings of 1000 Chinese Mandarin speakers from Southern China (500 males and 500 females), from 18 to 60 years old, recorded in quiet studios located in Shenzhen and in Hong Kong Special Administrative Region, People’s Republic of China. Demographics of native s...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 10400.00 € | 14800.00 € |
Licence: Commercial Use - ELRA VAR | 14800.00 € | 14800.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 14800.00 € | 14800.00 € |
Licence: Commercial Use - ELRA VAR | 14800.00 € | 14800.00 € |


- Chinese
ID: ELRA-L0108
ISLRN: 279-636-746-963-2
This is a comprehensive database of Chinese derivative affixes with adjacency attributes.

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 5100.00 € | 8500.00 € |
Licence: Commercial Use - ELRA VAR | 10200.00 € | 17000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 6375.00 € | 10625.00 € |
Licence: Commercial Use - ELRA VAR | 12750.00 € | 21250.00 € |


- Chinese
ID: ELRA-L0102
ISLRN: 968-547-869-011-3
A large-scale database of Chinese pinyin readings. Especially noteworthy are the differences in pronunciation between Taiwan and the PRC, for example 期待 qí dài (Taiwan) and qī dài (PRC).

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 3450.00 € | 5750.00 € |
Licence: Commercial Use - ELRA VAR | 6900.00 € | 11500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 4313.00 € | 7188.00 € |
Licence: Commercial Use - ELRA VAR | 8625.00 € | 14375.00 € |


- English
ID: ELRA-S0434
ISLRN: 688-460-788-757-1
1,279 Chinese speakers from major dialect regions participated in the recording. It is in line with the specific accent of Chinese English speakers. The recorded scripts cover many categories such as spoken English, speech, and human-computer interaction, rich in content, extensive in fields, and ...

MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 47690.00 € | 47690.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 47690.00 € | 47690.00 € |
Special offers are also available. Check here for details.


- English
ID: ELRA-S0483
ISLRN: 724-148-936-774-6
This dataset contains 100,000 colloquial English sentences recorded by 3,691 Chinese speakers from many domestic dialect zones, such as Jiangsu, Shandong, Beijing, and Henan, and reflects the specific accent of Chinese speakers of English. The recording texts contain commonly used sentences with rich contents, broad fi...

MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 33801.00 € | 33801.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Commercial Use - ELRA VAR | 33801.00 € | 33801.00 € |
Special offers are also available. Check here for details.


- Chinese
- Vietnamese
ID: ELRA-M0080
ISLRN: 120-577-487-890-2
The Chinese-Vietnamese Dictionary consists of 52,470 entries containing the following information: phonetics (using IPA), morphology, grammar, semantics, pragmatics and examples. The dictionary is provided in XML format.

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 200.00 € | 400.00 € |
Licence: Commercial Use - ELRA VAR | 900.00 € | 900.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 300.00 € | 600.00 € |
Licence: Commercial Use - ELRA VAR | 1350.00 € | 1350.00 € |


- Chinese
- Vietnamese
ID: ELRA-W0312
ISLRN: 128-772-037-486-0
The Chinese-Vietnamese Parallel Corpus consists of 200,000 sentence pairs, with an average length of 15 words per sentence. The corpus is provided in XML format and is annotated according to TEI-encoding guidelines.

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 200.00 € | 400.00 € |
Licence: Commercial Use - ELRA VAR | 1400.00 € | 1400.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 300.00 € | 600.00 € |
Licence: Commercial Use - ELRA VAR | 2100.00 € | 2100.00 € |


- Chinese
- Vietnamese
ID: ELRA-S0485
ISLRN: 428-557-564-826-7
Chinese-Vietnamese - PhraseBank with audio files of daily conversations spoken by native speakers containing 4002 sentence pairs. Scripts with Pinyin, Topic, Cat, Vietnamese translation with corresponding audio in Chinese and Vietnamese. Corpus in XML and WAV formats.

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 400.00 € | 500.00 € |
Licence: Commercial Use - ELRA VAR | 900.00 € | 900.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 600.00 € | 750.00 € |
Licence: Commercial Use - ELRA VAR | 1350.00 € | 1350.00 € |


- Portuguese
ID: ELRA-W0062
ISLRN: 368-672-631-502-0
The CINTIL-DeepBank (Branco et al., 2010) is a corpus of sentences annotated with their full-fledged deep grammatical representations, composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), and novels (399 sentences; 3,082...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |


- Portuguese
ID: ELRA-W0061
ISLRN: 133-035-138-613-6
The CINTIL-DependencyBank (Silva and Branco, 2012) is a corpus of sentences annotated with their syntactic dependency graphs and grammatical function tags composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), novels (399 ...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |


- Portuguese
ID: ELRA-W0056
ISLRN: 723-486-478-286-6
The CINTIL-PropBank is a corpus of sentences annotated with their constituency structure and semantic role tags, composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), and novels (399 sentences; 3,082 tokens). In addition,...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |


- Portuguese
ID: ELRA-W0055
ISLRN: 411-691-515-701-9
The CINTIL-TreeBank is a corpus of syntactic constituency trees of Portuguese texts composed of 10,039 sentences and 110,166 tokens taken from different sources and domains: news (8,861 sentences; 101,430 tokens), novels (399 sentences; 3,082 tokens). In addition, there are 779 sentences (5,654 t...

MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Non Commercial Use - ELRA END USER | 0.00 € | 3000.00 € |
Licence: Commercial Use - ELRA VAR | 3000.00 € | 3000.00 € |


- English
- Polish
ID: ELRA-W0186
ISLRN: 792-786-685-848-5
This dataset has been created within the framework of the European Language Resource Coordination (ELRC) Connecting Europe Facility - Automated Translation (CEF.AT) action. For further information on the project: http://lr-coordination.eu. A collection of parallel Polish-English texts published ...

MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain | 0.00 € | 0.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Other - Public Domain | 0.00 € | 0.00 € |


- Bulgarian
- Czech
- Dutch; Flemish
- English
- Finnish
- French
- German
- Hungarian
- Italian
- Persian
- Portuguese
- Russian
- Spanish; Castilian
- Swedish
ID: ELRA-E0036
ISLRN: 378-279-085-589-0
The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...

MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |
Special offers are also available. Check here for details.


- English
- German
- Russian
ID: ELRA-E0037
ISLRN: 609-362-685-537-2
The Cross-Language Evaluation Forum (CLEF) promotes R&D in multilingual information access (MLIA) by (i) developing an infrastructure for the testing, tuning and evaluation of information retrieval systems operating on European languages in both monolingual and cross-language contexts, and (ii) c...

MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 150.00 € | 500.00 € |

NON MEMBER | academic | commercial |
---|---|---|
Licence: Evaluation Use - ELRA EVALUATION | 300.00 € | 1000.00 € |
Special offers are also available. Check here for details.