113 Language Resources (Page 1 of 6)

« Previous | Next »Order by:

 AUDIO Human Voice Pronunciations - Chinese (Simplified)    
  • Chinese

ID: ELRA-S0490-03

ISLRN: 569-723-482-271-6

Human voice recordings of single-word lemmas and multiword expressions, besides IPA (International Phonetic Alphabet) and alternative scripts (Japanese – Romaji/Kanji/Hiragana; Chinese – Pinyin; Arabic and Hebrew – w/out diacritics), distributed as distinct sets (from ELRA-S0490-01 to ELRA-S0490-...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
471.90 € submit
471.90 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
495.50 € submit
495.50 € submit

Special offers are also available. Check here for details.

 Bitext Lexical Dataset - Chinese (Simplified)    
  • Chinese

ID: ELRA-L0137

ISLRN: 803-896-567-451-2

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Chinese (Simplified) consists of 75,000 lemmas (f...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
85000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
85000.00 € submit
 Bitext Lexical Dataset - Chinese (Traditional)    
  • Chinese

ID: ELRA-L0138

ISLRN: 934-287-681-414-0

The series of Bitext Lexical Datasets includes Lemmas, POS tagging, Frequency, Named Entities and Offensive features. Depending on the dataset and language, other syntactic and morphological features are also provided. The Bitext Lexical Dataset - Chinese (Traditional) consists of 75,000 lemmas (...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
85000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
85000.00 € submit
 Bitext Lexical Dataset - Language Variants - Chinese    
  • Chinese

ID: ELRA-L0152

ISLRN: 345-861-801-718-8

As a complement to the generic vocabulary provided in ELRA-L0137 and ELRA-L0138, the following language variants of Chinese are provided: - Chinese Simplified: 74,000 lemmas (forms) - Chinese Traditional: 74,000 lemmas (forms)

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
78000.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
78000.00 € submit
 Cantonese Conversational Speech Data by Mobile Phone and Voice Recorder - 607 Hours    
  • Chinese

ID: ELRA-S0427

ISLRN: 722-447-977-629-5

995 local Cantonese speakers participated in the recording, and conducted face-to-face communication in a natural way. They had free discussion on a number of given topics, with a wide range of fields; the voice was natural and fluent, in line with the actual dialogue scene. Text is transcribed m...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
98030.50 € submit
98030.50 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
98030.50 € submit
98030.50 € submit

Special offers are also available. Check here for details.

 Cantonese Dialect Speech Data by Mobile Phone - 1,652 Hours    
  • Chinese

ID: ELRA-S0478

ISLRN: 049-624-028-135-7

It collects 4,888 speakers from Guangdong Province and is recorded in quiet indoor environment. The recorded content covers 500,000 commonly used spoken sentences, including high-frequency words in weico and daily used expressions. The average number of repetitions is 1.5 and the average sentence...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
141246.00 € submit
141246.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
141246.00 € submit
141246.00 € submit

Special offers are also available. Check here for details.

 Cantonese Readings Database    
  • Chinese

ID: ELRA-L0101

ISLRN: 634-690-317-631-5

This database is not only comprehensive but also linguistically accurate. It is based on solid principles of Cantonese phonology and semantics, and takes into account the phenomena of polyphony as well as tone change, which is unpredictable and requires manual proofreading. It covers 300,000 entr...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
9000.00 € submit
15000.00 € submit
Licence: Commercial Use - ELRA VAR
18000.00 € submit
30000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
11250.00 € submit
18750.00 € submit
Licence: Commercial Use - ELRA VAR
22500.00 € submit
37500.00 € submit
 Cantonese Speecon database    
  • Chinese

ID: ELRA-S0287

ISLRN: 537-563-219-913-3

The Cantonese Speecon database is divided into 2 sets: 1) The first set comprises the recordings of 550 adult Cantonese speakers (273 males, 277 females), recorded over 4 microphone channels in 4 recording environments (office, entertainment, car, public place). 2) The second set comprises the ...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
50000.00 € submit
67000.00 € submit
Licence: Commercial Use - ELRA VAR
67000.00 € submit
67000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
60000.00 € submit
75000.00 € submit
Licence: Commercial Use - ELRA VAR
75000.00 € submit
75000.00 € submit
 Changsha Dialect Speech Data by Mobile Phone - 997 Hours    
  • Chinese

ID: ELRA-S0453

ISLRN: 520-610-210-012-3

2,000 Changsha natives participated in the recording, covering multiple age groups, with a balanced gender distribution and authentic accent. The recorded text is rich in content, covering general, interactive, car, home and other categories. Local people in changsha check and proofread. The ac...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
94715.00 € submit
94715.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
94715.00 € submit
94715.00 € submit

Special offers are also available. Check here for details.

 Chinese Children Speech data by Mobile phone - 3,255 Hours    
  • Chinese

ID: ELRA-S0458

ISLRN: 607-995-858-759-4

Mobile phone captured audio data of Chinese children, with total duration of 3,255 hours. 9,780 speakers are children aged 6 to 12, with accent covering seven dialect areas; the recorded text contains common children languages such as essay stories, numbers, and their interactions on cars, at hom...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
247380.00 € submit
247380.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
247380.00 € submit
247380.00 € submit

Special offers are also available. Check here for details.

 Chinese Digital Speech Data by Mobile Phone - 11,010 People    
  • Chinese

ID: ELRA-S0419

ISLRN: 434-094-443-871-0

11,010 Chinese native speakers participated in the recording with equal gender. Each speaker reads 30 sentences of 4 -8 digit number. Format:16kHz, 16bit, uncompressed wav, mono channel Recording environment:quiet indoor environment, without echo Recording content (read speech):four to eight...

MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
41838.00 € submit
41838.00 € submit
NON MEMBERacademiccommercial
Licence: Commercial Use - ELRA VAR
41838.00 € submit
41838.00 € submit

Special offers are also available. Check here for details.

 Chinese Lexical Database    
  • Chinese

ID: ELRA-L0107

ISLRN: 500-068-723-953-8

A comprehensive monolingual lexical database of Chinese consisting of Simplified and Traditional Chinese modules, covering general vocabulary and important technical terms. Each entry is accompanied by various attributes, such as phonological, grammatical, and morphological information, as well a...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4500.00 € submit
7500.00 € submit
Licence: Commercial Use - ELRA VAR
9000.00 € submit
15000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5625.00 € submit
9375.00 € submit
Licence: Commercial Use - ELRA VAR
11250.00 € submit
18750.00 € submit
 Chinese Mandarin (North) database    
  • Chinese

ID: ELRA-S0398

ISLRN: 353-548-770-894-7

This database contains the recordings of 500 Chinese Mandarin speakers from Northern China (250 males and 250 females), from 18 to 60 years’ old, recorded in quiet studios located in Shenzhen and in Hong Kong Special Administrative Region, People’s Republic of China. Demographics of native sp...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5200.00 € submit
7400.00 € submit
Licence: Commercial Use - ELRA VAR
7400.00 € submit
7400.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
7400.00 € submit
7400.00 € submit
Licence: Commercial Use - ELRA VAR
7400.00 € submit
7400.00 € submit
 Chinese Mandarin (South) database    
  • Chinese

ID: ELRA-S0397

ISLRN: 503-886-852-083-2

This database contains the recordings of 1000 Chinese Mandarin speakers from Southern China (500 males and 500 females), from 18 to 60 years’ old, recorded in quiet studios located in Shenzhen and in Hong Kong Special Administrative Region, People’s Republic of China. Demographics of native s...

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
10400.00 € submit
14800.00 € submit
Licence: Commercial Use - ELRA VAR
14800.00 € submit
14800.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
14800.00 € submit
14800.00 € submit
Licence: Commercial Use - ELRA VAR
14800.00 € submit
14800.00 € submit
 Chinese Mandarin Speech Recognition Corpus (Mobile) - 204.2 hours    
  • Chinese

ID: ELRA-S0228-67

ISLRN: 509-044-363-238-7

This corpus comprises 120,144 entries uttered by 400 speakers (199 males and 201 females), recorded over the mobile telephone network. Speech samples are stored as a sequence of 16-bit 16 kHz for a total of 204.2 hours of speech.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
24000.00 € submit
24000.00 € submit
Licence: Commercial Use - ELRA VAR
24000.00 € submit
24000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
24000.00 € submit
24000.00 € submit
Licence: Commercial Use - ELRA VAR
24000.00 € submit
24000.00 € submit
 Chinese Mandarin Speech Recognition Corpus (Mobile) - 67.4 hours    
  • Chinese

ID: ELRA-S0228-61

ISLRN: 599-273-322-100-1

This corpus comprises 91,729 entries uttered by 304 speakers (151 males and 153 females), recorded over the mobile telephone network. Speech samples are stored as a sequence of 16-bit 16kHz for a total of 67.4 hours of speech.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
18000.00 € submit
18000.00 € submit
Licence: Commercial Use - ELRA VAR
18000.00 € submit
18000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
18000.00 € submit
18000.00 € submit
Licence: Commercial Use - ELRA VAR
18000.00 € submit
18000.00 € submit
 Chinese Mandarin Speech Recognition Corpus (Mobile) - 85 hours    
  • Chinese

ID: ELRA-S0228-60

ISLRN: 654-695-177-609-6

This corpus comprises 60,216 entries uttered by 201 speakers (101 males and 100 females), recorded over the mobile telephone network. Speech samples are stored as a sequence of 16-bit 16kHz for a total of 85 hours of speech.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
12000.00 € submit
12000.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
12000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
12000.00 € submit
12000.00 € submit
Licence: Commercial Use - ELRA VAR
12000.00 € submit
12000.00 € submit
 Chinese Morphological Database    
  • Chinese

ID: ELRA-L0108

ISLRN: 279-636-746-963-2

This is a comprehensive database of Chinese derivative affixes with adjacency attributes.

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
5100.00 € submit
8500.00 € submit
Licence: Commercial Use - ELRA VAR
10200.00 € submit
17000.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
6375.00 € submit
10625.00 € submit
Licence: Commercial Use - ELRA VAR
12750.00 € submit
21250.00 € submit
 Chinese Phonological Database    
  • Chinese

ID: ELRA-L0102

ISLRN: 968-547-869-011-3

A large-scale database of Chinese pinyin readings. Especially noteworthy are the differences in pronunciation between Taiwan and the PRC, for example 期待 qí dài (Taiwan) and qī dài (PRC).

MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
3450.00 € submit
5750.00 € submit
Licence: Commercial Use - ELRA VAR
6900.00 € submit
11500.00 € submit
NON MEMBERacademiccommercial
Licence: Non Commercial Use - ELRA END USER
4313.00 € submit
7188.00 € submit
Licence: Commercial Use - ELRA VAR
8625.00 € submit
14375.00 € submit

« Previous | Next »