134 Hours - Malay Speech Data by Mobile Phone_Reading
156 Speakers - Mobile Telephony Malay Speech Data_Reading is recorded by native Malay speakers in the quiet environment. The recording is rich in content, covering multiple categories such as economy, entertainment, news, oral language, numbers, and letters. Around 450 sentences for each speaker. The effective time is 134 hours. All texts are manually transcribed to ensure high accuracy.
Malay speechMalaysiaMobile phoneReadingSample
1,796 Hours - German Speech Data by Mobile Phone
German audio data captured by mobile phone, 1,796 hours in total, recorded by 3,442 German native speakers. The recorded text is designed by linguistic experts, covering generic, interactive, on-board, home and other categories. The text has been proofread manually with high accuracy; this data can be used for automatic speech recognition, machine translation, and voiceprint recognition.
GermanGermanyMobile phoneReadingSample
101,702 Japanese Pronunciation Dictionary
The data contains 101,702 entries. All words and pronunciations are produced by Japanese linguists. It can be used in the research and development of Japanese ASR technology.
Sample
9,435 People 60,872 Images Cross-age Faces Data
9,435 People 60,872 Images Cross-age Faces Data. The data includes indoor and outdoor scenes. The dataset includes female and male(Chinese). For most people, the age spans are 10 years at least, the age spans of only a few people are less than 10 years (130 people). For each person, at least 4 front side images were collected. The data can be used for tasks such as cross-age face recognition.
Several images for one personDifferent scenesDifferent age periodsSample
397 People - Hindi Speech Data by Mobile Phone_Guiding
The data is recorded by 397 Indian with authentic accent, 50 sentences for each speaker, total 8.6 hours. The recording content involves car scene, smart home, intelligent voice assistant. This data can be used for corpus construction of machine translation, model training and algorithm research for voiceprint recognition.
HindiIndiaMobile PhoneGuidingSample
201 Hours – North American English Speech Data by Mobile Phone and PC
The data set contains 302 North American speakers' speech data. The recording contents include phrases and sentences with rich scenes. The valid time is 201 hours. The recording environment is quiet indoor. The recording device includes PC, android cellphone, and iPhone. This data can be used in speech recognition research in North American area.
EnglishNorth AmericaReadingSample
CUSTOMIZED COLLECTION & ANNOTATION SERVICES
1,000,000+ crowdsourcing to perform complex and professional projects