474 Hours-Japanese Speech Data By Mobile Phone
- Licensed Off-the-shelf Datasets to Boost AI Projects Development.
Japanese(Japan) Scripted Monologue Smartphone speech dataset, collected from monologue based on given scripts, covering generic domain, human-machine interaction, smart home command and in-car command, numbers and other domains. Transcribed with text content and other attributes. Our dataset was collected from extensive and diversify speakers(1,245 speakers in total), geographicly speaking, enhancing model performance in real and complex tasks.rnQuality tested by various AI companies. We strictly adhere to data protection regulations and privacy standards, ensuring the maintenance of user privacy and legal rights throughout the data collection, storage, and usage processes, our datasets are all GDPR, CCPA, PIPL complied.
Ask For a Quote Get Data SampleSpecifications
- Format
- 16kHz, 16bit, uncompressed wav, mono channel;
- Recording condition
- Low background noise(indoor), without echo;
- Content category
- Generic domain; human-machine interaction; smart home command and control; in-car command and control; numbers
- Recording device
- Android Smartphone, iPhone;
- Speaker
- 1,245 speakers, with 606 males and 639 females; 559 speakers in the age group of 16~25, 567 speakers in the age group of 26~45, 119 speakers are older than 45;
- Country
- Japan(JPN);
- Language(Region) Code
- ja-JP;
- Language
- Japanese;
- Features of annotation
- Transcription text;
- Accuracy Rate
- Sentence Accuracy Rate (SAR) 95%
-
零九零七四二五零一八三
-
八街市今から雨が降るんかな
-
助手室の温度調節してもらいたい
-
会社はどうなんだ
-
空気指数詳しく知りたい