All Datasets

Explore All MDT Corpora

Browse multilingual datasets across audio, text, image, and more.

7 datasets found
Mandarin Text Classification Dataset
View Detail
25,000 samples Mandarin Chinese
Industry: Financial Services
Application: Voice Commerce & Consumer Service
Type: Test Set
Region: China
English Conversational Speech Dataset
View Detail
Open-Source 50,000 samples English
Industry: Automotive
Application: Automotive Virtual Assistant
Type: Training Set
Region: USA
Japanese Voice Command Dataset
View Detail
Open-Source 30,000 samples Japanese
Industry: Smart Home
Application: Smart Home Controls
Type: Training Set
Region: Japan
Korean Sentiment Analysis Dataset
View Detail
Open-Source 40,000 samples Korean
Industry: Social Networks
Application: Voice Commerce & Consumer Service
Type: Test Set
Region: Korea
German Automotive Commands Dataset
View Detail
20,000 samples German
Industry: Automotive
Application: Automotive Virtual Assistant
Type: Training Set
Region: Germany
Spanish Customer Service Dataset
View Detail
35,000 samples Spanish
Industry: Financial Services
Application: Voice Commerce & Consumer Service
Type: Training Set
Region: Spain
Medical Image Recognition Dataset
View Detail
15,000 samples English
Industry: Healthcare
Application: Healthcare
Type: Training Set
Region: USA
Page 1 of 1
Comprehensive Coverage
Cross-domain corpora spanning languages, regions, and applications.
Quality & Compliance
Curated datasets with robust consent, security, and review flows.
Scale & Reliability
Proven collection pipelines and validation for enterprise workloads.

Need a custom corpus or procurement help?

Reach out to our solutions team to source or tailor datasets.

Contact Us