Copyright © 2025 FutureBeeAI. All rights reserved.

Wake Words & Voice Commands Datasets

Discover a comprehensive collection of wake word and voice command speech datasets. These audio datasets are tailored to help you develop and refine speech recognition models, so that voice assistants and smart devices can better detect wake words and respond to user commands.

Our wake word and voice command datasets include high-quality speech data, accurate transcriptions, and detailed metadata.

Leverage these ready-to-use datasets to train and fine-tune speech recognition models, improve voice assistant responsiveness and user experience, and expand the capabilities of smart devices and voice-enabled applications!

Wake Words & Voice Command Datasets

Saudi Arabian Arabic Wake Words & Commands Data

Wake words and commands audio recordings in Arabic (Saudi Arabia) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Indian English Wake Words & Commands Data

Wake words and commands audio recordings in English (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

UK English Wake Words & Commands Dataset

Wake words and commands audio recordings in English (UK) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

US English Wake Words & Commands Dataset

Wake words and commands audio recordings in English (US) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

German Wake Words & Commands Dataset

Wake words and commands audio recordings in German (Germany) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Japanese Wake Words & Commands Dataset

Wake words and commands audio recordings in Japanese (Japan) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Korean Wake Words & Commands Dataset

Wake words and commands audio recordings in Korean (South Korea) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Spanish Wake Words & Commands Dataset

Wake words and commands audio recordings in Spanish (Spain) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Swedish Wake Words & Commands Dataset

Wake words and commands audio recordings in Swedish (Sweden) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Algerian Arabic Wake Words & Commands Data

Wake words and commands audio recordings in Arabic (Algeria) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Egyptian Arabic Wake Words & Commands Data

Wake words and commands audio recordings in Arabic (Egypt) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Bahasa Wake Words & Commands Dataset

Wake words and commands audio recordings in Bahasa (Indonesia) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Bangladesh Bengali Wake Words & Commands Data

Wake words and commands audio recordings in Bengali (Bangladesh) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Indian Bengali Wake Words & Commands Data

Wake words and commands audio recordings in Bengali (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Bulgarian Wake Words & Commands Dataset

Wake words and commands audio recordings in Bulgarian (Bulgaria) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Czech Wake Words & Commands Dataset

Wake words and commands audio recordings in Czech (Czech Republic) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Danish Wake Words & Commands Dataset

Wake words and commands audio recordings in Danish (Denmark) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Dutch Wake Words & Commands Dataset

Wake words and commands audio recordings in Dutch (Netherlands) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Australian English Wake Words & Commands Data

Wake words and commands audio recordings in English (Australia) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Canadian English Wake Words & Commands Data

Wake words and commands audio recordings in English (Canada) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

New Zealand English Wake Words & Commands Data

Wake words and commands audio recordings in English (New Zealand) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Philippines English Wake Words & Commands Data

Wake words and commands audio recordings in English (Philippines) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Filipino Wake Words & Commands Dataset

Wake words and commands audio recordings in Filipino (Philippines) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Finnish Wake Words & Commands Dataset

Wake words and commands audio recordings in Finnish (Finland) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Canadian French Wake Words & Commands Data

Wake words and commands audio recordings in French (Canada) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

French Wake Words & Commands Dataset

Wake words and commands audio recordings in French (France) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Swiss German Wake Words & Commands Data

Wake words and commands audio recordings in German (Switzerland) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Gujarati Wake Words & Commands Dataset

Wake words and commands audio recordings in Gujarati (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Hindi Wake Words & Commands Dataset

Wake words and commands audio recordings in Hindi (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Italian Wake Words & Commands Dataset

Wake words and commands audio recordings in Italian (Italy) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Kannada Wake Words & Commands Dataset

Wake words and commands audio recordings in Kannada (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Malay Wake Words & Commands Dataset

Wake words and commands audio recordings in Malay (Malaysia) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Malayalam Wake Words & Commands Dataset

Wake words and commands audio recordings in Malayalam (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Mandarin Wake Words & Commands Dataset

Wake words and commands audio recordings in Mandarin (China) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Marathi Wake Words & Commands Dataset

Wake words and commands audio recordings in Marathi (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Norwegian Wake Words & Commands Dataset

Wake words and commands audio recordings in Norwegian (Norway) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Odia Wake Words & Commands Dataset

Wake words and commands audio recordings in Odia (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Polish Wake Words & Commands Dataset

Wake words and commands audio recordings in Polish (Poland) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Brazilian Portuguese Wake Words & Commands Data

Wake words and commands audio recordings in Portuguese (Brazil) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Portuguese Wake Words & Commands Dataset

Wake words and commands audio recordings in Portuguese (Portugal) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Punjabi Wake Words & Commands Dataset

Wake words and commands audio recordings in Punjabi (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Romanian Wake Words & Commands Dataset

Wake words and commands audio recordings in Romanian (Romania) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Russian Wake Words & Commands Dataset

Wake words and commands audio recordings in Russian (Russia) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Argentine Spanish Wake Words & Commands Data

Wake words and commands audio recordings in Spanish (Argentina) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Colombian Spanish Wake Words & Commands Data

Wake words and commands audio recordings in Spanish (Colombia) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Mexican Spanish Wake Words & Commands Data

Wake words and commands audio recordings in Spanish (Mexico) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

US Spanish Wake Words & Commands Dataset

Wake words and commands audio recordings in Spanish (USA) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Tamil Wake Words & Commands Dataset

Wake words and commands audio recordings in Tamil (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Telugu Wake Words & Commands Dataset

Wake words and commands audio recordings in Telugu (India) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Thai Wake Words & Commands Dataset

Wake words and commands audio recordings in Thai (Thailand) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Turkish Wake Words & Commands Dataset

Wake words and commands audio recordings in Turkish (Turkey) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Ukrainian Wake Words & Commands Dataset

Wake words and commands audio recordings in Ukrainian (Ukraine) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Urdu Wake Words & Commands Dataset

Wake words and commands audio recordings in Urdu (Pakistan) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Vietnamese Wake Words & Commands Dataset

Wake words and commands audio recordings in Vietnamese (Vietnam) for training and fine-tuning voice assistants

20000+ Recordings
50+ people
Wake Word Detection, Command Recognition

Train voice assistants with Wake Words & Commands Speech Dataset!
