logo
  • iconAll Datasets
  • iconSpeech Datasets
  • iconImage Datasets
  • iconText Datasets
  • iconVideo Datasets
  • iconMulti-Modal Datasets
AI
Ready-to-Use AI Datasets!

Explore 2000+ Unbiased & Ethically sourced datasets across various AI technologies like Speech Recognition, Computer Vision, Natural Language Processing, Optical Character Recognition, Generative AI, Machine Translation, etc!

Explore 2000+ Unbiased & Ethically sourced datasets across various AI technologies like Speech AI, Vision AI, Language AI, Generative AI, etc!

All Datasets
Arrow
Speech Recognition
Arrow
Computer Vision
Arrow
Natural Language Processing
Arrow
Generative AI
Arrow
Multi-Modal Learning
Arrow
Machine Translation
Arrow
    iconAR/VR
    iconAutomotive
    icon Banking & Finance
    iconHealthcare
    iconRetail & E-commerce
    iconSafety & Surveillance
    iconReal Estate
    iconTelecom
icon
  • iconAI Data Collection & Curation
  • iconGenerative AI Services
  • iconData Annotation
  • iconData Transcription
  • iconAdd-On AI Services
  • iconSaas AI Platforms
Diverse Speech DatasetsAbout Gradient Line
AI/ML Data Collection
Speech Data Collection
Image Data Collection
Text Data Collection
Video Data Collection
Multimodal Data Collection
Synthetic Data Collection
    iconBlog
    iconCase Study
    iconFAQs
    iconKnowledge Hub
Speech-Datasets-in-Indian-languages-for-TTS

Explore Our Latest Insightful Blog

Arrow
    iconAbout Us
    iconContact Us
    iconPolicies
    iconMonetize Dataset
    iconCrowd-as-a-Service
    iconJoin Community
logo
logo

Powering the Next Generation of AI with Ethical and Reliable Data!

Subscribe for tips, news, and offers.

SERVICES

Card Head Line
AI Data CollectionOTS DatasetsData AnnotationCrowd-as-a-ServiceAI Platforms

INDUSTRY

Card Head Line
AR/VRAutonomous VehiclesBanking & FinanceHealthcareRetail & E-commerceSafety & SurveillanceReal EstateTelecom

RESOURCES

Card Head Line
BlogsCase StudiesKnowledge HubFAQs

COMPANY

Card Head Line
About UsContact UsJoin CommunityPolicies

COMMUNITY

Card Head Line
Explore CommunityJoin Community

Follow Us!

Instagram
Instagram gradient
Facebook
Facebook gradient
Linkedin
Linkedin gradient
Twitter
Twitter gradient
Youtube
Youtube gradient
Privacy PolicyCard Head LineCookie Policy

Follow Us!

Instagram
Instagram gradient
Facebook
Facebook gradient
Linkedin
Linkedin gradient
Twitter
Twitter gradient
Youtube
Youtube gradient
Privacy PolicyCard Head LineCookie Policy

Subscribe for tips, news, and offers.

Copyright ⓒ 2025 FutureBeeAI. All rights reserved.

Arabic Speech Datasets

About Gradient Line

Explore the collection of arabic-dataset language speech datasets! It includes diverse range of speech data like General Conversation, Call Center Conversation, Scripted Monologues, Wake words and Commands.

Leverage these ready-to-deploy arabic-dataset language audio datasets in building robust Automatic Speech Recognition (ASR), Text-to-Speech (TTS), Conversational AI, and Voice assistant models.

Each voice dataset includes high-quality and realistic audio data, accurate transcription, and detailed metadata!

Contact Us
Decorative Lines

I want to explore

All
All
General Conversation
Call Center Conversation
Scripted Monologue
Wake Words & Commands
In-car Wake Words & Commands

Speech Datasets across

Arabic
Arabic
Bahasa
Bengali
Bulgarian
Czech
Danish
Dutch
English
Finnish
French
German
Gujarati
Hindi
Italian
Japanese
Kannada
Korean
Malay
Malayalam
Mandarin
Marathi
Norwegian
Odia
Polish
Portuguese
Punjabi
Romanian
Russian
Spanish
Swedish
Filipino
Tamil
Telugu
Thai
Turkish
Ukrainian
Urdu
Vietnamese

Languages!

Type

All
All
General Conversation
Call Center Conversation
Scripted Monologue
Wake Words & Commands
In-car Wake Words & Commands

Languages

Arabic
Arabic
Bahasa
Bengali
Bulgarian
Czech
Danish
Dutch
English
Finnish
French
German
Gujarati
Hindi
Italian
Japanese
Kannada
Korean
Malay
Malayalam
Mandarin
Marathi
Norwegian
Odia
Polish
Portuguese
Punjabi
Romanian
Russian
Spanish
Swedish
Filipino
Tamil
Telugu
Thai
Turkish
Ukrainian
Urdu
Vietnamese
FB Logo
Filter(52)
speechtype Icon

Type

Icon

Wake Words & Voice Command Datasets

Wake words & Command dataset for training & fine-tuning of voice assistants in Arabic (Saudi Arabia)

Saudi Arabian Arabic Wake Words & Commands Data

Wake words and commands audio recordings in Saudi Arabian Arabic

20000+ Recordings
50+ people
Wake Word DetectionCommand Recognition
Wake words & Command dataset for training & fine-tuning of voice assistants in Arabic (Algeria)

Algerian Arabic Wake Words & Commands Data

Wake words and commands audio recordings in Algerian Arabic

20000+ Recordings
50+ people
Wake Word DetectionCommand Recognition
Wake words & Command dataset for training & fine-tuning of voice assistants in Arabic (Egypt)

Egyptian Arabic Wake Words & Commands Data

Wake words and commands audio recordings in Egyptian Arabic

20000+ Recordings
50+ people
Wake Word DetectionCommand Recognition
Icon

In-Car Speech Datasets

Saudi Arabic In-car speech dataset
Arabic (Saudi Arabia)

Saudi Arabic In-car Speech Dataset

Automobile-specific wake words & commands in the in-car environment.

5000+ Recordings
50+ people
In-car ASRDriver Assistance
Icon

General Conversation Speech Datasets

Arabic (Algeria) Speech dataset for Conversational AI
Arabic (Algeria)

Algerian Arabic General Conversation Speech Data

Unscripted conversation audio data in Algerian Arabic.

50 Speech Hours
70 People
ASRConversational AI
Arabic (Egypt) Audio Dataset for Conversational AI
Arabic (Egypt)

Egyptian Arabic General Conversation Speech Data

Unscripted conversation audio data in Egyptian Arabic.

50 Speech Hours
70 People
ASRConversational AI
Arabic (Saudi Arabia) Voice dataset for Conversational AI
Arabic (Saudi Arabia)

Saudi Arabian Arabic General Conversation Speech Data

Unscripted conversation audio data in Saudi Arabian Arabic.

50 Speech Hours
70 People
ASRConversational AI
Icon

General Domain Scripted Monologue Speech Datasets

Arabic (Egypt) general domain scripted prompts dataset
Arabic (Egypt)

Egyptian Arabic General Scripted Monologue Data

Recordings of scripted prompts in Egyptian Arabic for General domain.

5000+ prompts
40+ people
ASRConversational AI
Arabic (Algeria) scripted promts dataset
Arabic (Algeria)

Algerian Arabic General Scripted Monologue Data

Recordings of scripted prompts in Algerian Arabic for General domain.

5000+ prompts
40+ people
ASRConversational AI
Arabic (Saudi Arabia) scripted monologues speech corpus for Speech recognition
Arabic (Saudi Arabia)

Saudi Arabian Arabic General Scripted Monologue Data

Recordings of scripted prompts in Saudi Arabian Arabic for General domain.

5000+ prompts
40+ people
ASRConversational AI
Icon

Retail & E-Commerce Call Center Speech Datasets

Arabic (Egypt) training dataset for Retail and E-commerce AI
Arabic (Egypt)

Egyptian Arabic Retail & E-com CC Speech Data

Retail & E-commerce call center audio data in Egyptian Arabic.

40 Speech Hours
80 People
ASRConversational AI
Arabic (Saudi Arabia) call center speech data for voicebot
Arabic (Saudi Arabia)

Saudi Arabian Retail & E-com CC Speech Data

Retail & E-commerce call center audio data in Saudi Arabian Arabic.

40 Speech Hours
80 People
ASRConversational AI
Arabic (Algeria) Speech to text dataset for Retail and E-commerce call center
Arabic (Algeria)

Algerian Arabic Retail & E-com CC Speech Data

Retail & E-commerce call center audio data in Algerian Arabic.

30 Speech Hours
60 People
ASRConversational AI
Icon

BFSI Call Center Speech Datasets

Arabic (Algeria) Speech to text dataset for BFSI call center
Arabic (Algeria)

Algerian Arabic BFSI CC Speech Data

BFSI call center audio data in Algerian Arabic.

30 Speech Hours
60 People
ASRChatbot
Arabic (Egypt) training dataset for BFSI AI
Arabic (Egypt)

Egyptian Arabic BFSI CC Speech Data

BFSI call center audio data in Egyptian Arabic.

40 Speech Hours
80 People
ASRChatbot
Arabic (Saudi Arabia) call center speech data for voicebot
Arabic (Saudi Arabia)

Saudi Arabian BFSI CC Speech Data

BFSI call center audio data in Saudi Arabian Arabic.

40 Speech Hours
80 People
ASRChatbot
Icon

Telecom Call Center Speech Datasets

Arabic (Algeria) Speech to text dataset for Telecom call center
Arabic (Algeria)

Algerian Arabic Telecom CC Speech Data

Telecom call center audio data in Algerian Arabic.

30 Speech Hours
60 People
ASRChatbot
Arabic (Saudi Arabia) call center speech data for voicebot
Arabic (Saudi Arabia)

Saudi Arabian Telecom CC Speech Data

Telecom call center audio data in Saudi Arabian Arabic.

40 Speech Hours
80 People
ASRChatbot
Arabic (Egypt) training dataset for Telecom AI
Arabic (Egypt)

Egyptian Arabic Telecom CC Speech Data

Telecom call center audio data in Egyptian Arabic.

40 Speech Hours
80 People
ASRChatbot
Icon

Delivery & Logistics Call Center Speech Datasets

Arabic (Algeria) Speech to text dataset for Delivery and logistics call center
Arabic (Algeria)

Algerian Arabic Delivery & Lgc CC Speech Data

Delivery & Logistics call center audio data in Algerian Arabic.

30 Speech Hours
60 People
ASRConversational AI
Arabic (Saudi Arabia) call center speech data for voicebot
Arabic (Saudi Arabia)

Saudi Arabian Delivery & Lgc CC Speech Data

Delivery & Logistics call center audio data in Saudi Arabian Arabic.

40 Speech Hours
80 People
ASRChatbot
Arabic (Egypt) training dataset for Delivery and logistics AI
Arabic (Egypt)

Egyptian Arabic Delivery & Lgc CC Speech Data

Delivery & Logistics call center audio data in Egyptian Arabic.

40 Speech Hours
80 People
ASRChatbot
Icon

Healthcare Call Center Speech Datasets

Arabic (Algeria) Speech to text dataset for Healthcare call center
Arabic (Algeria)

Algerian Arabic Healthcare CC Speech Data

Healthcare call center audio data in Algerian Arabic.

30 Speech Hours
60 People
ASRChatbot
Arabic (Egypt) training dataset for Healthcare AI
Arabic (Egypt)

Egyptian Arabic Healthcare CC Speech Data

Healthcare call center audio data in Egyptian Arabic.

40 Speech Hours
80 People
ASRChatbot
Arabic (Saudi Arabia) call center speech data for voicebot
Arabic (Saudi Arabia)

Saudi Arabian Healthcare CC Speech Data

Healthcare call center audio data in Saudi Arabian Arabic.

40 Speech Hours
80 People
ASRChatbot
Icon

Real Estate Call Center Speech Datasets

Arabic (Saudi Arabia) call center speech data for voicebot
Arabic (Saudi Arabia)

Saudi Arabian Real Estate CC Speech Data

Real Estate call center audio data in Saudi Arabian Arabic.

40 Speech Hours
80 People
ASRChatbot
Arabic (Algeria) Speech to text dataset for Realestate call center
Arabic (Algeria)

Algerian Arabic Real Estate CC Speech Data

Real Estate call center audio data in Algerian Arabic.

30 Speech Hours
60 People
ASRChatbot
Arabic (Egypt) training dataset for Realestate AI
Arabic (Egypt)

Egyptian Arabic Real Estate CC Speech Data

Real Estate call center audio data in Egyptian Arabic.

40 Speech Hours
80 People
ASRChatbot
Icon

Travel Call Center Speech Datasets

Arabic (Algeria) Speech to text dataset for Travel call center
Arabic (Algeria)

Algerian Arabic Travel CC Speech Data

Travel call center audio data in Algerian Arabic.

30 Speech Hours
60 People
ASRChatbot
Arabic (Egypt) training dataset for Travel AI
Arabic (Egypt)

Egyptian Arabic Travel CC Speech Data

Travel call center audio data in Egyptian Arabic.

40 Speech Hours
80 People
ASRChatbot
Arabic (Saudi Arabia) call center speech data for voicebot
Arabic (Saudi Arabia)

Saudi Arabian Travel CC Speech Data

Travel call center audio data in Saudi Arabian Arabic.

40 Speech Hours
80 People
ASRChatbot
Icon

Retail & E-Commerce Scripted Monologue Speech Datasets

Retail & E-commerce scripted monologue speech data for Machine learning in Arabic (Egypt)
Arabic (Egypt)

Egyptian Arabic Retail Scripted Monologue Data

Recordings of scripted prompts in Egyptian Arabic for Retail & E-commerce.

6000+ prompts
60+ people
ASRConversational AI
Scripted sentence recording dataset for conversational AI for Retail & E-commerce domain in Arabic (Algeria)
Arabic (Algeria)

Algerian Arabic Retail Scripted Monologue Data

Recordings of scripted prompts in Algerian Arabic for Retail & E-commerce.

6000+ prompts
60+ people
ASRConversational AI
Retail & E-commerce scripted monologue speech data for ASR in Arabic (Saudi Arabia)
Arabic (Saudi Arabia)

Saudi Arabian Arabic Retail Scripted Monologue Data

Recordings of scripted prompts in Saudi Arabian Arabic for Retail & E-commerce.

6000+ prompts
60+ people
ASRConversational AI
Icon

BFSI Scripted Monologue Speech Datasets

BFSI scripted monologue speech data for Machine learning in Arabic (Egypt)
Arabic (Egypt)

Egyptian Arabic BFSI Scripted Monologue Data

Audio recordings of scripted prompts in Egyptian Arabic for BFSI domain.

6000+ prompts
60+ people
ASRConversational AI
Scripted sentence recording dataset for conversational AI for BFSI domain in Arabic (Algeria)
Arabic (Algeria)

Algerian Arabic BFSI Scripted Monologue Data

Audio recordings of scripted prompts in Algerian Arabic for BFSI domain.

6000+ prompts
60+ people
ASRConversational AI
BFSI scripted monologue speech data for ASR in Arabic (Saudi Arabia)
Arabic (Saudi Arabia)

Saudi Arabian Arabic BFSI Scripted Monologue Data

Audio recordings of scripted prompts in Saudi Arabian Arabic for BFSI domain.

6000+ prompts
60+ people
ASRConversational AI
Icon

Telecom Scripted Monologue Speech Datasets

Telecom scripted monologue speech data for Machine learning in Arabic (Egypt)
Arabic (Egypt)

Egyptian Arabic Telecom Scripted Monologue Data

Audio recordings of scripted prompts in Egyptian Arabic for Telecom domain.

6000+ prompts
60+ people
ASRConversational AI
Scripted sentence recording dataset for conversational AI for Telecom domain in Arabic (Algeria)
Arabic (Algeria)

Algerian Arabic Telecom Scripted Monologue Data

Audio recordings of scripted prompts in Algerian Arabic for Telecom domain.

6000+ prompts
60+ people
ASRConversational AI
Telecom scripted monologue speech data for ASR in Arabic (Saudi Arabia)
Arabic (Saudi Arabia)

Saudi Arabian Arabic Telecom Scripted Monologue

Audio recordings of scripted prompts in Saudi Arabian Arabic for Telecom domain.

6000+ prompts
60+ people
ASRConversational AI
Icon

Delivery & Logistics Scripted Monologue Speech Datasets

Delivery & Logistics scripted monologue speech data for Machine learning in Arabic (Egypt)
Arabic (Egypt)

Egyptian Arabic Delivery & Lgc Monologue Data

Recordings of scripted prompts in Egyptian Arabic for Delivery & Logistics.

6000+ prompts
60+ people
ASRConversational AI
Scripted sentence recording dataset for conversational AI for Delivery & Logistics domain in Arabic (Algeria)
Arabic (Algeria)

Algerian Arabic Delivery & Lgc Monologue Data

Recordings of scripted prompts in Algerian Arabic for Delivery & Logistics.

6000+ prompts
60+ people
ASRConversational AI
Delivery & Logistics scripted monologue speech data for ASR in Arabic (Saudi Arabia)
Arabic (Saudi Arabia)

Saudi Arabian Arabic Delivery & Lgc Monologue

Recordings of scripted prompts in Saudi Arabian Arabic for Delivery & Logistics.

6000+ prompts
60+ people
ASRConversational AI
Icon

Healthcare Scripted Monologue Speech Datasets

Healthcare scripted monologue speech data for Machine learning in Arabic (Egypt)
Arabic (Egypt)

Egyptian Arabic Healthcare Monologue Data

Audio recordings of scripted prompts in Egyptian Arabic for Healthcare domain.

6000+ prompts
60+ people
ASRConversational AI
Scripted sentence recording dataset for conversational AI for Healthcare domain in Arabic (Algeria)
Arabic (Algeria)

Algerian Arabic Healthcare Monologue Data

Audio recordings of scripted prompts in Algerian Arabic for Healthcare domain.

6000+ prompts
60+ people
ASRConversational AI
Healthcare scripted monologue speech data for ASR in Arabic (Saudi Arabia)
Arabic (Saudi Arabia)

Saudi Arabian Arabic Healthcare Monologue

Audio recordings of scripted prompts in Saudi Arabian Arabic for Healthcare domain.

6000+ prompts
60+ people
ASRConversational AI
Icon

Real Estate Scripted Monologue Speech Datasets

Realestate scripted monologue speech data for Machine learning in Arabic (Egypt)
Arabic (Egypt)

Egyptian Arabic Real Estate Scripted Monologue Data

Audio recordings of scripted prompts in Egyptian Arabic for Real Estate domain.

6000+ prompts
60+ people
ASRConversational AI
Scripted sentence recording dataset for conversational AI for Realestate domain in Arabic (Algeria)
Arabic (Algeria)

Algerian Arabic Real Estate Scripted Monologue Data

Audio recordings of scripted prompts in Algerian Arabic for Real Estate domain.

6000+ prompts
60+ people
ASRConversational AI
Realestate scripted monologue speech data for ASR in Arabic (Saudi Arabia)
Arabic (Saudi Arabia)

Saudi Arabian Arabic Real Estate Scripted Monologue

Audio recordings of scripted prompts in Saudi Arabian Arabic for Real Estate domain.

6000+ prompts
60+ people
ASRConversational AI
Icon

Travel Scripted Monologue Speech Datasets

Travel scripted monologue speech data for Machine learning in Arabic (Egypt)
Arabic (Egypt)

Egyptian Arabic Travel Scripted Monologue Data

Audio recordings of scripted prompts in Egyptian Arabic for Travel domain.

6000+ prompts
60+ people
ASRConversational AI
Scripted sentence recording dataset for conversational AI for Travel domain in Arabic (Algeria)
Arabic (Algeria)

Algerian Arabic Travel Scripted Monologue Data

Audio recordings of scripted prompts in Algerian Arabic for Travel domain.

6000+ prompts
60+ people
ASRConversational AI
Travel scripted monologue speech data for ASR in Arabic (Saudi Arabia)
Arabic (Saudi Arabia)

Saudi Arabian Arabic Travel Scripted Monologue Data

Audio recordings of scripted prompts in Saudi Arabian Arabic for Travel domain.

6000+ prompts
60+ people
ASRConversational AI

Enhance your Speech Model’s performance with Multi-lingual Speech datasets!

Contact Usarrow
CTA illustration