Egyptian Arabic Scripted Monologue Speech Dataset for Retail & E-commerce Domain

Scripted monologue speech dataset in Egyptian Arabic for the Retail & E-commerce sector. Includes clean audio recordings, accurate transcriptions, and metadata for use in ASR and conversational AI development.

Category

Scripted Prompt Recordings

Total Volume

6000+ prompts

Last updated

June 2025

Number of participants

60+

Retail & E-commerce scripted monologue speech data for Machine learning in Arabic (Egypt)
Download
Download Icon

About this Off-the-shelf Speech Dataset

Card Head Line

Introduction

Welcome to the Egyptian Arabic Scripted Monologue Speech Dataset for the Retail & E-commerce domain. This dataset is built to accelerate the development of Arabic language speech technologies especially for use in retail-focused automatic speech recognition (ASR), natural language processing (NLP), voicebots, and conversational AI applications.

Speech Data

This training dataset includes 6,000+ high-quality scripted audio recordings in Egyptian Arabic, created to reflect real-world scenarios in the Retail & E-commerce sector. These prompts are tailored to improve the accuracy and robustness of customer-facing speech technologies.

  • Participant Diversity
  • Speakers: 60 native Arabic speakers from across Egypt
  • Geographic Coverage: Multiple Egypt regions to ensure dialect and accent diversity
  • Demographics: Participants aged 18 to 70, with a 60:40 male-to-female distribution
  • Recording Details
  • Nature of Recording: Scripted monologue-style speech prompts
  • Duration: Each recording spans 5 to 30 seconds
  • Audio Format: WAV format, mono channel, 16-bit depth, and 8kHz / 16kHz sample rates
  • Environment: Recorded in quiet conditions, free from background noise and echo
  • Topic Diversity

    This dataset includes a comprehensive set of retail-specific topics to ensure wide linguistic coverage for AI training:

  • Customer Service Interactions
  • Order Placement and Payment Processes
  • Product and Service Inquiries
  • Technical Support Queries
  • General Information and Guidance
  • Promotional and Sales Announcements
  • Domain-Specific Service Statements
  • Contextual Enrichment

    To increase training utility, prompts include contextual data such as:

  • Region-Specific Names: Common Egypt male and female names in diverse formats
  • Addresses: Localized address variations spoken naturally
  • Dates & Times: Realistic phrasing in delivery, promotions, and return policies
  • Product References: Real-world product names, brands, and categories
  • Numerical Data: Spoken numbers and prices used in transactions and offers
  • Order IDs & Tracking Numbers: Common references in customer service calls
  • These additions help your models learn to recognize structured and unstructured retail-related speech.

    Transcription

    Every audio file is paired with a verbatim transcription, ensuring consistency and alignment for model training.

  • Content: Exact scripted prompts as spoken by the participant
  • Format: Provided in plain text (.TXT) format with filenames matching the associated audio
  • Quality Assurance: All transcripts are verified for accuracy by native Arabic transcribers
  • Metadata

    Detailed metadata is included to support filtering, analysis, and model evaluation:

  • Participant Metadata: Unique speaker ID, age, gender, region (country, state), and dialect
  • Recording Metadata: Transcript, recording environment, device used, bit depth, sample rate, and file format
  • Usage & Applications

    This dataset supports a wide range of use cases within AI and speech technology development:

  • Speech Recognition Training: Fine-tune Arabic ASR models
  • Voice Synthesis & TTS: Generate synthetic voices based on real Egyptian Arabic samples
  • Retail Voice Assistants: Build voice-first shopping and support experiences
  • Chatbot Development: Train NLU engines for product and service inquiries
  • Named Entity Recognition (NER): Extract names, dates, prices, and order details
  • Language Understanding: Enhance sentiment analysis and topic modeling for retail interactions
  • Secure & Ethical Collection

    All data was collected through FutureBeeAI’s proprietary and secure Yugo platform.

  • Data never left the secure environment
  • Ethical collection standards followed with full participant consent
  • No personally identifiable information (PII) is included
  • Fully compliant and safe for commercial and academic use
  • License

    This Egyptian Arabic Retail & E-commerce Scripted Monologue Speech Dataset is created by FutureBeeAI and is available for commercial use.

    Use Cases

    Use of scripted speech monologues datasets for Automatic Speech Recognition

    ASR

    Use of scripted speech monologues datasets for Conversational AI

    Conversational AI

    Use of scripted speech monologues datasets for Chatbot

    Chatbot

    Use of scripted speech monologues datasets for TTS

    TTS

    Use of scripted speech monologues datasets for Speech analytics

    Speech Analytics

    Use of scripted speech monologues datasets for Mobile speech

    Mobile Speech

    Dataset Sample(s)

    Card Head Line

    TRANSCRIPTION

    SPEAKERDURATIONTRANSCRIPT

    Dataset Details

    Card Head Line

    Language

    Arabic

    Language code

    ar-eg

    Country

    Egypt

    Accents

    Damietta, Al Sharqia ...more

    Gender Distribution

    M:60, F:40

    Age Group

    18-70 Years

    File Details

    Card Head Line

    Environment

    Silent

    Bit Depth

    16 bit

    Sample rate

    8KHz & 16KHz

    Channel

    Mono

    Audio file duration

    5 to 30 seconds

    Need datasets for a specific AI/ML use case?
    Don't worry, we've got you covered! 👍

    Contact Us
    Prompt 2 Bg