Urdu Scripted Monologue Speech Dataset for Real Estate Domain

The audio dataset comprises scripted monologue speech data in the Real Estate domain, featuring native Urdu speakers from Pakistan. It includes speech data, detailed metadata, and accurate transcriptions.

About this Off-the-shelf Speech Dataset

Introduction

Introducing the Urdu Scripted Monologue Speech Dataset for the Real Estate Domain, a dataset designed to support the development of Urdu speech recognition and conversational AI technologies tailored for the real estate industry.

Speech Data

This dataset includes over 6,000 high-quality scripted prompt recordings in Urdu. The speech content reflects a wide range of real estate interactions to help build intelligent, domain-specific customer support systems and speech-enabled tools.

•Participant Diversity

•

Speakers: 60 native Urdu speakers from across Pakistan

•

Regional Variation: Balanced representation of regional dialects and speaking styles

•

Demographics: Ages 18–70, with a 60:40 male-to-female ratio

•Recording Specifications

•

Type: Scripted monologue recordings

•

Duration: 5–30 seconds per audio clip

•

Audio Format: WAV, mono channel, 16-bit, sampled at 8 kHz and 16 kHz

•

Recording Environment: Quiet, echo-free settings with no background noise

Topic and Scenario Coverage

This dataset captures a broad spectrum of use cases and conversational themes within the real estate sector, such as:

•Property inquiries and viewing appointments

•Price negotiations and financial discussions

•Contractual and legal clarifications

•Relocation coordination and service support

•Real estate agent interactions

•Regulatory information and buyer/seller advisory

•Domain-specific spoken statements and service dialogues

Contextual Depth

Each scripted prompt incorporates key elements to simulate realistic real estate conversations:

•

Names: Culturally appropriate Pakistan names in various spoken formats

•

Addresses: Detailed location references, including cities, districts, and street names

•

Dates & Times: Contextual references to appointments, contract timelines, or move-in dates

•

Property Descriptions: Features, measurements, and amenities of real estate listings

•

Financial Details: Prices, rental amounts, down payments, deposits, and loan-related figures

•

Legal Terms: Frequently used terms in property contracts and documentation

Transcription

To ensure precision in model training, each audio recording is paired with a verbatim text transcription:

•

Content: Exact scripted text for each corresponding audio prompt

•

Format: Plain text (.TXT) files named to match their associated audio recordings

•

Quality Control: All transcriptions are manually reviewed by native Urdu linguists for consistency and correctness

Metadata

Each data sample is enriched with detailed metadata to enhance usability:

•

Participant Metadata: Speaker ID, age, gender, region, dialect

•

Audio Metadata: Prompt transcript, recording conditions. device used, sample rate, bit depth, and file format

This metadata provides critical context for domain adaptation, performance analysis, and model fine-tuning.

Usage and Applications

This dataset is highly adaptable for a range of speech AI and NLP use cases in the real estate domain:

•

ASR Model Training: Build robust Urdu speech recognition systems for real estate services

•

TTS & Voice Synthesis: Create synthetic voices for virtual property agents

•

Voice Assistants: Train voice-first real estate bots and assistants

•

Chatbots & Virtual Agents: Enhance customer experience with intelligent dialogue models

•

NER and Intent Recognition: Extract property details, names, numbers, and transactional entities

•

Sentiment & Topic Analysis: Analyze customer sentiment and common concerns in real estate conversations

Secure & Ethical Collection

All data was collected using FutureBeeAI’s secure and proprietary platform, Yugo

•The process followed strict ethical and privacy guidelines, with full participant consent

•No personally identifiable information (PII) is present in the dataset, ensuring full compliance and safe usage

License

This dataset is created and distributed by FutureBeeAI and is available for commercial use, empowering organizations to build high-performance voice and language solutions for the real estate sector.