Introduction
This Spanish Call Center Speech Dataset for the Telecom industry is purpose-built to accelerate the development of speech recognition, spoken language understanding, and conversational AI systems tailored for Spanish-speaking telecom customers. Featuring over 30 hours of real-world, unscripted audio, it delivers authentic customer-agent interactions across key telecom support scenarios to help train robust ASR models.
Curated by FutureBeeAI, this dataset empowers voice AI engineers, telecom automation teams, and NLP researchers to build high-accuracy, production-ready models for telecom-specific use cases.
Speech Data
The dataset contains 30 hours of dual-channel call center recordings between native Spanish speakers. Captured in realistic customer support settings, these conversations span a wide range of telecom topics from network complaints to billing issues, offering a strong foundation for training and evaluating telecom voice AI solutions.
        •
        
        Speakers:
         60 native Spanish speakers from our verified contributor pool.
        
         
        •
        
        Regions:
         Representing multiple provinces across Spain to ensure coverage of various accents and dialects.
        
         
        •
        
        Participant Profile:
         Balanced gender mix (60% male, 40% female) with age distribution from 18 to 70 years.
        
         
        •
        
        Conversation Nature:
         Naturally flowing, unscripted interactions between agents and customers.
        
         
        •
        
        Call Duration:
         Ranges from 5 to 15 minutes.
        
         
        •
        
        Audio Format:
         Stereo WAV files, 16-bit depth, at 8kHz and 16kHz sample rates.
        
         
        •
        
        Recording Environment:
         Captured in clean conditions with no echo or background noise.
        
         Topic Diversity
This speech corpus includes both inbound and outbound calls with varied conversational outcomes like positive, negative, and neutral ensuring broad scenario coverage for telecom AI development.
•Network Connectivity Issues
•International Roaming Enquiry
•Refund Requests and Billing Adjustments
•Emergency Service Access, and others
•Welcome Calls & Onboarding
•Customer Satisfaction Surveys
•Network Complaint Status Calls, and more
This variety helps train telecom-specific models to manage real-world customer interactions and understand context-specific voice patterns.
Transcription
All audio files are accompanied by manually curated, time-coded verbatim transcriptions in JSON format.
•Speaker-Segmented Dialogues
•Non-speech Tags (e.g., pauses, coughs)
•High transcription accuracy with word error rate < 5% thanks to dual-layered quality checks.
These transcriptions are production-ready, allowing for faster development of ASR and conversational AI systems in the Telecom domain.
Metadata
Rich metadata is available for each participant and conversation:
            •
            
            Participant Metadata:
             ID, age, gender, accent, dialect, and location.
            
             
            •
            
            Conversation Metadata:
             Topic, sentiment, call type, sample rate, and technical specs.
            
             This metadata supports fine-grained analysis, dialect-specific tuning, and precise dataset segmentation.
Usage and Applications
This dataset is ideal for a range of telecom AI and NLP applications:
            •
            
            Automatic Speech Recognition (ASR):
             Fine-tune Spanish speech-to-text systems for telecom interactions.
            
             
            •
            
            Speech Analytics:
             Identify user pain points and improve telecom service delivery.
            
             
            •
            
            Voice Assistants & Chatbots:
             Build telecom virtual assistants for customer self-service.
            
             
            •
            
            Sentiment Analysis:
             Detect customer frustration or satisfaction in support calls.
            
             
            •
            
            Generative AI:
             Train telecom-specific summarization and response generation models.
            
             Secure and Ethical Collection
•All data was collected using “Yugo,” FutureBeeAI’s proprietary platform under strict ethical and security standards.
•No personally identifiable information is included.
• The Dataset complies with global data privacy guidelines and is copyright-free.
Updates and Customization
We regularly expand this dataset with new telecom voice data and support full customization:
        •
        
        Acoustic Environment:
         Silent or noisy upon request.
        
         
        •
        
        Sample Rate:
         Customizable from 8kHz to 48kHz.
        
         
        •
        
        Transcription Format:
         Can follow your QA and formatting requirements.
        
         License
This Telecom domain dataset is commercially licensed and ready for integration into Spanish ASR, NLP, and voice AI solutions.