Bulgarian Real Estate Conversational Chat Dataset

This dataset features Bulgarian text-based chat conversations between customers and call center agents, specifically focused on Real Estate domain interactions. Covering a wide range of real-world topics, the dataset captures the authentic language, tone, and flow of Bulgarian customer service dialogues. It is ideal for training chatbots, virtual assistants, and NLP models for telecom-focused applications.

Category

Conversational Chat Dataset

Total volume

10K+ chats

Last Updated

July 2025

Number of participants

150 people

Realestate NLP conversational chat dataset in Bulgarian

About This OTS Dataset

Card Head Line

Introduction

The Bulgarian Real Estate Chat Dataset is a high-quality collection of over 10,000 text-based conversations between customers and call center agents. These conversations reflect real-world scenarios within the Real Estate sector, offering rich linguistic data for training conversational AI, chatbots, and NLP systems focused on property-related interactions in Bulgarian-speaking regions.

Participant & Chat Overview

  • Participants: 150+ native Bulgarian speakers from the FutureBeeAI Crowd Community
  • Conversation Length: 300–700 words per chat
  • Turns per Chat: 50–150 dialogue turns across both speakers
  • Chat Types: Inbound and outbound
  • Sentiment Coverage: Positive, neutral, and negative interactions included
  • Topic Diversity

    The dataset spans a broad range of Real Estate service conversations, covering various customer intents and agent support tasks:

  • Inbound Chats (Customer-Initiated)
  • Property inquiries (buy/rent)
  • Rental property availability
  • Renovation and maintenance inquiries
  • Property features and amenities
  • Investment advice and ROI analysis
  • Property ownership and legal history
  • Outbound Chats (Agent-Initiated)
  • New property listing announcements
  • Post-purchase follow-ups
  • Investment opportunity alerts
  • Property valuation updates
  • Customer satisfaction and feedback surveys
  • This topic variety enables realistic model training for both lead generation and post-sale engagement scenarios.

    Language Nuance & Authenticity

    Conversations are reflective of natural Bulgarian used in the Real Estate domain, incorporating:

  • Cultural Naming Patterns: Personal names, agency names, and developer brands
  • Localized Contact Info: Phone numbers, email addresses, and geographic locations across Bulgarian-speaking regions
  • Numeric and Temporal Language: Dates, prices, unit sizes, and time references formatted in Bulgarian conventions
  • Informal and Domain-Specific Language: Real estate slang, idioms, and casual tone used in property discussions
  • This level of linguistic realism supports model generalization across dialects and user demographics.

    Conversational Structure & Flow

    Conversations include a mix of short inquiries and detailed advisory sessions, capturing full customer journeys:

  • Dialogue Types
  • General inquiries
  • Sales consultations
  • Investment advisory
  • Follow-up coordination
  • Complaint handling and support
  • Flow Components
  • Greetings and identity verification
  • Intent identification and context gathering
  • Solution explanation or recommendations
  • Resolution or next steps
  • Closing and optional feedback
  • This structure supports training of AI systems that can handle multi-turn dialogues and dynamic user needs.

    Data Format & Structure

    Available in JSON, CSV, and TXT formats, each record includes:

  • Full dialogue history
  • Participant identifiers
  • Optional metadata such as sentiment, topic, or region tags
  • Format compatible with popular NLP toolkits

    Applications

    This dataset is ideal for a wide range of AI and NLP applications within the Real Estate domain:

  • Real Estate Chatbots & Virtual Assistants
  • Intent Detection and Dialogue Flow Modeling
  • Lead Qualification and Sales Automation
  • NER for Entity Extraction (e.g., location, price, unit type)
  • Text Summarization and Generation
  • Bulgarian NLP Research for Real Estate Vertical
  • Secure & Ethical Collection

  • Consent-Based Participation: All contributors participated with informed consent
  • Privacy-Preserved: No personally identifiable information (PII) is included
  • Secure Platform: All data was handled and stored within FutureBeeAI’s secure data environment
  • Ethical Compliance: Collection and usage aligned with responsible AI and data governance standards
  • Dataset Expansion & Customization

    This dataset is actively maintained and can be extended or customized based on your requirements:

  • Custom Annotations: Named Entity Recognition (NER), sentiment, intent labels, etc.
  • Topic-Specific Collection: e.g., mortgage advisory, vacation rentals, commercial property
  • Region-Specific Language: Country/dialect-focused data collection in Bulgarian
  • Multilingual Options: Data available in other languages on request
  • Licensing

    The dataset is developed and owned by FutureBeeAI and is available for commercial licensing. Flexible terms are available for enterprises, startups, and academic use.

    Use Cases

    Use of conversational chat dataset in Chatbot

    Chatbot

    Use of conversational chat dataset in Text Recognition

    Text Recognition

    Use of conversational chat dataset in Text Analytics

    Text Analysis

    Use of conversational chat dataset in Text Prediction

    Text Prediction

    Use of conversational chat dataset in Smart Assistant

    Smart Assistants

    Dataset Sample(s)

    Card Head Line

    Dataset Details

    Card Head Line

    Dataset type

    Real Estate Conversational Chats

    Volume

    10K+ chats

    Media type

    Text Only

    Language

    Bulgarian

    Topics

    100+

    File Details

    Card Head Line

    Turns per Chat

    50-150

    Word count

    300-700 words

    Format

    TXT, DOCS, JSON or CSV

    Annotation

    On Request

    Need datasets for a specific AI/ML use case?
    Don't worry, we've got you covered! 👍

    Contact Us
    Prompt 2 Bg