English (India) Call Center Speech Dataset for Real Estate

The audio dataset comprises call center conversations for the Real Estate domain, featuring native English speakers from India. It includes speech data, detailed metadata and accurate transcriptions.

Category

Unscripted Call Center Conversations

Total Volume

30 Speech Hours

Last updated

Jun 2024

Number of participants

60

English (India) call center audio recording for Realestate industry
Download
Download Icon

About this Off-the-shelf Speech Dataset

Card Head Line

Introduction

Welcome to the Indian English Call Center Speech Dataset for the Real Estate domain designed to enhance the development of call center speech recognition models specifically for the Real Estate industry. This dataset is meticulously curated to support advanced speech recognition, natural language processing, conversational AI, and generative voice AI algorithms.

Speech Data:

This training dataset comprises 30 Hours of call center audio recordings covering various topics and scenarios related to the Real Estate domain, designed to build robust and accurate customer service speech technology.

  • Participant Diversity:
  • Speakers: 60 expert native Indian English speakers from the FutureBeeAI Community.
  • Regions: Different states/provinces of India, ensuring a balanced representation of Indian accents, dialects, and demographics.
  • Participant Profile: Participants range from 18 to 70 years old, representing both males and females in a 60:40 ratio, respectively.
  • Recording Details:
  • Conversation Nature: Unscripted and spontaneous conversations between call center agents and customers.
  • Call Duration: Average duration of 5 to 15 minutes per call.
  • Formats: WAV format with stereo channels, a bit depth of 16 bits, and a sample rate of 8 and 16 kHz.
  • Environment: Without background noise and without echo.
  • Topic Diversity

    This dataset offers a diverse range of conversation topics, call types, and outcomes, including both inbound and outbound calls with positive, neutral, and negative outcomes.

  • Inbound Calls:
  • Property Inquiry
  • Rental Property Search & Availability
  • Renovation Inquiries
  • Property Features & Amenities Inquiry
  • Investment Property Analysis & Advice
  • Property History & Ownership Details, and many more
  • Outbound Calls:
  • New Property Listing Update
  • Post Purchase Follow-ups
  • Investment Opportunities & Property Recommendations
  • Property Value Updates
  • Customer Satisfaction Surveys, and many more
  • This extensive coverage ensures the dataset includes realistic call center scenarios, which is essential for developing effective customer support speech recognition models.

    Transcription

    To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. These transcriptions feature:

  • Speaker-wise Segmentation: Time-coded segments for both agents and customers.
  • Non-Speech Labels: Tags and labels for non-speech elements.
  • Word Error Rate: Word error rate is less than 5% thanks to the dual layer of QA.
  • These ready-to-use transcriptions accelerate the development of the Real Estate domain call center conversational AI and ASR models for the Indian English language.

    Metadata

    The dataset provides comprehensive metadata for each conversation and participant:

  • Participant Metadata: Unique identifier, age, gender, country, state, district, accent and dialect.
  • Conversation Metadata: Domain, topic, call type, outcome/sentiment, bit depth, and sample rate.
  • This metadata is a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of Indian English call center speech recognition models.

    Usage and Applications

    This dataset can be used for various applications in the fields of speech recognition, natural language processing, and conversational AI, specifically tailored to the Real Estate domain. Potential use cases include:

  • Speech Recognition Models: Training and fine-tuning speech recognition models for Indian English.
  • Speech Analytics Models: Building speech analytics models to extract insights, identify patterns, and glean valuable information from customer conversation, enables data-driven decision-making and process optimization within the Real Estate sector.
  • Smart Assistants and Chatbots: Developing conversational agents and virtual assistants for customer service in the Real Estate industries.
  • Sentiment Analysis: Analyzing customer sentiment and improving customer experience based on call center interactions.
  • Generative AI: Training generative AI models capable of generating human-like responses, summaries, or content tailored to the Real Estate domain.
  • Secure and Ethical Collection

  • Our proprietary data collection and transcription platform, “Yugo” was used throughout the process of this dataset creation.
  • Throughout the data collection process, the data remained within our secure platform and did not leave our environment, ensuring data security and confidentiality.
  • The data collection process adhered to strict ethical guidelines, ensuring the privacy and consent of all participants.
  • It does not include any personally identifiable information about any participant, which makes the dataset safe to use.
  • The dataset does not contain any copyrighted content.
  • Updates and Customization

    Understanding the importance of diverse environments for robust ASR models, our call center voice dataset is regularly updated with new audio data captured in various real-world conditions.

  • Customization & Custom Collection Options:
  • Environmental Conditions: Custom collection in specific environmental conditions upon request.
  • Sample Rates: Customizable from 8kHz to 48kHz.
  • Transcription Customization: Tailored to specific guidelines and requirements.
  • License

    This Real Estate domain call center audio dataset is created by FutureBeeAI and is available for commercial use.

    Use Cases

    Use of speech data in Conversational AI

    Call Center Conversational AI

    Use of speech data for Automatic Speech Recognition

    ASR

    Use of speech data for Chatbot & voicebot creation

    Chatbot

    Use of speech data in Language Modeling

    Language Modelling

    Use of speech data in Text-into-speech

    TTS

    Speech data usecase in Speech Analytics

    Speech Analytics

    Dataset Sample(s)

    Card Head Line
    00:00

    ATTRIBUTES

    TRANSCRIPTION

    TIME
    TRANSCRIPT
    1.769 - 2.655
    Hello Futurebee
    3.653 - 4.740
    Hello Futurebee
    5.855 - 6.689
    Good morning ma'am
    6.065 - 6.727
    -
    7.615 - 7.767
    -
    8.275 - 9.064
    Good morning
    9.502 - 11.487
    Welcome to <initial>JPS</initial> property maker.
    13.993 - 15.019
    Yes ma'am
    15.727 - 17.094
    How can I help you ma'am ?
    18.143 - 22.734
    [filler] ma'am I want I want I want to buy one land.
    24.143 - 27.085
    Okay ma'am we are getting around tutukadi
    26.925 - 27.350
    -
    28.568 - 30.277
    Yes ma'am I am tutukadi
    30.713 - 35.003
    Okay ma'am in tutukadi which area you prefer to get your land ?
    36.179 - 43.722
    In tutukadi American hospital nearby American hospital one empty land I saw one empty land is there
    44.323 - 49.587
    [filler] Can you say how much amount it will cost will be?
    45.371 - 46.048
    -
    50.911 - 54.344
    So you are expecting like a plot or house?
    52.923 - 54.734
    -
    55.261 - 55.493
    -
    55.734 - 59.314
    [filler] I am expecting to make a home
    56.996 - 58.121
    -
    59.978 - 61.084
    our own house .
    61.310 - 66.265
    [noise] Okay ma'am then you are going to buy an empty land your going to build by yourself
    62.609 - 63.634
    I want to build
    67.397 - 68.072
    Yes ma'am
    68.227 - 73.802
    Okay ma'am then empty land in the sense one plot costs around five point five lakhs ma'am
    72.319 - 72.915
    -
    73.867 - 74.177
    -
    74.959 - 75.039
    -
    75.772 - 80.825
    one plot [filler] One plot means how many cent it will be ?
    77.361 - 77.727
    Yes
    81.870 - 83.430
    Two point five cents ma'am
    82.733 - 83.474
    -
    83.870 - 84.837
    Two point five cents
    85.265 - 88.284
    Two point five cents five lakhs rupees
    85.873 - 86.515
    Yes
    88.936 - 90.566
    Yeah five point five lakhs ma'am
    90.784 - 93.040
    Five point five lakhs okay ma'am okay
    93.659 - 96.010
    [filler] Okay what are the
    95.066 - 96.117
    If you need
    97.168 - 98.061
    Yeah tell me ma'am
    98.831 - 111.001
    If you need to construct even we can construct and give you as plot itself from completely full furnished plot also we are giving and a semi furnished plot also we are giving and empty plot also we are giving ma'am.
    112.299 - 117.417
    [filler] Okay okay ma'am then you I you you will make that plot ma'am
    118.444 - 119.739
    For us
    118.459 - 123.180
    Okay ma'am then what is your requirement how many <initial>BHK</initial> you need ?
    124.796 - 131.355
    okay In first floor [filler] I need one car parking
    131.894 - 137.798
    [filler] area nearby car parking area [filler] Plantation trees
    138.449 - 142.180
    That and all I need I want to create that area
    142.770 - 150.889
    In first floor okay ground ground area then next in first floor one kitchen in first floor
    144.139 - 144.943
    Okay ma'am
    146.431 - 146.907
    Okay
    148.127 - 148.735
    Okay
    151.205 - 155.919
    One bedroom with attached bathroom and a hall, balcony
    156.526 - 160.513
    With balcony I need a one first floor
    162.181 - 167.048
    Okay ma'am then you are in the ground floor you are asking only for planting and car parking
    162.242 - 162.721
    okay
    167.358 - 170.389
    Yes ma'am only car parking and planting
    171.050 - 172.074
    no
    171.086 - 175.032
    In the first floor only you need kitchen and hall and one bedroom
    174.358 - 175.098
    -
    175.502 - 186.788
    One bedroom and also in above the second floor the same thing[filler] a bedroom double two bedroom and one kitchen and one[filler] balcony.
    187.276 - 193.252
    In bedroom I need balcony attached with our bedroom
    195.072 - 199.276
    Okay okay ma'am in the first floor also you need same like that balcony attached with bedroom
    199.471 - 200.792
    Yes yes ofcourse
    200.979 - 204.913
    Okay ma'am in first floor and second floor you need a balcony attached with bedroom
    206.877 - 208.818
    Yeah in the side
    207.479 - 208.739
    You need dining area?
    209.805 - 213.758
    Dining area is along with that kitchen
    214.959 - 221.585
    Okay ma'am I have an option ma'am you need the dining area like kitchen come dining or hall come dining
    223.080 - 237.669
    no no kitchen come dining hall separate oh I forget to add hall away kitchen with dining hall then hall that TV and all I see
    229.235 - 229.770
    Okay
    235.675 - 236.419
    Okay
    238.876 - 252.912
    Okay ma'am as per your requirement we can have some designs so we can share you in your whatsapp so that you can choose the like plans we can choose the plans first you can come
    245.770 - 246.441
    Okay ma'am
    250.887 - 251.311
    Oh
    254.359 - 255.407
    Okay ma'am
    255.151 - 260.161
    So that we can make documentation for getting that land
    262.043 - 262.858
    Okay ma'am
    263.119 - 269.639
    So after that we can proceed with the building construction because after getting the documentation only we can move for construction
    270.951 - 275.052
    Yeah ofcourse ma'am after finishing only
    271.980 - 272.973
    So
    275.653 - 288.117
    Yeah sure ma'am for the empty plot for that two point five cents plot you have to pay five point five lakhs and after the completion of documentation process we can start the construction for the construction
    288.579 - 293.305
    First square feet we offer like two thousand four hundred we are charging
    295.271 - 296.127
    For what ma'am
    296.730 - 299.834
    For construction we are charging yes
    298.338 - 306.781
    Okay but another I have one doubt that plot two point five cents is five point five lakhs
    307.129 - 311.841
    If I register that land means it will come more no
    307.992 - 308.612
    Yes
    313.357 - 314.968
    Yes it will come more
    313.572 - 317.377
    That how much I will pay
    317.973 - 321.670
    For documentation process it will come around eighty thousand ma'am
    322.153 - 325.605
    Eighty thousand okay for two point five cent land
    326.370 - 327.309
    Yes ma'am
    328.023 - 332.324
    oaky okay ma'am then after that only we construction building
    333.807 - 334.396
    yes
    334.490 - 337.329
    okay ma'am okay okay ma'am
    335.944 - 350.257
    so building construction cost is like for square feet two thousand four hundred that is like only for the budling not for the interior in that for interior we will be cost charging sperate amount and also for wood work will be charging sperate amount
    339.814 - 341.264
    two thousand four hundred
    343.350 - 344.951
    -
    351.932 - 358.343
    okay ma'am okay (()) inside the house or you put outside
    359.221 - 363.129
    That is as per your require ma'am you need inside or outside
    364.031 - 366.531
    I wish to put inside
    367.396 - 379.608
    Okay ma'am sure ma'am we can give you a construction as per your dream ma'am so it is completely customisable in the two point five cents we can put some plans like four to five plans I will share you
    381.060 - 381.709
    Okay ma'am
    381.314 - 392.079
    So you can check which plan is convenient for you you choose
    393.271 - 395.406
    Okay ma'am sure thank you
    395.701 - 399.951
    And also like you need to do inter
    401.384 - 404.971
    Yeah ofcourse I will do interior also
    402.384 - 402.990
    -
    405.747 - 419.471
    Okay ma'am then for interior we have many things
    419.826 - 426.596
    Not using the completely wood but the wood texture we can bring
    428.259 - 437.122
    yeah I like wood construction only ma'am but here wood construction is not possible for because of our
    429.319 - 429.910
    -
    434.367 - 435.262
    -
    437.151 - 438.983
    Climate
    438.105 - 439.146
    Climate yes
    439.466 - 452.665
    Yes ma'am but we can bring it in some other way so I will give you in that way same like wood we have like miniature theme miniature theme in the sense some people will not like too many clumsy items and all it will be very simple.
    452.935 - 460.389
    So miniature theme also we will give you and also rustik feel , rustic feel in the sense it will look like old a traditional.
    461.605 - 466.038
    I like traditional
    462.076 - 463.204
    Somewhat like that
    464.884 - 466.175
    Okay
    466.778 - 476.045
    Okay ma'am sure ma'am in the themes I will give you themes like around ten to twelve themes we will give you a based on the themes your selecting we will put the interior ma'am.
    477.185 - 481.148
    Can you share that teams and plans to my WhatsApp number ?
    482.016 - 486.115
    Yeah , sure ma'am I will initially I will give you the details regarding the plot
    486.923 - 494.932
    After that I will give you the plan models interior models and indoor Plantation models also ma'am.
    487.086 - 487.884
    Okay
    495.564 - 497.653
    And the themes also okay
    497.470 - 498.149
    Okay ma'am
    499.086 - 503.723
    So completely you can give everything to us and you can feel free to connect this ma'am
    504.822 - 507.127
    Sure ma'am sure sure I will connect okay
    507.317 - 518.240
    In your busy schedule you cannot took over each and everything so we are here to take care of all the things ma'am you can say whatever you need so that we will fulfill everything ma'am.
    519.879 - 521.908
    Okay ma'am okay thank you
    522.224 - 524.043
    Okay thank you for choosing us ma'am.
    525.259 - 528.158
    Thank you ma'am thank you thank you so much

    Dataset Details

    Card Head Line

    Language

    English

    Language code

    en-In

    Country

    India

    Accents

    Chandigarh, Chhattisgarh ...more

    Gender Distribution

    M:60, F:40

    Age Group

    18-70

    File Details

    Card Head Line

    Environment

    Silent, Noisy

    Bit Depth

    16 bit

    Format

    wav

    Sample rate

    8khz & 16khz

    Channel

    Stereo

    Audio file duration

    5-15 minutes

    Need datasets for a specific AI/ML use case?
    Don't worry, we've got you covered! 👍

    Contact Us
    Prompt 2 Bg