English (India) Call Center Speech Dataset for Travel

The audio dataset comprises call center conversations for the Travel domain, featuring native English speakers from India. It includes speech data, detailed metadata and accurate transcriptions.

Category

Unscripted Call Center Conversations

Total Volume

30 Speech Hours

Last updated

Jun 2024

Number of participants

60

English (India) call center audio recording for Travel industry
Download
Download Icon

About this Off-the-shelf Speech Dataset

Card Head Line

Introduction

Welcome to the Indian English Call Center Speech Dataset for the Travel domain designed to enhance the development of call center speech recognition models specifically for the Travel industry. This dataset is meticulously curated to support advanced speech recognition, natural language processing, conversational AI, and generative voice AI algorithms.

Speech Data:

This training dataset comprises 30 Hours of call center audio recordings covering various topics and scenarios related to the Travel domain, designed to build robust and accurate customer service speech technology.

  • Participant Diversity:
  • Speakers: 60 expert native Indian English speakers from the FutureBeeAI Community.
  • Regions: Different states/provinces of India, ensuring a balanced representation of Indian accents, dialects, and demographics.
  • Participant Profile: Participants range from 18 to 70 years old, representing both males and females in a 60:40 ratio, respectively.
  • Recording Details:
  • Conversation Nature: Unscripted and spontaneous conversations between call center agents and customers.
  • Call Duration: Average duration of 5 to 15 minutes per call.
  • Formats: WAV format with stereo channels, a bit depth of 16 bits, and a sample rate of 8 and 16 kHz.
  • Environment: Without background noise and without echo.
  • Topic Diversity

    This dataset offers a diverse range of conversation topics, call types, and outcomes, including both inbound and outbound calls with positive, neutral, and negative outcomes.

  • Inbound Calls:
  • Booking inquiries and assistance
  • Destination information and recommendations
  • Assistance with flight delays or cancellations
  • Special assistance for passengers with disabilities
  • Travel-related health and safety inquiry
  • Assistance with lost or delayed baggage, and many more
  • Outbound Calls:
  • Promotional offers and package deals
  • Customer satisfaction surveys
  • Booking confirmations and updates
  • Flight schedule changes and notifications
  • Customer feedback collection
  • Reminders for passport or visa expiration date, and many more
  • This extensive coverage ensures the dataset includes realistic call center scenarios, which is essential for developing effective customer support speech recognition models.

    Transcription

    To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. These transcriptions feature:

  • Speaker-wise Segmentation: Time-coded segments for both agents and customers.
  • Non-Speech Labels: Tags and labels for non-speech elements.
  • Word Error Rate: Word error rate is less than 5% thanks to the dual layer of QA.
  • These ready-to-use transcriptions accelerate the development of the Travel domain call center conversational AI and ASR models for the Indian English language.

    Metadata

    The dataset provides comprehensive metadata for each conversation and participant:

  • Participant Metadata: Unique identifier, age, gender, country, state, district, accent and dialect.
  • Conversation Metadata: Domain, topic, call type, outcome/sentiment, bit depth, and sample rate.
  • This metadata is a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of Indian English call center speech recognition models.

    Usage and Applications

    This dataset can be used for various applications in the fields of speech recognition, natural language processing, and conversational AI, specifically tailored to the Travel domain. Potential use cases include:

  • Speech Recognition Models: Training and fine-tuning speech recognition models for Indian English.
  • Speech Analytics Models: Building speech analytics models to extract insights, identify patterns, and glean valuable information from customer conversation, enables data-driven decision-making and process optimization within the Travel sector.
  • Smart Assistants and Chatbots: Developing conversational agents and virtual assistants for customer service in the Travel industries.
  • Sentiment Analysis: Analyzing customer sentiment and improving customer experience based on call center interactions.
  • Generative AI: Training generative AI models capable of generating human-like responses, summaries, or content tailored to the Travel domain.
  • Secure and Ethical Collection

  • Our proprietary data collection and transcription platform, “Yugo” was used throughout the process of this dataset creation.
  • Throughout the data collection process, the data remained within our secure platform and did not leave our environment, ensuring data security and confidentiality.
  • The data collection process adhered to strict ethical guidelines, ensuring the privacy and consent of all participants.
  • It does not include any personally identifiable information about any participant, which makes the dataset safe to use.
  • The dataset does not contain any copyrighted content.
  • Updates and Customization

    Understanding the importance of diverse environments for robust ASR models, our call center voice dataset is regularly updated with new audio data captured in various real-world conditions.

  • Customization & Custom Collection Options:
  • Environmental Conditions: Custom collection in specific environmental conditions upon request.
  • Sample Rates: Customizable from 8kHz to 48kHz.
  • Transcription Customization: Tailored to specific guidelines and requirements.
  • License

    This Travel domain call center audio dataset is created by FutureBeeAI and is available for commercial use.

    Use Cases

    Use of speech data in Conversational AI

    Call Center Conversational AI

    Use of speech data for Automatic Speech Recognition

    ASR

    Use of speech data for Chatbot & voicebot creation

    Chatbot

    Use of speech data in Language Modeling

    Language Modelling

    Use of speech data in Text-into-speech

    TTS

    Speech data usecase in Speech Analytics

    Speech Analytics

    Dataset Sample(s)

    Card Head Line
    00:00

    ATTRIBUTES

    TRANSCRIPTION

    TIME
    TRANSCRIPT
    1.375 - 2.750
    Hello Futurebee.
    1.975 - 3.575
    Hello Futurebee.
    4.125 - 4.825
    -
    5.525 - 6.349
    Hello ma'am.
    6.724 - 9.698
    Hello good morning. Welcome to <initial>SJ</initial> Holidays.
    10.121 - 10.570
    -
    11.875 - 13.000
    How can I help you?
    13.550 - 16.750
    Ma'am I planning for trip with my husband.
    18.725 - 19.600
    Okay.
    19.167 - 22.568
    So, can you give me some suggestions to plan the trip?
    22.713 - 23.789
    can you guide me?
    25.734 - 31.152
    Yeah, yes of course. But I need some information first. How many days trip you have plan?
    32.371 - 34.298
    I am planning to go for three days.
    35.639 - 39.963
    Three days okay. what type of place you would like to go?
    38.461 - 38.961
    -
    40.310 - 42.536
    [filler]Talking about climate.
    40.981 - 42.082
    -
    42.783 - 44.283
    [filler] Especially.
    45.195 - 47.896
    I need a cool climate not very hot.
    49.009 - 57.134
    Okay, [filler] then I suggest you go to Wayanad. That's a there is a beautiful place in Kerala. It's a chill climate.
    57.426 - 58.426
    [filler]
    58.170 - 59.045
    -
    58.643 - 60.368
    You hear about that place?
    61.639 - 66.614
    Yeah. I have heard and I heard it is near to Bangalore or Kerala. I don't know.
    68.087 - 70.061
    [filler] Yeah. It's a Kerala only.
    70.924 - 72.224
    Okay okay.
    74.141 - 75.242
    [filler]
    74.310 - 82.459
    So, can you say me how to go there? Will you arrange any travels or how your <initial>SJ</initial> Holidays working? how?
    79.968 - 81.394
    [filler]
    83.956 - 92.456
    Yeah, yes ma'am. [filler] We have three packages. [filler] Two days package and four package and one days package.
    88.772 - 89.447
    Okay.
    92.688 - 94.765
    [filler] Which one you want?
    93.623 - 94.224
    [filler]
    96.769 - 98.644
    I prefer four days package.
    99.090 - 106.215
    Four days package. [filler] Then you don't worry. Our agency will help you. Okay? [filler] They will
    105.774 - 106.447
    Okay.
    106.340 - 110.864
    [filler] guide you and they will book the hotels
    111.028 - 114.727
    and what are the tourist places are there in the Wayanad.
    114.944 - 117.545
    They will explain to you.
    118.394 - 123.870
    Okay. Now can you give me some explanation? How that trip will be? What are the sites seen there?
    119.977 - 120.554
    Okay.
    125.519 - 126.920
    [filler] Yeah. Of course.
    127.209 - 127.986
    [filler]
    128.449 - 139.377
    There [filler] more [filler] more site visiting places are there. One is zipline. It's a longest zipline [filler] place in Kerala only.
    140.389 - 143.764
    And then one trekking, trekking place is there.
    140.734 - 141.484
    Okay.
    144.133 - 153.961
    [filler] Another one is water falls, [filler] main [filler] Soochipara water fall is there. It is really very chill place.
    145.080 - 145.532
    Okay.
    149.116 - 149.616
    -
    154.257 - 161.979
    [filler] You, you in that place you should walk for nearly two to three kilometers.
    155.068 - 155.794
    Okay.
    163.532 - 167.556
    Okay. So, I have to walk two to three kilometers to reach the Soochipara falls.
    167.430 - 169.430
    Reach there yes yes yes.
    169.770 - 173.169
    So, there is no vehicles only the way mode is walking?
    172.389 - 180.264
    No no no. Only the way you have you walk only. Because stones and path is very narrow.
    177.133 - 177.610
    Okay.
    180.770 - 184.371
    So, vehicles are not allowed in that place.
    181.627 - 182.430
    Okay.
    184.895 - 189.443
    You should walk. But it's very really it's a good [filler]
    190.387 - 198.263
    [filler] It's perfect place to go with husband. You have to share your memories while walking is a very good
    198.520 - 199.169
    thing.
    199.985 - 201.187
    Yeah yeah yeah.
    201.519 - 206.842
    Okay, then how about the accommodation? You will arrange accommodation or we have to take care of it?
    202.110 - 203.032
    [filler]
    208.508 - 211.008
    No no. We will arrange the accommodations.
    211.245 - 217.645
    Okay. The packages will completely the accommodations and some to theme parks.
    218.669 - 222.247
    Not theme parks, it's a zipline and also the trekking.
    219.090 - 219.590
    Okay.
    223.026 - 223.877
    Okay.
    224.306 - 225.959
    We will arrange. Okay?
    226.120 - 230.645
    Only zipline and trekking you will arrange? Or some other site seen also?
    230.413 - 230.985
    Yeah.
    231.627 - 235.502
    Some others, if you want some other places we will arrange that also.
    236.520 - 239.645
    How many site seen is possible to cover in four days?
    240.663 - 244.937
    [filler] Four days. In one day you will cover three places.
    245.209 - 246.109
    [filler]
    245.728 - 246.824
    -
    247.294 - 256.096
    Yeah. [filler] One day we will cover the zipline, water falls and trekking. The next day you have to plan other [filler]
    252.723 - 253.775
    [filler]
    256.213 - 258.814
    places in Wayanad. Boating
    257.846 - 258.495
    -
    259.305 - 259.754
    Okay.
    260.115 - 263.591
    Okay. Boating and another dam
    260.480 - 261.105
    [filler]
    264.665 - 265.740
    (())
    265.103 - 265.978
    Okay.
    265.908 - 267.759
    That places not there in Wayanad
    268.274 - 271.074
    So, you don't worry about this. We will take care of it.
    272.584 - 275.560
    Yeah sure sure. Then how about the foods involve?
    276.744 - 280.793
    Foods involve maximum [filler] the food is
    281.156 - 286.483
    in Kerala style only. [filler] They will give food to
    286.737 - 288.211
    some (())
    287.901 - 288.576
    [filler]
    288.708 - 291.608
    like that they, they will give.
    288.995 - 290.021
    Okay okay.
    292.721 - 294.596
    That is also provided by
    296.338 - 297.939
    (()) or by
    297.083 - 299.283
    Sorry ma'am. I can't able to hear.
    301.826 - 303.153
    ourself?
    303.324 - 303.848
    hello?
    305.125 - 308.526
    Foods are provided by your side or we have to take care of it?
    306.165 - 307.014
    [filler] Yes ma'am.
    310.838 - 313.012
    No no. Food also
    311.651 - 313.026
    Okay ma'am. Okay okay.
    313.218 - 327.245
    Food also come with that packages only. [filler] You will pay first the packages fully. If the package is two thousand means that accommodation and food is including only.
    327.704 - 329.579
    So, you don't pay the extra.
    328.146 - 329.596
    Okay ma'am. How much
    330.377 - 331.927
    How much is the package?
    331.245 - 332.069
    [filler]
    332.983 - 341.533
    [filler] In a two days package means it's a fifteen thousand. [filler] Four days package means it will be twenty five thousand
    341.920 - 342.470
    nearly.
    342.165 - 342.740
    Okay.
    342.901 - 347.026
    Okay okay okay. Then how I have to book it?
    348.920 - 351.120
    [filler] You go to our website ma'am.
    351.266 - 360.540
    (()) website and choose your package and what are the things you have [filler] you need means you choose that package and
    351.372 - 352.197
    Okay.
    360.810 - 364.360
    [filler] select whatever you want.
    365.447 - 376.773
    Okay ma'am. Then in accommodation and all we can choose by ourself by seeing the hotels will that will you give any photos for choosing the hotels or randomly you will give?
    373.141 - 373.891
    -
    375.918 - 377.442
    Yes yes yes ma'am.
    378.028 - 390.230
    No no. In that website we upload more accommodation photos, more hotels more resorts images of that. You have to choose and in (())
    390.781 - 394.432
    the packages hotel rents
    394.696 - 395.721
    is
    396.355 - 398.129
    there in that image only.
    399.485 - 404.305
    Okay okay. I got your point ma'am. And one more thing I want to ask is
    402.543 - 403.043
    Okay.
    404.576 - 409.326
    if I am coming with my husband alone that is two people means you are saying twenty five thousand.
    409.521 - 414.721
    If I am getting [filler] my kids also two more means how the package is the same?
    414.915 - 416.867
    Or it will increase?
    418.766 - 424.391
    It little bit increase. Just thirty thousand around four members means thirty around.
    425.103 - 429.177
    Okay okay ma'am. And before how many days I have to book in your
    427.646 - 428.271
    Okay.
    429.374 - 430.124
    website?
    432.470 - 433.694
    [filler] How many?
    433.487 - 437.362
    Before how many days of this trip I have to book in your website?
    439.629 - 442.555
    For example if I (()) going next week means
    441.752 - 442.475
    [filler]
    442.886 - 447.911
    I can book one day before the trip or I have to book one week before?
    444.980 - 445.430
    (())
    447.122 - 447.771
    No no no.
    448.516 - 451.218
    No no no. You have to book at least ten days before.
    451.468 - 457.543
    Okay. If I'm cancelling in any case will the amount will refund completely or partially?
    454.338 - 454.737
    [filler]
    455.136 - 455.610
    (())
    457.610 - 458.286
    No no no.
    458.709 - 464.459
    Not completely. We five percentage we will take that amount. If you
    465.774 - 467.199
    cancel the trip.
    467.523 - 469.523
    Okay. If I am changing the dates means
    470.595 - 477.920
    Yeah. Changing the date means it's not a problem. [filler] We are not cancelling. You are not charging [filler]
    476.319 - 477.194
    (())
    477.766 - 485.180
    Okay okay ma'am. [filler] And also will you arrange the cab for me to [filler] get out from a home
    485.925 - 487.449
    to the flight or train?
    487.718 - 488.418
    [filler]
    488.620 - 490.146
    For the start of journey no?
    488.843 - 491.451
    No ma'am. No. That's no no.
    491.495 - 492.596
    [noise] Okay.
    493.232 - 498.982
    [filler] And also from I am from south side from here to Kerala also you will book?
    499.528 - 501.528
    local trip only you won't book, right?
    502.968 - 504.319
    Yes ma'am yes.
    504.204 - 506.305
    Okay okay. I got your point ma'am.
    506.617 - 508.817
    And after finishing the trip
    510.384 - 511.685
    the same procedure?
    510.485 - 511.084
    -
    511.997 - 514.072
    You will drop where I depart.
    513.182 - 514.081
    Yes.
    514.615 - 515.442
    Okay.
    515.846 - 516.649
    [filler] Yes ma'am.
    516.375 - 520.926
    Okay ma'am. Thank you ma'am. I will ask my husband and I will update you
    517.062 - 517.687
    [filler]
    520.326 - 521.000
    Thank you.
    521.149 - 523.576
    by tomorrow or day after tomorrow ma'am.
    523.600 - 523.875
    -
    525.975 - 527.524
    Okay ma'am okay. Thank you.
    526.475 - 527.600
    Thank you.
    529.024 - 529.801
    Thank you.

    Dataset Details

    Card Head Line

    Language

    English

    Language code

    en-In

    Country

    India

    Accents

    Chandigarh, Chhattisgarh ...more

    Gender Distribution

    M:60, F:40

    Age Group

    18-70

    File Details

    Card Head Line

    Environment

    Silent, Noisy

    Bit Depth

    16 bit

    Format

    wav

    Sample rate

    8khz & 16khz

    Channel

    Stereo

    Audio file duration

    5-15 minutes

    Need datasets for a specific AI/ML use case?
    Don't worry, we've got you covered! 👍

    Contact Us
    Prompt 2 Bg