English (UK) Call Center Speech Dataset for Telecom

The audio dataset comprises call center conversations for the Telecom domain, featuring native English speakers from UK. It includes speech data, detailed metadata and accurate transcriptions.

Category

Unscripted Call Center Conversations

Total Volume

30 Speech Hours

Last updated

Jun 2024

Number of participants

60

English (UK) call center audio recording for Telecom industry
Download
Download Icon

About this Off-the-shelf Speech Dataset

Card Head Line

Introduction

Welcome to the UK English Call Center Speech Dataset for the Telecom domain designed to enhance the development of call center speech recognition models specifically for the Telecom industry. This dataset is meticulously curated to support advanced speech recognition, natural language processing, conversational AI, and generative voice AI algorithms.

Speech Data

This training dataset comprises 30 Hours of call center audio recordings covering various topics and scenarios related to the Telecom domain, designed to build robust and accurate customer service speech technology.

  • Participant Diversity:
  • Speakers: 60 expert native UK English speakers from the FutureBeeAI Community.
  • Regions: Different regions of United Kingdom, ensuring a balanced representation of UK accents, dialects, and demographics.
  • Participant Profile: Participants range from 18 to 70 years old, representing both males and females in a 60:40 ratio, respectively.
  • Recording Details:
  • Conversation Nature: Unscripted and spontaneous conversations between call center agents and customers.
  • Call Duration: Average duration of 5 to 15 minutes per call.
  • Formats: WAV format with stereo channels, a bit depth of 16 bits, and a sample rate of 8 and 16 kHz.
  • Environment: Without background noise and without echo.
  • Topic Diversity

    This dataset offers a diverse range of conversation topics, call types, and outcomes, including both inbound and outbound calls with positive, neutral, and negative outcomes.

  • Inbound Calls:
  • Phone Number Porting
  • Network Connectivity Issues
  • Billing and Payments
  • Technical Support
  • Service Activation
  • International Roaming Enquiry
  • Refunds and Billing Adjustments
  • Emergency Service Access, and many more
  • Outbound Calls:
  • Welcome Calls / Onboarding Process
  • Payment Reminders
  • Customer Surveys
  • Technical Updates
  • Service Usage Reviews
  • Network Compliant Status Call, and many more
  • This extensive coverage ensures the dataset includes realistic call center scenarios, which is essential for developing effective customer support speech recognition models.

    Transcription

    To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. These transcriptions feature:

  • Speaker-wise Segmentation: Time-coded segments for both agents and customers.
  • Non-Speech Labels: Tags and labels for non-speech elements.
  • Word Error Rate: Word error rate is less than 5% thanks to the dual layer of QA.
  • These ready-to-use transcriptions accelerate the development of the Telecom domain call center conversational AI and ASR models for the UK English language.

    Metadata

    The dataset provides comprehensive metadata for each conversation and participant:

  • Participant Metadata: Unique identifier, age, gender, country, state, district, accent and dialect.
  • Conversation Metadata: Domain, topic, call type, outcome/sentiment, bit depth, and sample rate.
  • This metadata is a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of UK English call center speech recognition models.

    Usage and Applications

    This dataset can be used for various applications in the fields of speech recognition, natural language processing, and conversational AI, specifically tailored to the Telecom domain. Potential use cases include:

  • Speech Recognition Models: Training and fine-tuning speech recognition models for UK English.
  • Speech Analytics Models: Building speech analytics models to extract insights, identify patterns, and glean valuable information from customer conversation, enables data-driven decision-making and process optimization within the Telecom sector.
  • Smart Assistants and Chatbots: Developing conversational agents and virtual assistants for customer service in the Telecom industries.
  • Sentiment Analysis: Analyzing customer sentiment and improving customer experience based on call center interactions.
  • Generative AI: Training generative AI models capable of generating human-like responses, summaries, or content tailored to the Telecom domain.
  • Secure and Ethical Collection

  • Our proprietary data collection and transcription platform, “Yugo” was used throughout the process of this dataset creation.
  • Throughout the data collection process, the data remained within our secure platform and did not leave our environment, ensuring data security and confidentiality.
  • The data collection process adhered to strict ethical guidelines, ensuring the privacy and consent of all participants.
  • It does not include any personally identifiable information about any participant, which makes the dataset safe to use.
  • The dataset does not contain any copyrighted content.
  • Updates and Customization

    Understanding the importance of diverse environments for robust ASR models, our call center voice dataset is regularly updated with new audio data captured in various real-world conditions.

  • Customization & Custom Collection Options:
  • Environmental Conditions: Custom collection in specific environmental conditions upon request.
  • Sample Rates: Customizable from 8kHz to 48kHz.
  • Transcription Customization: Tailored to specific guidelines and requirements.
  • License

    This Telecom domain call center audio dataset is created by FutureBeeAI and is available for commercial use.

    Use Cases

    Use of speech data in Conversational AI

    Call Center Conversational AI

    Use of speech data for Automatic Speech Recognition

    ASR

    Use of speech data for Chatbot & voicebot creation

    Chatbot

    Use of speech data in Language Modeling

    Language Modelling

    Use of speech data in Text-into-speech

    TTS

    Speech data usecase in Speech Analytics

    Speech Analytics

    Dataset Sample(s)

    Card Head Line
    00:00

    ATTRIBUTES

    TRANSCRIPTION

    TIME
    TRANSCRIPT
    0.709 - 1.504
    Hello, Future Bee.
    2.450 - 3.504
    [noise] Hello, Future Bee.
    4.453 - 5.070
    10.015 - 12.960
    Hi, I'm I'm just calling to inquire about broadband. [noise]
    15.163 - 16.277
    Okay, #Amm you
    16.835 - 18.725
    #Ah interested in a landline with that as well?
    18.739 - 18.986
    -
    19.826 - 21.341
    Yeah. [noise]
    21.326 - 24.106
    Okay, right, because most of our broadband packages #Ah include a landline.
    24.657 - 25.059
    #Amm
    25.617 - 26.632
    Okay, so [noise]
    26.501 - 27.117
    Okay, good. [noise]
    27.835 - 31.899
    #Amm I've got a few packages here that I can #Amm talk to you about if that's all right.
    33.792 - 35.780
    Yeah, that'd be great. Thank you. [noise]
    33.899 - 34.398
    #Amm
    35.469 - 39.215
    So we've got five different packages here and it really depends on #Amm
    37.134 - 37.350
    -
    41.075 - 41.783
    #Amm
    43.143 - 46.334
    who you've got in the household, really, #Ah how many people you've got in the household and what
    46.828 - 48.457
    kind of things they like to use the internet for.
    49.820 - 51.334
    #Amm So
    51.573 - 52.292
    -
    52.615 - 57.533
    #Ah Yeah I'm (()) things (())things So our like our
    54.341 - 54.868
    [noise] I think
    58.262 - 60.963
    cheapest package that we have is the full fiber two,
    62.246 - 62.756
    #Amm
    63.215 - 67.656
    nah has a download speed of seventy-three #Ah megabytes per second.
    68.876 - 74.769
    #Amm And it has a guaranteed #Amm download speed of thirty-seven megabytes per second.
    75.302 - 75.870
    -
    76.757 - 77.668
    #Ah So.
    77.936 - 84.364
    Okay. So what's the what's the difference between the guaranteed and the download speed? [noise]
    84.983 - 90.239
    Okay. So #Amm the the download speed, so the higher number, the seventy-three megabytes per second,
    90.840 - 92.078
    #Amm that one
    92.784 - 95.114
    is #Ah how it should just generally run.
    95.587 - 98.757
    #Amm So tha~ that's that's around where it should be all the time.
    99.525 - 101.179
    #Amm But then sometimes
    102.230 - 102.804
    #Amm
    104.575 - 105.522
    there #Amm
    106.075 - 112.784
    (()) when the running speed is a bit slower, #Amm for whatever reason, #Ah time of day, that sort of thing, #Ah affects it,
    113.358 - 118.373
    #Ah then it shouldn't go below thirty-seven megabytes per second. So that's the guaranteed
    119.299 - 120.668
    #Ah minimum. It's gonna go
    121.171 - 122.349
    [noise] It's not to go beyond.
    124.739 - 125.700
    Okay, thank you.
    124.801 - 125.426
    #Amm
    127.406 - 131.395
    So then #Ah we've got a upload speed of eighteen mega bytes per second.
    129.068 - 129.454
    -
    132.518 - 135.973
    #Ah How many people do you #Amm use the internet for streaming?
    138.733 - 148.996
    Yes sir. you #Amm you you ask about how many people in my house. (()) [noise] And there's me, there's my partner and our son. [noise] He's #Amm
    144.330 - 144.443
    149.592 - 152.663
    sixteen and he he does a lot of gaming.
    153.157 - 154.872
    #Amm He ~ he does it for about,
    156.312 - 160.657
    he actually does it for about eight hours a day. [noise] It's really, yeah it's just that age I think.
    160.661 - 160.984
    163.354 - 167.526
    Yeah, so so you're going to want something that's going to be really good for gaming, so you're going to want a faster speed.
    168.020 - 169.115
    #Amm #Ah
    168.580 - 170.693
    Yeah, he's gonna he's gonna really want that. [noise]
    171.860 - 179.348
    Okay. #Amm Do you do you and your partner use the use the internet for streaming? Do you watch Netflix and things like that? Youtube?
    179.092 - 192.346
    Yeah, we we watch #Amm we watch Netflix and #Amm we have we connect to Amazon Prime Video as well, and #Amm <initial>BBC</initial> I Player. So yeah, we do quite a lot of streaming.
    194.358 - 197.258
    Okay, brilliant. Right, so #Amm how many
    198.127 - 200.949
    How many people stream in your house at any one time, would you say?
    201.711 - 202.323
    #Amm
    204.294 - 205.143
    #Amm
    205.143 - 205.518
    -
    206.830 - 212.241
    It's it's it's normally [noise] more than two, sorry, more than one. And
    213.044 - 216.425
    [noise] Yeah, like Its I don't think it's very common that
    214.235 - 214.335
    -
    216.943 - 219.431
    my partner will stream at separate times.
    221.252 - 223.252
    But that's on his streaming. [noise]
    223.324 - 223.960
    Right. Okay.
    225.115 - 227.967
    Okay. So, #Amm because we've got different packages for
    228.473 - 228.984
    #Amm
    229.877 - 232.024
    different amounts of people #Ah to stream
    232.586 - 236.060
    for household. So the package that I was reading about before is #Amm
    237.115 - 239.651
    #Ah allows two simultaneous streaming #Amm
    240.554 - 241.794
    to go on at the same time. #Amm
    243.729 - 247.020
    But you probably want a bit of #Mmm a faster,
    248.330 - 248.793
    #Ah you
    249.336 - 255.048
    probably want it to be a bit faster if you're gonna to have, if you're trying to be on all the time online gaming and things,
    252.072 - 253.568
    Yeah (())
    255.872 - 256.769
    #Ah in #Amm
    258.824 - 262.617
    because that can make the, if you're watching something at the same time, it can make that a bit slower
    263.201 - 264.728
    So #Amm
    265.884 - 266.704
    -
    266.458 - 268.007
    We've got a package here for
    268.980 - 269.473
    #Amm
    270.084 - 270.637
    our full fiber
    271.194 - 272.187
    #Amm one hundred
    272.872 - 274.797
    is #Ah one hundred megabytes per second.
    275.920 - 276.875
    #Amm
    277.692 - 279.398
    #Ah Guaranteed #Ah download speed
    280.194 - 281.389
    of #Amm
    282.374 - 283.528
    fifty megabytes per second.
    284.598 - 286.896
    And you can four people can stream at the same time.
    288.526 - 289.084
    #Amm
    289.875 - 291.848
    Does your does your son download a lot of games?
    294.389 - 295.266
    sounds like a bus
    294.891 - 297.141
    #Amm Yeah, yeah he does, yeah.
    297.637 - 301.600
    There's #Amm different packages that allow different
    302.170 - 302.848
    #Amm
    303.745 - 305.410
    download times for for games
    306.064 - 306.745
    So
    307.473 - 308.076
    #Amm
    310.004 - 311.358
    Yeah, please tell me about them. [noise]
    310.442 - 310.841
    -
    312.586 - 313.490
    Yeah, we have from #Amm
    315.410 - 319.319
    #Ah a highest of ten minutes, and then there's six minutes, and then there's #Amm
    317.413 - 317.574
    319.836 - 320.805
    three minutes as well.
    321.322 - 323.677
    After that, after that it gets really quick. So,
    324.201 - 326.757
    #Amm our more expensive packages are
    327.797 - 332.136
    #Amm twenty-nine seconds and nineteen seconds, but those I think are for,
    333.218 - 337.165
    #Ah I don't think you probably need those. therefore #Amm
    338.031 - 339.247
    They're for really big households. So
    339.809 - 343.310
    they allow #Ah twenty twenty and thirty six people to stream at the same time.
    342.283 - 343.024
    [noise] I see
    344.387 - 345.470
    #Amm But maybe
    346.557 - 351.781
    May be your #Ah Core Fiber two hundred would be #Amm good for you because
    347.725 - 348.237
    Okay.
    352.725 - 353.247
    #Amm
    353.812 - 356.358
    that has two hundred megabytes per second download speed.
    356.829 - 357.869
    #Amm So
    358.586 - 359.055
    #Ah should
    359.600 - 365.680
    #Amm it should lag too much if #Ah your son is gaming all day and you and your husband want to download something to watch.
    366.800 - 369.629
    #Amm #Amm Likewise, if you're watching something, it shouldn't
    370.127 - 372.576
    lag much when your son is trying to download something either.
    373.230 - 380.187
    #Amm It's got it's got a guaranteed #Ah download speed of hundred megabytes per second. So #Hmm
    380.908 - 384.177
    you're you gonna to be okay. #Amm No matter what the Internet is doing, really.
    385.362 - 386.516
    #Amm You've got.
    386.860 - 392.482
    Okay, that sounds [noise] that sounds great. Can you tell me about the upload speed? [noise] #Mmm
    392.411 - 396.156
    #Ah The upload speed for that one is #Ah twenty-seven megabytes per second.
    397.326 - 404.625
    #Ah So that's also pretty good. It allows eight people to stream at the same time, so I don't know if you're #Amm going to have lots of guests
    398.091 - 398.564
    Okay.
    398.987 - 399.437
    -
    403.350 - 403.593
    -
    405.283 - 407.485
    #Amm on the <initial>WIFI</initial> #Amm
    408.079 - 411.048
    Yeah, I I mean he might have his friends around sometimes. [noise]
    410.959 - 412.872
    Yeah, yeah, it's a good point. They might want to
    413.634 - 414.605
    #Amm you might all want to
    415.262 - 416.286
    have the phones on so
    417.148 - 423.833
    yeah, [noise] #Amm and then that one as well, you've got #Amm the game download speed #Amm
    424.562 - 425.271
    is
    425.992 - 428.144
    four point five megabytes, #Ah gigabytes
    429.574 - 435.206
    #Ah across the board, #Amm , or they all have that but some of them are a bit quicker than others. So that one takes three minutes. So
    435.773 - 435.884
    -
    435.915 - 438.901
    #Amm even if he's an impatient teenager, I think three minutes is
    439.812 - 440.889
    [noise] is a
    441.449 - 443.000
    not something he's gonna to get too upset about.
    444.060 - 445.127
    #Amm [noise] G
    444.872 - 447.331
    Yeah, I'm sure he'll be very happy about that.
    448.172 - 452.504
    [laugh] Yeah. #Amm So do you have, #Ah do you upload a lot of pictures as well?
    454.853 - 461.600
    #Amm Yeah, yeah. My husband's a photographer, so [noise] yeah, he actually, #Amm he does take a lot of photos.
    463.259 - 464.473
    #Ah Excellent. So it's, #Amm
    463.406 - 463.677
    -
    465.055 - 467.939
    you know, right uploads #Ah two hundred and fifty megabytes
    468.889 - 469.925
    oh #Amm
    470.677 - 472.423
    of pictures in one minute.
    472.944 - 474.350
    So with this with this package, so
    474.821 - 477.295
    he's he's taken a lot of pictures, what's (()) now and
    478.132 - 479.562
    this one, it comes with, #Amm
    480.115 - 480.785
    gonna am a needs now
    481.889 - 482.394
    #Ah
    483.213 - 486.646
    Okay #Ah this one comes with do you have a landline already? [noise]
    488.995 - 490.694
    Yeah, we do #Amm [noise]
    491.175 - 496.271
    Yeah, I'm not sure who it's it's currently with at the moment, but #Amm yeah.
    497.865 - 500.653
    Well, #Ah this this comes with a landline as part of the package.
    501.163 - 503.509
    #Amm Okay. We can we can #Amm
    501.776 - 502.338
    [noise]Okay
    504.435 - 506.903
    transfer your old number onto this one free of charge.
    507.845 - 511.451
    So the landmine is not gonna to cost you any more. So
    508.891 - 509.540
    Okay [noise]
    512.666 - 514.518
    #Ah is that something that you'd be interested in?
    516.211 - 518.104
    Yeah, that that would be good actually, yeah.
    518.831 - 523.394
    Okay perfect #Amm So this package is #Ah from thirty- five pounds amount
    523.408 - 523.509
    -
    525.662 - 526.259
    the sound okay.
    526.980 - 543.802
    [noise] If I consummate #Amm [noise] Okay, yeah, I think we're on about twenty-five at the moment for our current broadband, so, #Amm but it is isn't isn't very fast at the moment and we have been, #Amm yeah, you know, that's why [noise]
    537.346 - 537.677
    -
    544.557 - 547.849
    I'm looking to ch~ ch~ change that and
    548.346 - 557.363
    make it quicker for, main~ mainly for my son really, but also because because the rest of the house, we just don't really get so much. #Amm My niece-son is [noise]
    558.008 - 559.365
    My niece-son is games.
    561.923 - 567.975
    Okay, well, #Ah this should be much better for you then. It's an extra ten pounds a month. So, #Amm yeah, perfect. Shall I
    567.408 - 567.817
    yeah
    568.937 - 569.346
    shall I
    570.821 - 573.375
    #Amm send you all of this in an email so you can
    574.000 - 575.211
    have a chat with your husband about it?
    575.211 - 579.024
    Yeah, please. Please send it all over an email, that'd be great. Okay,
    580.240 - 586.518
    Okay I'll send you everything to look at and then just get back in touch whenever you're ready and we'll get you sorted out.
    588.331 - 590.254
    That's great, thank you so much for your help today.
    591.899 - 592.947
    Oh, thank you. #Ah
    593.432 - 594.363
    Have a great day.
    595.350 - 595.666
    Bye. [noise]
    596.201 - 598.129
    [noise]Yeah, you too. Thanks, bye.

    Dataset Details

    Card Head Line

    Language

    English

    Language code

    en-gb

    Country

    UK

    Accents

    English - East and Central Midlands, English - East Anglia ...more

    Gender Distribution

    M:60, F:40

    Age Group

    18-70

    File Details

    Card Head Line

    Environment

    Silent, Noisy

    Bit Depth

    16 bit

    Format

    wav

    Sample rate

    8khz & 16 khz

    Channel

    Stereo

    Audio file duration

    5-15 minutes

    Need datasets for a specific AI/ML use case?
    Don't worry, we've got you covered! 👍

    Contact Us
    Prompt 2 Bg