English (UK) Call Center Speech Dataset for Retail & E-commerce

This audio dataset contains call center conversations in the Retail & E-commerce domain, featuring native English speakers from the UK, with detailed metadata and accurate transcriptions.

Category

Unscripted Call Center Conversations

Total Volume

30 Speech Hours

Last updated

July 2023

Number of participants

60

About this Off-the-shelf Speech Dataset

What’s Included

Welcome to the English Language Call Center Speech Dataset for the Retail and E-commerce domain. It is a specialized collection of voice data designed to support the development of call center speech recognition models for the Retail and E-commerce industry.


With high-quality call center audio recordings, detailed metadata, and accurate transcriptions, it helps researchers and developers improve natural language processing, conversational AI, and generative voice AI algorithms in the Retail and E-commerce domain. It also facilitates the creation of voice assistants and voice bots tailored to the linguistic nuances of English as spoken in the United Kingdom.


Speech Data:

This training dataset comprises 30 hours of call center audio recordings covering various topics and scenarios in the Retail and E-commerce domain, intended for building robust and accurate customer service speech technology.


To curate realistic call center interactions, we collaborated with a diverse network of 60 native English speakers from different regions of the United Kingdom. This collaboration aims for a balanced representation of British accents, dialects, and demographics, promoting inclusivity and reducing bias in the dataset.


Each audio recording captures an unscripted, spontaneous conversation between a call center agent and a customer, with call durations ranging from 5 to 15 minutes. The dataset includes both inbound and outbound calls, covering scenarios such as inquiries, promotional offers, complaints, and technical support. It also contains conversations with both positive and negative outcomes, making the data diverse and realistic.


The speech data is available in WAV format with two separate channels, a bit depth of 16 bits, and a sample rate of 8 kHz, which is the standard rate for telephony audio. The recording environment is generally quiet, with little background noise or echo.
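
The stated audio format can be checked with Python's standard `wave` module. The sketch below is only an illustration: it assumes that each channel carries one speaker (as the sample attributes suggest), and the file path passed in is whatever name your copy of the data uses, since the dataset's file naming is not documented here.

```python
import wave


def describe_wav(path):
    """Return channel count, bit depth, sample rate and duration of a WAV file."""
    with wave.open(path, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "bit_depth": wav.getsampwidth() * 8,  # sample width is in bytes
            "sample_rate": rate,
            "duration_s": frames / rate,
        }


def split_channels(path):
    """Split a 16-bit, two-channel WAV into two raw PCM byte strings."""
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 2 or wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit, two-channel audio")
        raw = wav.readframes(wav.getnframes())
    # Frames are interleaved (ch1, ch2, ch1, ch2, ...), 2 bytes per sample,
    # so each frame occupies 4 bytes.
    first = b"".join(raw[i:i + 2] for i in range(0, len(raw), 4))
    second = b"".join(raw[i + 2:i + 4] for i in range(0, len(raw), 4))
    return first, second
```

For a file that matches the description above, `describe_wav` should report 2 channels, a bit depth of 16, and a sample rate of 8000.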


Metadata:

In addition to the audio recordings, our dataset provides comprehensive metadata for each participant. This includes the participant’s age, gender, country, state, and dialect. Additionally, it includes metadata like domain, topic, call type, outcome, bit depth, and sample rate for each conversation.


The metadata serves as a powerful tool for understanding and characterizing the data, enabling informed decision-making in the development of English language call center speech recognition models for the Retail and E-commerce domain.
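
One way to use this metadata is to select subsets of calls before training or evaluation. The following is a minimal sketch, not the dataset's actual schema: the class name, the field names, and the example values are all hypothetical and simply mirror the attributes listed above.

```python
from dataclasses import dataclass


@dataclass
class CallMetadata:
    # Hypothetical field names; the real metadata layout may differ.
    call_id: str
    topic: str
    call_type: str  # assumed values: "inbound" or "outbound"
    outcome: str    # assumed values: "positive" or "negative"
    dialect: str


def select_calls(records, call_type=None, outcome=None):
    """Return the records matching the given call type and/or outcome."""
    return [
        r for r in records
        if (call_type is None or r.call_type == call_type)
        and (outcome is None or r.outcome == outcome)
    ]


if __name__ == "__main__":
    # Made-up records, for illustration only.
    records = [
        CallMetadata("c1", "order status", "inbound", "positive", "dialect A"),
        CallMetadata("c2", "promotion", "outbound", "negative", "dialect B"),
        CallMetadata("c3", "returns", "inbound", "negative", "dialect A"),
    ]
    for record in select_calls(records, call_type="inbound"):
        print(record.call_id, record.topic, record.outcome)
```

Filtering on outcome or call type in this way makes it easy to check, for example, whether a model performs equally well on complaint calls and on routine inquiries.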


Transcription:

To facilitate your workflow, the dataset includes manual verbatim transcriptions of each call center audio file in JSON format. The transcriptions are segmented by speaker and time-coded, include non-speech labels and tags, and cover both the agent and the customer sides of each conversation.


These ready-to-use transcriptions accelerate the development of Retail and E-commerce call center conversational AI and ASR models for the English language.
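
As an illustration of how time-coded, speaker-wise segments can be consumed, the sketch below sums speech time per speaker. The JSON key names (`label`, `start`, `end`, `channel`, `transcript`) are assumptions modeled on the column headings of the sample transcription on this page; the actual JSON schema may differ. The segment values are taken from that sample.

```python
import json
from collections import defaultdict


def speaking_time(segments):
    """Sum the duration of speech segments per speaker, in seconds."""
    totals = defaultdict(float)
    for seg in segments:
        if seg["label"] != "Speech":
            continue  # skip noise and other non-speech segments
        totals[seg["channel"]] += seg["end"] - seg["start"]
    return dict(totals)


# Assumed layout: a flat list of segment objects.
sample = json.loads("""
[
  {"label": "Noise",  "start": 9.952,   "end": 10.111,  "channel": "-",         "transcript": "-"},
  {"label": "Speech", "start": 10.169,  "end": 11.852,  "channel": "Speaker 1", "transcript": "in the south of London"},
  {"label": "Speech", "start": 34.127,  "end": 34.450,  "channel": "Speaker 1", "transcript": "Yeah"},
  {"label": "Speech", "start": 109.545, "end": 111.150, "channel": "Speaker 2", "transcript": "Oh wow, that's pretty interesting"}
]
""")

if __name__ == "__main__":
    for speaker, seconds in speaking_time(sample).items():
        print(f"{speaker}: {seconds:.3f} s")
```

The same loop can be extended to compute talk-time ratios or to detect overlapping speech by comparing the start and end times of segments from different speakers.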


Updates and Customization:

We understand the importance of collecting data in various environments to build robust ASR models. Therefore, our call center voice dataset is regularly updated with new audio data captured in diverse real-world conditions.


If you require a custom training dataset with specific environmental conditions, we can accommodate your request. We can provide voice data at sample rates from 8 kHz to 48 kHz, allowing you to fine-tune your models for different audio recording setups. We can also customize the transcription to follow your specific guidelines and requirements, to further support your ASR development process.


License:

This Retail and E-commerce call center audio dataset is created by FutureBeeAI and is available for commercial use!


Conclusion:

Whether you are training or fine-tuning speech recognition models, advancing NLP algorithms, or building state-of-the-art voice assistants to improve customer experiences in the Retail and E-commerce sector, our dataset serves as a trusted resource to meet your goals.


Use Cases

Automatic Speech Recognition (ASR)

Conversational AI

Chatbot and voicebot creation

Language Modelling

Text-to-Speech (TTS)

Speech Analytics

Dataset Sample(s)

ATTRIBUTES

Channel 1 | Channel 2 | Format
Male(23) | Male(22) | wav, json

TRANSCRIPTION

LABEL | START | END | CHANNEL | TRANSCRIPT
Speech | 1.559 | 9.695 | Speaker 1 | Hi, this is <PII>John Smith</PII> from Prince [filler] Prince Aras. We are a small printing company here in
Noise | 9.952 | 10.111 | - | -
Speech | 10.169 | 11.852 | Speaker 1 | in the south of London
Speech | 12.342 | 18.314 | Speaker 1 | [filler] and I am contacting you [filler] here at the [filler] obviously if you know the e-commerce [filler]
Speech | 18.951 | 29.867 | Speaker 1 | e-commerce be [filler] because I was wondering how you guys could help me move my small printing company and grow it with the presence of the Internet
Noise | 24.838 | 25.138 | - | -
Speech | 30.832 | 33.493 | Speaker 1 | and and using your services so[filler]
Speech | 34.127 | 34.450 | Speaker 1 | Yeah
Speech | 34.484 | 39.103 | Speaker 2 | Hello, <PII>Mr. John Smith</PII>, so good to speak to you today [filler] you know, before we get started
Speech | 39.581 | 50.429 | Speaker 2 | I just wanna clarify, by printing you mean you know three d, you're talking about three d printers, printing small toys and objects for business, correct? [filler] Or are you refering to printing paper?
Speech | 47.408 | 48.423 | Speaker 1 | We, we do
Noise | 48.493 | 51.075 | - | -
Speech | 51.115 | 64.840 | Speaker 1 | So when I said printing obviously [filler] I am not sure if you are aware of the printing industry [filler] so we are involved in many aspects of the printing industry, so as you said we do some three d printing, altough it's a small aspect of our corporation and business at the moment.
Noise | 64.884 | 65.221 | - | -
Speech | 65.322 | 73.563 | Speaker 1 | Our sort of main focus is actual print, so [filler] when we have a big request from companies [filler] we print leaflets
Speech | 73.998 | 78.896 | Speaker 1 | [filler] magazines sometimes or like graduation books are a common thing for us
Noise | 75.673 | 75.927 | - | -
Speech | 79.286 | 81.983 | Speaker 1 | so yearbook, so we print them
Noise | 79.433 | 79.668 | - | -
Speech | 82.495 | 86.310 | Speaker 1 | It's a printing company so we do a lot of actual in paper printing
Speech | 86.834 | 99.938 | Speaker 1 | [filler] as I said for adverts and other stuff, magazines but also [filler] three d printing and then and I actually we recently [filler] we've started [filler] where we have a recent contract [filler] it's currently being signed, it's not, it's not [filler]
Noise | 93.546 | 93.703 | - | -
Speech | 100.304 | 109.328 | Speaker 1 | [filler] confidential sign allowed to reveal it but with [filler] gunsmith so soon we'll be able to print [filler] three d guns which are obviously for training purposes, not actual guns
Noise | 109.325 | 109.561 | - | -
Speech | 109.545 | 111.150 | Speaker 2 | Oh wow, that's pretty interesting
Speech | 109.787 | 113.224 | Speaker 1 | so this will you know, the company, the company is going in good directions but
Speech | 114.016 | 128.439 | Speaker 1 | and so I'm contacting you, we are [filler] years behind probably in the e-commerce front but we've been very you know <initial>B</initial> to <initial>B</initial> on how we are business[filler] (()) in the past so if you know [filler] person to person
Speech | 128.567 | 135.864 | Speaker 1 | [filler] you know business to business and seeing the clients you know in physical meetings whereas nowadays we feel like we are falling behind especially our
Noise | 135.888 | 136.282 | - | -
Speech | 136.598 | 148.216 | Speaker 1 | our rivals, now competition and then [bg-speech] [filler] we've decided that as a corporation we want to move so that's why we are doing this introducting call here with you today and I was wondering you know what sort of services we can get from you guys
Speech | 138.543 | 138.889 | Speaker 2 | Okay
Speech | 144.574 | 145.360 | Speaker 2 | Definitely
Speech | 148.585 | 160.062 | Speaker 2 | [filler] yes, definitely, well [filler] I must say [filler] you know e-commerce is the future my friend, so I don't know how (()) you are with the e-commerce but basically refers to any type of online selling
Noise | 149.246 | 150.889 | - | -
Speech | 154.882 | 155.364 | Speaker 1 | Okay
Speech | 160.643 | 167.169 | Speaker 2 | yo, that is e-commerce right so if you are wanting to sell and upscale your business by
Speech | 167.538 | 171.139 | Speaker 2 | you know, reaching more markets, larger markets, etc
Speech | 171.770 | 173.205 | Speaker 2 | e-commerce is your place my friend
Speech | 173.561 | 180.717 | Speaker 2 | okay, so it's a, it's a very good place and ways [filler] to expand your business, to grow, to reach your customers
Noise | 180.711 | 181.324 | - | -
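
The sample above contains inline annotation tags such as `<PII>`, `<initial>`, `[filler]`, `[bg-speech]`, and `(())`. Many ASR training pipelines strip such tags before using the text as a target. The sketch below shows one possible normalization; it is an assumption-laden example, not the vendor's tooling. In particular, it assumes `(())` marks unintelligible speech and that wrapped text (inside `<PII>` or `<initial>`) should be kept, neither of which is documented on this page.

```python
import re

# Wrapper tags whose inner text is kept, e.g. <PII>John Smith</PII>.
TAG_WRAPPERS = re.compile(r"</?(?:PII|initial)>")
# Bracketed event tags, e.g. [filler] or [bg-speech].
BRACKET_TAGS = re.compile(r"\[[a-z-]+\]")
# Assumed marker for unintelligible speech.
UNINTELLIGIBLE = re.compile(r"\(\(\)\)")


def normalize_transcript(text):
    """Remove annotation tags and collapse the leftover whitespace."""
    text = TAG_WRAPPERS.sub("", text)
    text = BRACKET_TAGS.sub(" ", text)
    text = UNINTELLIGIBLE.sub(" ", text)
    return " ".join(text.split())


if __name__ == "__main__":
    print(normalize_transcript("and and using your services so[filler]"))
    print(normalize_transcript("<initial>B</initial> to <initial>B</initial>"))
```

If personally identifiable information must be masked rather than kept, the wrapper pattern could instead replace the whole `<PII>...</PII>` span with a placeholder token.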

Dataset Demographics

Language

English

Language code

en-gb

Country

UK

Accents

English - East and C,...more

Gender Distribution

M:55, F:45

Age Group

18-70

Audio File Details

Environment

Silent, Noisy

Bit Depth

16 bit

Format

wav

Sample rate

8 kHz

Channel

Dual separate channel

Audio file duration

5-15 minutes

Download Sample Speech Dataset Now!

Explore the audio data, metadata, and transcription to get more clarity and hands-on experience with this dataset.

Start your AI/ML model creation journey with FutureBeeAI!
