Meta to train speech recognition engines on 'clusters' of speakers using new dataset


San Francisco, Jul 14 (IANS): Meta (formerly Facebook) has developed a new dataset that it will use to improve the performance of automatic speech recognition (ASR) tools by clustering speech at the "utterance level".

As part of Meta's continued commitment to improving ASR performance, the company has taught ASR models to train without transcripts, to recognise more than 4,000 spoken languages, and even to read lips more accurately than humans can.

However, many of the datasets used to train ASR models are organised by demographic categories such as age group, gender, nationality, and English accent. This limits the variety of pronunciations the models are exposed to during training and ultimately hampers their ability to understand a wide range of users.

To overcome this, Meta AI developed a dataset that relies instead on utterance clustering.

"Instead of dividing a dataset based on speakers' demographic information -- such as their age group or gender -- our proposed algorithm clusters speech at the utterance level," Meta said in a blogpost on Thursday.

"A single cluster will contain similar utterances from a diverse group of speakers. We can then train our model using the various clusters and use fairness datasets to measure how the model impacts outcomes across different demographic groups," it added.

The company's resulting dataset includes 27,055 utterances of recorded speech from 595 people in the US who were paid to record and submit audio of themselves saying commands.

Their utterances are organised around seven main themes -- music, capture, utilities, notification control, messaging, calling, and dictation -- which other researchers can use to train their own models and digital assistants.

For example, the speakers were asked how they would use voice commands to search for a song, make plans with friends, and decide where to meet up.

To evaluate the new system, Meta trained its model on de-identified, publicly available English-language Facebook videos and then evaluated it on two datasets.

The first was a de-identified dataset collected from an ASR data supplier, containing 48,000 utterances from 867 speakers; the second was Casual Conversations v1, a dataset of transcribed speech that Meta built and made publicly available in 2021.

"During testing, we observed that a model trained in this manner improved speech recognition accuracy for all measured demographic groups, and in particular for different accents, which are identified in sociolinguistics as a way of pronouncing a language that is distinctive to a country, area, social class, or individual," Meta said.

"While our proposed algorithm was built using English-language data, we hope these approaches can be extended to work for other languages as well," it added.

 

  

Top Stories


Leave a Comment

Title: Meta to train speech recognition engines on 'clusters' of speakers using new dataset



You have 2000 characters left.

Disclaimer:

Please write your correct name and email address. Kindly do not post any personal, abusive, defamatory, infringing, obscene, indecent, discriminatory or unlawful or similar comments. Daijiworld.com will not be responsible for any defamatory message posted under this article.

Please note that sending false messages to insult, defame, intimidate, mislead or deceive people or to intentionally cause public disorder is punishable under law. It is obligatory on Daijiworld to provide the IP address and other details of senders of such comments, to the authority concerned upon request.

Hence, sending offensive comments using daijiworld will be purely at your own risk, and in no way will Daijiworld.com be held responsible.