
Startup harnesses self-supervised learning to tackle speech recognition biases

Posted on November 8, 2021 by admin

Speech recognition systems struggle to understand African American Vernacular English (AAVE). In a 2020 study by Stanford University researchers, some leading systems performed so poorly on AAVE that they correctly transcribed barely half of the words spoken.

The researchers speculated that the systems had a common flaw: “insufficient audio data from Black speakers when training the models.”

A startup called Speechmatics has developed a technique that appears to reduce this data gap.

The company announced last week that its software had “an overall accuracy of 82.8% for African American voices” based on datasets used in the Stanford study. In comparison, the systems developed by Google and Amazon both recorded an accuracy of only 68.6%.
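To put the two figures side by side, the short Python sketch below converts each accuracy into an error rate and computes the relative reduction in errors. It assumes that "accuracy" here means the share of words transcribed correctly (100% minus the word error rate); the article does not spell out the metric, and the function name is purely illustrative.

```python
def relative_error_reduction(baseline_accuracy: float, new_accuracy: float) -> float:
    """Return the fraction of the baseline's errors that the new system avoids.

    Both arguments are accuracies in percent. Assumption: accuracy is
    100% minus the word error rate, which the article does not confirm.
    """
    baseline_error = 100.0 - baseline_accuracy  # 68.6% accuracy -> 31.4% errors
    new_error = 100.0 - new_accuracy            # 82.8% accuracy -> 17.2% errors
    return (baseline_error - new_error) / baseline_error


# Figures quoted above: Google and Amazon at 68.6%, Speechmatics at 82.8%.
reduction = relative_error_reduction(68.6, 82.8)
print(f"Relative error reduction: {reduction:.1%}")  # prints 45.2%
```

Under that assumption, a gap of 14.2 percentage points in accuracy corresponds to roughly 45% fewer transcription errors.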

Speechmatics attributed much of its performance to a technique called self-supervised learning.

Training school

The advantage of self-supervised models is that they don’t require all their training data to be labeled by humans. As a result, they can enable AI systems to learn from a much larger pool of information.

This helped Speechmatics increase its training data from around 30,000 hours of audio to around 1.1 million hours.
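As a rough illustration of why no human labels are needed, the Python sketch below builds a training example from unlabeled audio by hiding some frames and keeping the hidden values as prediction targets. This masked-prediction setup is one common form of self-supervised learning, chosen here as an assumption; the article does not describe Speechmatics' actual method, and the function name, parameters, and toy data are hypothetical.

```python
import random


def make_masked_example(frames, mask_ratio=0.3, mask_value=0.0, rng=None):
    """Turn unlabeled frames into a (model_input, targets) training pair.

    Hypothetical helper for illustration only: the targets come from the
    audio itself, so no human transcription is required.
    """
    rng = rng or random.Random()
    n_masked = max(1, round(len(frames) * mask_ratio))
    masked_positions = sorted(rng.sample(range(len(frames)), n_masked))

    model_input = list(frames)
    targets = {}
    for pos in masked_positions:
        targets[pos] = frames[pos]     # the "label" is the original frame
        model_input[pos] = mask_value  # the model only sees a placeholder
    return model_input, targets


# Toy stand-in for a short stretch of unlabeled audio features.
unlabeled_audio = [0.12, 0.57, 0.33, 0.91, 0.48, 0.05, 0.76, 0.29, 0.64, 0.18]
model_input, targets = make_masked_example(unlabeled_audio, rng=random.Random(0))
print(model_input)
print(targets)
```

Because every example is generated this way from raw recordings, the amount of usable training data is limited by how much audio is available rather than by how much of it people have transcribed.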

Will Williams, the company’s VP of machine learning, told TNW that the approach improved the software’s performance across a variety of speech patterns:

What we’re looking to do is build scalable methods that let us attack a broad range of accents at once.

Learning like a child

One of the technique’s benefits was narrowing the gap in how well Speechmatics’ software understands speakers of different ages.

In tests on data from the open-source project Common Voice, the software had a 92% accuracy rate on children’s voices. The Google system, by comparison, had an accuracy of 83.4%.

Williams said enhancing the recognition of kids’ voices was never a specific objective:

We’re training on millions of hours of audio, and just like how a child learns, we’re exposing our learning systems to all this online audio… Inside those millions of hours, there will be children’s voices, so it will learn how to deal with them — but without them being labelled.

That doesn’t mean that self-supervised learning alone can eliminate AI biases. Allison Koenecke, the lead author of the Stanford study, noted that other issues also need to be addressed: 

We also strongly believe that achieving fair outcomes is as much a ‘people problem’ as a ‘data problem.’ That is, we hope that ASR [automatic speech recognition] developers themselves understand the need to be broadly inclusive.

Nonetheless, the performance of Speechmatics suggests that self-supervised learning can at least mitigate dataset biases.

