Tech News

Tech Business News

  • Home
  • Technology
  • Business
  • News
    • Technology News
    • Local Tech News
    • World Tech News
    • General News
    • News Stories
  • Media Releases
    • Tech Media Releases
    • General Media Releases
  • Advertisers
    • Advertiser Content
    • Promoted Content
    • Sponsored Whitepapers
    • Advertising Options
  • Cyber
  • Reports
  • People
  • Science
  • Articles
    • Opinion
    • Digital Marketing
    • Guest Publishers
  • About
    • Tech Business News
    • News Contributions -Submit
    • Journalist Application
    • Contact Us
Reading: Google Announce Latest Update To Its 1,000 + Language AI Universal Speech Model (USM)
Share
Font ResizerAa
Tech Business NewsTech Business News
  • Home
  • Technology News
  • Business News
  • News Stories
  • General News
  • World News
  • Media Releases
Search
  • News
    • Technology News
    • Business News
    • Local News
    • News Stories
    • General News
    • World News
    • Global News
  • Media Releases
    • Tech Media Releases
    • General Press
  • Categories
    • Crypto News
    • Cyber
    • Digital Marketing
    • Education
    • Gadgets
    • Technology
    • Guest Publishers
    • IT Security
    • People In Technology
    • Reports
    • Science
    • Software
    • Stock Market
  • Promoted Content
    • Advertisers
    • Promoted
    • Sponsored Whitepapers
  • Contact & About
    • Contact Information
    • About Tech Business News
    • News Contributions & Submissions
Follow US
© 2022 Tech Business News- Australian Technology News. All Rights Reserved.
Tech Business News > World Tech > Google Announce Latest Update To Its 1,000 + Language AI Universal Speech Model (USM)
World Tech

Google Announce Latest Update To Its 1,000 + Language AI Universal Speech Model (USM)

The creation of USM is crucial for Google to realise its objective of organising and enhancing worldwide access to information.

Matthew Giannelis
Last updated: March 14, 2023 6:15 am
Matthew Giannelis
Share
SHARE

To keep up with the rising popularity of Open AI, Google has released its latest update and progress report for it’s Universal Speech Model (USM). Presently, the USM can handle roughly 300 languages, but the company has set its sights on expanding its capabilities to 1,000 languages.

The development of Google’s Universal Speech Model (USM) marks a crucial initial step towards building artificial intelligence (AI) capable of comprehending and translating 1,000 languages.

Google released groundbreaking details of their AI universal speech model, specially created to comprehend hundreds of spoken languages. The model’s remarkable capabilities are attributed to its extensive training on an impressive 12 million hours of speech and a staggering 28 billion sentences of text, encompassing over 300 languages.

This game-changing technology is poised to transform the landscape of global communication, and the world waits with bated breath for its next move.

The Universal Speech Model (USM) is engineered with a standardised encoding-decoding architecture that utilizes convolutional, attention, and feed-forward modules for its encoder.

This innovative design makes it compatible with LAS, CTC, and RNN-T decoders, pushing the boundaries of what is possible in the realm of artificial intelligence.

Google is leaving no stone unturned in its pursuit of achieving its lofty 1000-language objective. The tech giant is utilising a powerful machine learning model that focuses on the most frequently used languages. Nonetheless, numerous languages have a minimal number of speakers, which restricts the availability of data

In a bid to overcome this hurdle, Google is taking a multi-pronged approach by aggregating and analysing data from various language sources. The goal is to create more accurate results through automatic speech recognition scaling, laying the foundation for a future where language barriers are no longer an obstacle.

According to recent reports, the data available on YouTube is relatively limited in comparison to that of OpenAI. Nevertheless, the Universal Speech Model (USM) has been found to generate 30% fewer word errors than its OpenAI counterpart.

Google says that early tests of the model were designed to create captions on YouTube videos and can perform automatic speech recognition on 100 languages. Some of the languages are spoken by fewer than twenty million people, making it very difficult to find the training data.

This cutting-edge technology marks a significant step forward in the tech giant’s overarching mission to make information easily accessible to everyone, everywhere. With the USM leading the way, the world can look forward to a future where language barriers are no longer a hindrance to communication and understanding.

The machine learning model’s training pipeline is a multi-step process that starts with self-supervised learning of audio files that span numerous languages. Afterward, the quality and language coverage of the data are optimised by incorporating text data wherever possible.

The next step is the fine-tuning of downstream operations, which leverages supervised data to enhance the model’s performance even further. This comprehensive training regimen is a testament to the tech giant’s unwavering commitment to pushing the boundaries of what is possible in the realm of artificial intelligence.

According to a research paper by Google, which was authored by numerous contributors, there are two types of models generated by pre-trained models that can be fine-tuned for downstream tasks. In addition, there are generic automatic speech recognition (ASR) models, which the researchers believe are not fine-tuned downstream.

Google’s researchers assert that these generic ASR models are scalable and can improve the performance of models trained on shorter utterances when applied to longer speech inputs.

Google has also prioritised improving its speech-to-text transcription accuracy as an important area of focus. Specifically, the company has been developing methods to minimise errors and inaccuracies in transcriptions, which can pose significant challenges in certain situations.

USM now supports all 73 languages in the YouTube Captions’ Test Set and outperforms Whisper on the languages it can support with lower than 40% WER. Lower WER is better.

Also, the Google Artificial Intelligence team has been investigating novel approaches to enhance the precision of its speech models by utilising better training data. These endeavors have involved gathering more varied and inclusive datasets, which can aid in improving the model’s capacity to identify and transcribe speech from diverse sources.

In addition, the algorithm is currently confronting several challenges. Based on research and ongoing competitions, the comprehension algorithm must be versatile, robust, and adaptable to facilitate model improvement in a computationally efficient manner, while expanding language coverage and proficiency.

The algorithm should have the ability to process large volumes of data from various sources, generalize to new languages and use cases, and facilitate model upgrades without requiring extensive retraining.

Undoubtedly, there are apprehensions and concerns about the potential misuse or abuse of these technologies. For instance, concerns have been raised regarding the reliability of speech recognition technologies when utilised in legal proceedings or when transcribing discussions that involve delicate or confidential information.

In general, Google’s initiatives in this domain are part of a larger movement aimed at enhancing AI-driven speech recognition and transcription. As voice-activated interfaces become more prevalent, these technologies will assume an increasingly vital function in a wide range of applications, from virtual assistants to customer service chatbots and beyond.

Despite the aforementioned concerns, it is evident that AI-driven speech recognition and transcription will persist as a significant focus for companies like Google in the coming years.

As these technologies continue to advance, they are likely to proliferate and become even more robust, potentially revolutionising our interactions with computers and each other.

Google published a blog entry by members of the team working on the project. It’s team members also published a paper describing the introduction of its Universal Speech Model (USM) on the arXiv pre-print server.

By Matthew Giannelis
Follow:
Secondary editor and executive officer at Tech Business News. An IT support engineer for 20 years he's also an advocate for cyber security and anti-spam laws.
Previous Article Customers Dont Like Or Hate Chatbots. Annoying Why Customers Don’t Like Or Hate Chatbots – Annoying & Bad For Business
Next Article China Set To Increase Defense Spending By 7.2% China Set To Increase Defense Spending By 7.2% To 1.56 Trillion Yuan ($230 billion)
Leave a comment

Leave a Reply Cancel reply

You must be logged in to post a comment.

Google Universal Speech Model (USM) Progress Report Update

Tech Articles

Healthcare AI

AI Is Transforming Healthcare By Improving Workflows And Resource Management

Artificial Intelligence (AI) is transforming every industry. In healthcare, there…

October 12, 2024
Re-Engagement Campaigns - Inactive Subscribers

Re-Engagement Campaigns To Bring Back Inactive Subscribers

If you’re managing an email list, you’ve probably noticed that…

November 26, 2024
Nations Leading the Charge in Dead and Scam Crypto Projects

Top Nations Behind Scams And Dead Crypto Projects

An analysis of 1,500+ crypto ventures reveals the U.S. leading…

November 29, 2024

Recent News

Global Tech News - US blacklists 4 Chinese tech companies
World Tech

US Adds Four Chinese Tech Firms To Blacklists for Alleged Military Assistance

3 Min Read
russian-attacks-solarwinds
World Tech

Russian SolarWinds hackers launched new phishing campaign

5 Min Read
IBM News
World Tech

IBM Expands Security Portfolio with Plans to Acquire ReaQta

4 Min Read
Aarrest of 39-year-old Pavel Durov last Saturday
World Tech

Telegram Boss Pavel Durov Arrest Fires A Warning Shot To Global Tech Titans

8 Min Read
Tech News

Tech Business News

Stay up to date with the latest technology & business news trends from Australia and the around the world.

Technology News reports and whitepaper publishing services are available along with media and advertising options

Our Australian technology news includes People, Business, Science, World News, Local News, Guest publishers, IT News & Tech News Australia | Tech News was established in 2019

About

About Us 
Contact Us 
Privacy Policy
Copyright Policy
Terms & Conditions

December, 10, 2024

Contact

Contact Information.
Melbourne, Australia

Werribee 3030

Phone: +61 431401041

Hours : Monday to Friday, 9am 530-pm.


Tech News

© Copyright Tech Business News 

Latest Australian Tech News – 2024

Welcome Back!

Sign in to your account