Alexa speech normalization AI reduces errors by up to 81%

Nitin Naresh May 16, 2019

0 271 2 minutes read

Text normalization is a fundamental processing step in most natural language systems. In the case of Amazon’s Alexa, “Book me a table at 5:00 p.m.” might be transcribed by the assistant’s automatic speech recognizer as “five p m” and further reformatted to “5:00PM.” Then again, Alexa might convert “5:00PM” to “five thirty p m” for its text-to-speech synthesizer.
So how does this work? Currently, Amazon’s voice assistant relies on “thousands” of handwritten normalization rules for dates, email addresses, numbers, abbreviations, and other expressions, according to Alexa AI group applied scientist Ming Sun and Alexa Speech machine learning scientist Yuzong Liu. That’s all well and fine for English, but because the approach isn’t particularly adaptable to other languages (without lots of manual labor), Amazon scientists are investigating a more scalable technique driven by machine learning.
In a preprint paper (“Neural Text Normalization with Subword Units”) scheduled to be presented at the North American Chapter of the Association for Computational Linguistics (NAACL), Sun, Liu, and colleagues describe an AI text normalization system that breaks words in input and output streams into smaller strings of characters called subword units. These subword units, Sun and Liu explain in a blog post, reduce the number of inputs the machine learning model must learn and clear up ambiguity in snippets like “Dr.” (which could mean “doctor” or “Drive”) and “2/3” (which could mean “two-thirds” or “February third”).
Furthermore, the subword units enable the AI model to better handle input words it hasn’t seen before. Unfamiliar words might contain familiar subword components, and these are sometimes enough to help the model decide on a course of action.
The researchers’ system created subword units by reducing words in a training data set to individual characters, which an algorithm ingested to identify the most commonly occurring two-character units and three-character units until it reached capacity (around 2,000 subwords). The components were used to train an AI system to output subword units, which a separate algorithm stitched together into complete words.
After training the system on 500,000 examples from a public data set, the researchers say it achieved a 75% reduction in error rate compared with the best-performing machine learning system previously reported and a 63% reduction in latency, or the time it takes to receive a response to a single request. By factoring in additional information, such as parts of speech, position within the sentence, and capitalization, it managed a further error rate reduction of 81% and a word error rate of just 0.2%.
Source: VentureBeat

Follow Us On Facebook, Twitter & Instagram . Please Share Your Stories, Press Release & Articles At [email protected]. To Read More News Daily, Subscribe To Our Push Notification at https://www.inventiva.co.in/

Nitin Naresh May 16, 2019

0 271 2 minutes read

ELSS vs Equity: Which is Best For Tax Saving?

What Has Impacted The Apple iPhone Shipments?

With No Respite To Manipur Violence, There Is Another Side To The Coin – Drugs, Politics And Armed Militia

AI Created Celebrity Porn To Be Reviewed By Meta’s Oversight Board; The Epidemic Of Deepfake Porn Ruining Many Lives And It Is Worse Than You Think

Instagram Influencers with More Followers Than Their Countries’ Populations

Elevate Your Skills with Dynamic Microsoft Project Courses

Israel Army Chief Pledges Iran Response, Bracing For The Worst As Western Countries Urge Restraint

Why Is IndiGo India’s Most Unsafe Airline? Why DGCA & Aviation Ministry Do Not Suspend The License Of Indigo But Are Happy To Play With The Lives Of 100s Of People

Brace For Inflation. Iran’s Attack On Israel Has Deep Implications For The Global Economy And India, Affecting Trade, Oil, And Daily Life. What Should Indian Investors Do?

Embracing Fairness And Beauty At The Cost Of Kidney Damage: From Using Products That Are High In Mercury To Recent Growing Use Of Glutathione, How We Are Still Enslaved In The Prejudices Of The Past!

Alexa speech normalization AI reduces errors by up to 81%

Nitin Naresh

Read Next

Instagram Influencers with More Followers Than Their Countries’ Populations

Israel Army Chief Pledges Iran Response, Bracing For The Worst As Western Countries Urge Restraint

Why Is IndiGo India’s Most Unsafe Airline? Why DGCA & Aviation Ministry Do Not Suspend The License Of Indigo But Are Happy To Play With The Lives Of 100s Of People

Instagram Influencers with More Followers Than Their Countries’ Populations

Israel Army Chief Pledges Iran Response, Bracing For The Worst As Western Countries Urge Restraint

Why Is IndiGo India’s Most Unsafe Airline? Why DGCA & Aviation Ministry Do Not Suspend The License Of Indigo But Are Happy To Play With The Lives Of 100s Of People

Leave a Reply Cancel reply

Top 10 Best Agriculture Companies in India 2022

Top 10 Best Artificial Intelligence (AI) Companies of India in 2022

Ampere launches new chip built from ground up for cloud workloads

Acer may shutter or sell StarVR after location-based VR revenues sink

Indonesia short on oxygen, seeks help as virus cases soar

Floods- Why are Pune and Mumbai prone to it?

The solar storms will hit the Earth and cause disruption in GPS and mobile connectivity.

The death of democracy in India

Employee Engagement In The Hybrid Workplace Of The Future

Read Next

Instagram Influencers with More Followers Than Their Countries’ Populations

Israel Army Chief Pledges Iran Response, Bracing For The Worst As Western Countries Urge Restraint

Why Is IndiGo India’s Most Unsafe Airline? Why DGCA & Aviation Ministry Do Not Suspend The License Of Indigo But Are Happy To Play With The Lives Of 100s Of People

SugarCRM moves into marketing automation with Salesfusion acquisition

Why you should learn Ethical Hacking?

Related Articles

Leave a Reply Cancel reply

Top 10 Best Agriculture Companies in India 2022

Top 10 Best Artificial Intelligence (AI) Companies of India in 2022

Ampere launches new chip built from ground up for cloud workloads

Acer may shutter or sell StarVR after location-based VR revenues sink

Indonesia short on oxygen, seeks help as virus cases soar

Floods- Why are Pune and Mumbai prone to it?

The solar storms will hit the Earth and cause disruption in GPS and mobile connectivity.

The death of democracy in India

Employee Engagement In The Hybrid Workplace Of The Future

Adblock Detected