Amazon researchers boost Alexa’s ability to understand complex commands

Nitin Naresh December 18, 2018

0 358 2 minutes read

Amazon’s Alexa is becoming more proficient at understanding multistep requests in one shot. In a paper (“Parsing Coordination for Spoken Language Understanding”) and accompanying blog post published this morning, Sanchit Agarwal, an applied scientist in the Alexa AI organization, detailed a spoken-language understanding (SLU) system that maps voice commands to actions (intents) and entities (slots) 26 percent more accurately than off-the-shelf alternatives.
Agarwal and colleagues’ work will be presented at the upcoming IEEE Spoken Language Technology conference in Athens, Greece later this month. News of their research comes a day after Amazon scientists described an AI-driven method that can cut Alexa’s skill selection error rate by 40 percent.
“Narrow [SLU systems] usually have rigid constraints, such as allowing only one intent to be associated with an utterance and only one value to be associated with a slot type,” he wrote. “We [propose] a way to enable SLU systems to understand compound entities and intents.”
As Agarwal explained, he and colleagues used a deep neural network — layers of mathematical functions called neurons, loosely modeled on their biological equivalents — that was “taught” from structures in spoken-language data. First, a corpus was labeled according to a scheme indicating groups of words, or “chunks,” that should be treated as ensembles: “B” to indicate the beginning of a chunk, “I” to indicate the inside of a chunk, or “O” to indicate a word that lies outside a chunk. Then, prior to training, the words underwent embedding, a process that involved substituting vectors to represent them.
The embeddings were next passed to a bidirectional long-short-term memory (bi-LSTM) network, a type of recurrent neural network capable of learning long-term dependencies, which output a contextual embedding of each word in the input sentence. Those outputs were combined with a neural network layer that mapped each embedding to a distribution over the output “B,” “I,” and “O” labels, classifying each word of the input according to its most probable output label.
An additional layer, known as a conditional-random-field, or CRF, learned to associate between the output labels and choose the most likely labels from all possible sequences. Thanks to a technique called adversarial training — during which the network was evaluated on how well or poorly it predicted the labels — the model learned to generalize.
“Instead of building separate parsers for different slot types (such as ListItem, FoodItem, Appliance, etc.), we built one parser that can handle multiple slot types,” Agarwal said. “For example, our parser can successfully identify [list items] in the utterance ‘add apples peanut butter and jelly to my list’ and [appliances] in the utterance ‘turn on the living room light and kitchen light’.”
Source: VentureBeat
To Read Our Daily News Updates, Please visit Inventiva or Subscribe Our Newsletter & Push.

Amazon researchers boost Alexa’s ability to understand complex commands

Nitin Naresh

Read Next

MobiKwik IPO: Fully Subscribed in 1-Hour, Fintech’s Triumph

BJP Accuses U.S. Of Colluding With The Opposition !

Suicide Of Bengaluru Techie Exposes The Shocking Reality Of Marital Struggles And Mental Health Crisis!

Hybrid Hosting: PCB’s Shocking Demands Shake ICC 2025

Adani’s ₹7.5 Lakh Crore Rajasthan Plan Amid US Scandal

The Job Market’s Latest Abuse Of Power: The “Apply And Ask To Work From Home Or Freelance Work” Trend

Brian Thompson’s Death: The 1 Shocking Incident

The GST Hike on Apparel: 6 Powerful Reasons to Reconsider

Rape Trial Of 50 Defendants: France’s Case Sparks Outrage And Calls For Reform!

The PM Internship Scheme: Why are candidates backing out?

MobiKwik IPO: Fully Subscribed in 1-Hour, Fintech’s Triumph

BJP Accuses U.S. Of Colluding With The Opposition !

Suicide Of Bengaluru Techie Exposes The Shocking Reality Of Marital Struggles And Mental Health Crisis!

Hybrid Hosting: PCB’s Shocking Demands Shake ICC 2025

Adani’s ₹7.5 Lakh Crore Rajasthan Plan Amid US Scandal

The Job Market’s Latest Abuse Of Power: The “Apply And Ask To Work From Home Or Freelance Work” Trend

Brian Thompson’s Death: The 1 Shocking Incident

The GST Hike on Apparel: 6 Powerful Reasons to Reconsider

Rape Trial Of 50 Defendants: France’s Case Sparks Outrage And Calls For Reform!

The PM Internship Scheme: Why are candidates backing out?

Leave a Reply Cancel reply

Acer may shutter or sell StarVR after location-based VR revenues sink

Indonesia short on oxygen, seeks help as virus cases soar

Floods- Why are Pune and Mumbai prone to it?

The solar storms will hit the Earth and cause disruption in GPS and mobile connectivity.

The death of democracy in India

Employee Engagement In The Hybrid Workplace Of The Future

Read Next

MobiKwik IPO: Fully Subscribed in 1-Hour, Fintech’s Triumph

BJP Accuses U.S. Of Colluding With The Opposition !

Suicide Of Bengaluru Techie Exposes The Shocking Reality Of Marital Struggles And Mental Health Crisis!

Hybrid Hosting: PCB’s Shocking Demands Shake ICC 2025

Adani’s ₹7.5 Lakh Crore Rajasthan Plan Amid US Scandal

The Job Market’s Latest Abuse Of Power: The “Apply And Ask To Work From Home Or Freelance Work” Trend

Brian Thompson’s Death: The 1 Shocking Incident

The GST Hike on Apparel: 6 Powerful Reasons to Reconsider

Rape Trial Of 50 Defendants: France’s Case Sparks Outrage And Calls For Reform!

The PM Internship Scheme: Why are candidates backing out?

Subscribe to our mailing list to get the new updates!

Charter settles New York internet suit with record fine and Spectrum refunds

Amazon Echo Show vs. Facebook Portal vs. Google Home Hub: Which speaks to you?

Related Articles

Leave a Reply Cancel reply

Acer may shutter or sell StarVR after location-based VR revenues sink

Indonesia short on oxygen, seeks help as virus cases soar

Floods- Why are Pune and Mumbai prone to it?

The solar storms will hit the Earth and cause disruption in GPS and mobile connectivity.

The death of democracy in India

Employee Engagement In The Hybrid Workplace Of The Future