Google releases open source reinforcement learning framework for training AI models

Nitin Naresh August 27, 2018

Reinforcement learning, an artificial intelligence (AI) technique that uses rewards (or punishments) to drive agents toward specific goals, trained the systems that defeated Go world champions (DeepMind’s AlphaGo) and mastered Valve’s Dota 2. And it’s a core part of Google subsidiary DeepMind’s deep Q-network (DQN), which can distribute learning across multiple workers in the pursuit of, for example, achieving “superhuman” performance in Atari 2600 games. The trouble is, reinforcement learning frameworks take time to master, tend to be inflexible, and aren’t always stable.
That’s why Google’s proposing an alternative: Dopamine, an open-source reinforcement learning framework based on TensorFlow, its machine learning library. It’s available from GitHub starting today.
“Inspired by one of the main components in reward-motivated behavior in the brain and reflecting the strong historical connection between neuroscience and reinforcement learning research, this platform aims to enable the kind of speculative research that can drive radical discoveries,” Pablo Samuel Castro and Marc G. Bellemare, researchers on the Google Brain Team, wrote in a blog post. “This release also includes a set of colabs that clarify how to use our framework.”
Castro, Bellemare, and their Google Brain colleagues developed the framework with three tenets in mind: flexibility, stability, and reproducibility.
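
For readers new to the technique, the reward-driven learning described at the top of this article can be sketched in a few lines. The example below is not code from Google’s framework and it is not DQN; it is plain tabular Q-learning on a made-up five-state corridor, where the agent earns a reward of 1 only when it reaches the rightmost state. The environment, the function names (`step`, `train`), and the hyperparameters are all assumptions chosen for illustration.

```python
import random

N_STATES = 5      # states 0..4; state 4 is the goal (hypothetical toy environment)
ACTIONS = (0, 1)  # 0 = move left, 1 = move right


def step(state, action):
    """Apply an action and return (next_state, reward, done)."""
    if action == 0:
        next_state = max(0, state - 1)
    else:
        next_state = min(N_STATES - 1, state + 1)
    done = next_state == N_STATES - 1
    reward = 1.0 if done else 0.0  # reward only on reaching the goal
    return next_state, reward, done


def train(episodes=500, alpha=0.1, gamma=0.9, epsilon=0.1, seed=0):
    """Tabular Q-learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(episodes):
        state, done = 0, False
        while not done:
            # Explore with probability epsilon, or when both actions look equal.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.choice(ACTIONS)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state, reward, done = step(state, action)
            # Bootstrapped target: reward plus discounted best future value.
            target = reward + (0.0 if done else gamma * max(q[next_state]))
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q


q_table = train()
# Greedy action for each non-terminal state after training.
greedy_policy = [max(ACTIONS, key=lambda a: q_table[s][a]) for s in range(N_STATES - 1)]
print(greedy_policy)
```

With these settings the learned policy should favor moving right in every state, since that is the only route to the reward. Agents such as DQN replace the lookup table with a neural network so the same idea can scale to inputs like Atari frames.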

Above: A visualization of AI agents trained using reinforcement learning.

Image Credit: Google

To that end, it includes a compact set of well-documented code (15 Python files) focused on the Arcade Learning Environment — a platform for evaluating AI technology with video games — and four distinct machine learning models: the aforementioned DQN, C51, a simplified variant of the Rainbow agent, and the Implicit Quantile Network. In the interest of reproducibility, the code’s provided with full test coverage and training data (in JSON and Python pickle formats) across the 60 games supported by the Arcade Learning Environment, and follows best practices on standardizing the results for empirical evaluations.
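
As a rough idea of what working with results shipped in both JSON and Python pickle formats might look like, the sketch below writes a tiny set of records in each format and reads them back. The record fields (`agent`, `game`, `iteration`, `average_return`), the file names, and the numbers are placeholders invented for this example; they are not the schema or the data of Google’s release, which may be organized differently.

```python
import json
import pickle
import tempfile
from pathlib import Path

# Placeholder records with made-up values; the real release's layout may differ.
records = [
    {"agent": "dqn", "game": "Pong", "iteration": 0, "average_return": -1.0},
    {"agent": "dqn", "game": "Pong", "iteration": 1, "average_return": 2.0},
]


def save_both(data, directory):
    """Write the same records as JSON and as a pickle; return both paths."""
    directory = Path(directory)
    json_path = directory / "baseline.json"   # hypothetical file name
    pickle_path = directory / "baseline.pkl"  # hypothetical file name
    json_path.write_text(json.dumps(data))
    with pickle_path.open("wb") as handle:
        pickle.dump(data, handle)
    return json_path, pickle_path


def load_json(path):
    """Read records from a JSON file."""
    return json.loads(Path(path).read_text())


def load_pickle(path):
    """Read records from a pickle file. Only unpickle files you trust."""
    with Path(path).open("rb") as handle:
        return pickle.load(handle)


with tempfile.TemporaryDirectory() as tmp:
    json_path, pickle_path = save_both(records, tmp)
    from_json = load_json(json_path)
    from_pickle = load_pickle(pickle_path)

# Pick the iteration with the highest average return.
best = max(from_json, key=lambda record: record["average_return"])
print(best["iteration"], best["average_return"])
```

JSON is the safer choice for sharing results because it is plain text and readable from any language, while pickle is convenient inside Python but can execute arbitrary code when loading an untrusted file.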
Alongside the release of the reinforcement framework, Google’s launching a website that allows developers to quickly visualize training runs for multiple agents. It’s also making available trained models, raw statistics logs, and TensorFlow event files for plotting with TensorBoard, the Mountain View company’s suite of visualization tools for TensorFlow programs.
“Our hope is that our framework’s flexibility and ease-of-use will empower researchers to try out new ideas, both incremental and radical,” Bellemare and Castro wrote. “We are already actively using it for our research and finding it is giving us the flexibility to iterate quickly over many ideas. We’re excited to see what the larger community can make of it.”
Source: VentureBeat
