Facebook open-sources reinforcement learning platform Horizon

Nitin Naresh November 1, 2018


Facebook is open-sourcing Horizon, a reinforcement learning platform created by Facebook AI researchers, recommender systems experts, and engineers.
Horizon is built for deploying AI at scale, so companies or research teams can run jobs that may require thousands of CPUs or GPUs and train on billions of observations. Because it uses Apache Spark for preprocessing and PyTorch for training, Horizon can also be run on a single computer.
Product teams at Facebook have used Horizon for things like M Suggestions, a service that can recommend translations, Spotify songs, Food Network recipes, and a myriad of other things based on words used in conversations on Facebook Messenger.
It’s also been used to determine the bitrate of Facebook 360 videos, and to personalize when the Facebook app chooses to send users notifications.
Reinforcement learning uses rewards to drive the activity of agents to reach a desired goal.
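To make the reward-driven idea concrete, here is a minimal sketch of tabular Q-learning, a standard textbook reinforcement learning method. It is not taken from Horizon; the corridor environment, the function names, and all parameter values are assumptions chosen for illustration.

```python
import random

def train_q_table(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
                  epsilon=0.1, seed=0):
    """Tabular Q-learning on a hypothetical corridor.

    States are 0..n_states-1; action 0 moves left, action 1 moves right.
    The agent earns a reward of 1.0 only when it reaches the last state.
    """
    rng = random.Random(seed)
    q = [[0.0, 0.0] for _ in range(n_states)]
    goal = n_states - 1
    for _ in range(episodes):
        state = 0
        while state != goal:
            # Explore with probability epsilon (or on ties), else exploit.
            if rng.random() < epsilon or q[state][0] == q[state][1]:
                action = rng.randrange(2)
            else:
                action = 0 if q[state][0] > q[state][1] else 1
            next_state = max(0, state - 1) if action == 0 else state + 1
            reward = 1.0 if next_state == goal else 0.0
            # Move the estimate toward reward + discounted best next value.
            target = reward + gamma * max(q[next_state])
            q[state][action] += alpha * (target - q[state][action])
            state = next_state
    return q

def greedy_policy(q):
    """Best-known action for every non-goal state (0 = left, 1 = right)."""
    return [0 if left > right else 1 for left, right in q[:-1]]

if __name__ == "__main__":
    print(greedy_policy(train_q_table()))
```

The agent is never told to move right. It learns to do so only because that direction eventually leads to the reward.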
Facebook chose to open-source Horizon to advance reinforcement learning and unsupervised learning methods, both for novice practitioners and students and for large research projects that, like Facebook, need thousands of machines to train AI systems.
“I do think reinforcement learning (RL) is kind of the next frontier when it comes to industrywide, widespread adoption when it comes to machine learning, so we wanted to open-source this to really provide a good platform for people all around the Bay and all around the world to start using RL,” Facebook engineer Jason Gauci said.
Facebook is no stranger to open source tools for the training or deployment of AI.
Version 1.0 of the popular deep learning framework PyTorch was released in October with integrations for Google Cloud, AWS, and Azure Machine Learning. There are also Caffe2 and ParlAI, a platform for training dialogue AI models. Work from Facebook AI Research is open-sourced as well.
In addition to PyTorch and Apache Spark, Horizon uses TensorBoardX for training visualizations and ONNX for serving AI models after training.
Unlike reinforcement learning systems at other large organizations, which may train live, Horizon trains AI systems offline.
Horizon applies a technique known as counterfactual policy evaluation to estimate an AI system's performance offline, so engineers can determine whether alternative approaches might perform better before going live.
“We can counterfactually look at these alternative actions and say ‘Oh, maybe this alternative action was better in this circumstance,’” he said. “So using this we can — as opposed to like a lot of RL, where they kind of train online and the model’s always changing — we train offline and we have a stage where we evaluate the model, and we come up with some confidence on the model’s performance, and then engineers can choose to deploy that model or not. And the Horizon platform open-sources all of that and makes it all available.”
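One standard way to do this kind of offline, counterfactual scoring is inverse propensity scoring (IPS). The sketch below is illustrative only. The article does not say which estimators Horizon uses, and the function names, the log format, and the notification data are all hypothetical.

```python
def ips_estimate(logged, target_policy):
    """Estimate the average reward a new policy would have earned,
    using only interactions logged under a different (logging) policy.

    logged: list of (context, action, reward, logging_prob) tuples, where
            logging_prob is the probability the logging policy gave `action`.
    target_policy: function (context, action) -> probability under the
            candidate policy being evaluated.
    """
    if not logged:
        raise ValueError("need at least one logged interaction")
    total = 0.0
    for context, action, reward, logging_prob in logged:
        # Reweight each logged reward by how much more (or less) likely
        # the candidate policy is to take the logged action.
        weight = target_policy(context, action) / logging_prob
        total += weight * reward
    return total / len(logged)

# Hypothetical logs: the old policy chose "send" or "wait" at random (p = 0.5).
LOGS = [
    ("morning", "send", 1.0, 0.5),
    ("morning", "wait", 0.0, 0.5),
    ("night", "send", 0.0, 0.5),
    ("night", "wait", 1.0, 0.5),
]

def candidate_policy(context, action):
    """Hypothetical candidate: always send in the morning, wait at night."""
    preferred = "send" if context == "morning" else "wait"
    return 1.0 if action == preferred else 0.0

if __name__ == "__main__":
    print(ips_estimate(LOGS, candidate_policy))
```

On this toy data the logged policy averaged a reward of 0.5, while the estimate for the candidate policy is 1.0. That is the kind of evidence an engineer could weigh before deciding to deploy a model.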
Horizon is also designed to normalize data when training on large datasets, a commonly encountered issue with reinforcement learning, Gauci said. The platform comes with step-by-step instructions so it can be used by anyone with basic computer science knowledge, not just researchers or experts at companies like Facebook.
“Anyone who has any kind of basic Unix experience can generate a dataset and train a model and see how it works, and that’s one of the things. There’s sort of an educational aspect to this; we want to get a lot of people kind of excited about the field,” he said.
Source: VentureBeat