New Nvidia AI agent, powered by GPT-4, can train robots

4 Min Read

VentureBeat presents: AI Unleashed – An unique government occasion for enterprise knowledge leaders. Community and study with business friends. Learn More


Nvidia Analysis announced today that it has developed a brand new AI agent, referred to as Eureka, that’s powered by OpenAI’s GPT-4 and may autonomously train robots complicated expertise.

In a blog post, the corporate mentioned Eureka, which autonomously writes reward algorithms, has, for the primary time, skilled a robotic hand to carry out fast pen-spinning methods in addition to a human can. Eureka has additionally taught robots to open drawers and cupboards, toss and catch balls, and manipulate scissors, amongst almost 30 duties.

“Reinforcement studying has enabled spectacular wins over the past decade, but many challenges nonetheless exist, comparable to reward design, which stays a trial-and-error course of,” Anima Anandkumar, senior director of AI analysis at Nvidia and an writer of the Eureka paper, mentioned within the weblog put up. “Eureka is a primary step towards creating new algorithms that combine generative and reinforcement studying strategies to unravel onerous duties.”

Nvidia Analysis additionally revealed the Eureka library of AI algorithms for folks to experiment with them utilizing Nvidia Isaac Gymnasium, a physics simulation reference software for reinforcement studying analysis. Isaac Gymnasium is constructed on Nvidia Omniverse, a growth platform for constructing 3D instruments and functions based mostly on the OpenUSD framework.

Work builds on earlier Nvidia work on AI brokers

Hype over AI agents has been swirling for months, together with with the rise of autonomous AI brokers like Auto-GPTBabyAGI and AgentGPT again in April.

See also  LlamaIndex: Augment your LLM Applications with Custom Data Easily

The present Nvidia Analysis work builds on earlier efforts together with the latest Voyager, an AI agent constructed with GPT-4 that may autonomously play Minecraft. In a New York Times article this week on efforts to remodel chatbots into on-line brokers, Jeff Clune, a pc science professor on the College of British Columbia who was beforehand an OpenAI researcher, mentioned that “it is a enormous industrial alternative, doubtlessly trillions of {dollars},” whereas including that “this has an enormous upside — and large penalties — for society.”

Outperforms knowledgeable human-engineered rewards

In a brand new analysis paper titled “Eureka: Human-level reward design through coding massive language fashions,” the authors mentioned that Eureka “exploits the outstanding zero-shot era, code-writing, and in-context enchancment capabilities of state-of-the-art LLMs, comparable to GPT-4, to carry out evolutionary optimization over reward code.”

The ensuing rewards, they mentioned, can be utilized to accumulate complicated expertise by way of reinforcement studying. “With none task-specific prompting or pre-defined reward templates, Eureka generates reward capabilities that outperform knowledgeable human-engineered rewards. In a various suite of 29 open-source RL environments that embody 10 distinct robotic morphologies, Eureka outperforms human specialists on 83% of the duties, resulting in a median normalized enchancment of 52%.”

“Eureka is a novel mixture of huge language fashions and Nvidia’s GPU-accelerated simulation applied sciences,” mentioned Jim Fan, senior analysis scientist at NVIDIA, who’s one of many mission’s contributors, within the weblog put up. “We consider that Eureka will allow dexterous robotic management and supply a brand new technique to produce bodily lifelike animations for artists.”

See also  Agility is using large language models to communicate with its humanoid robots

Source link

Share This Article
Leave a comment

Leave a Reply

Your email address will not be published. Required fields are marked *

Please enter CoinGecko Free Api Key to get this plugin works.