New Nvidia AI agent, powered by GPT-4, can train robots

VentureBeat presents: AI Unleashed – An unique government occasion for enterprise knowledge leaders. Community and study with business friends. Learn More

Contents

Work builds on earlier Nvidia work on AI brokers Outperforms knowledgeable human-engineered rewards

Nvidia Analysis announced today that it has developed a brand new AI agent, referred to as Eureka, that’s powered by OpenAI’s GPT-4 and may autonomously train robots complicated expertise.

In a blog post, the corporate mentioned Eureka, which autonomously writes reward algorithms, has, for the primary time, skilled a robotic hand to carry out fast pen-spinning methods in addition to a human can. Eureka has additionally taught robots to open drawers and cupboards, toss and catch balls, and manipulate scissors, amongst almost 30 duties.

“Reinforcement studying has enabled spectacular wins over the past decade, but many challenges nonetheless exist, comparable to reward design, which stays a trial-and-error course of,” Anima Anandkumar, senior director of AI analysis at Nvidia and an writer of the Eureka paper, mentioned within the weblog put up. “Eureka is a primary step towards creating new algorithms that combine generative and reinforcement studying strategies to unravel onerous duties.”

Nvidia Analysis additionally revealed the Eureka library of AI algorithms for folks to experiment with them utilizing Nvidia Isaac Gymnasium, a physics simulation reference software for reinforcement studying analysis. Isaac Gymnasium is constructed on Nvidia Omniverse, a growth platform for constructing 3D instruments and functions based mostly on the OpenUSD framework.

Work builds on earlier Nvidia work on AI brokers

Hype over AI agents has been swirling for months, together with with the rise of autonomous AI brokers like Auto-GPT, BabyAGI and AgentGPT again in April.

The present Nvidia Analysis work builds on earlier efforts together with the latest Voyager, an AI agent constructed with GPT-4 that may autonomously play Minecraft. In a New York Times article this week on efforts to remodel chatbots into on-line brokers, Jeff Clune, a pc science professor on the College of British Columbia who was beforehand an OpenAI researcher, mentioned that “it is a enormous industrial alternative, doubtlessly trillions of {dollars},” whereas including that “this has an enormous upside — and large penalties — for society.”

Outperforms knowledgeable human-engineered rewards

In a brand new analysis paper titled “Eureka: Human-level reward design through coding massive language fashions,” the authors mentioned that Eureka “exploits the outstanding zero-shot era, code-writing, and in-context enchancment capabilities of state-of-the-art LLMs, comparable to GPT-4, to carry out evolutionary optimization over reward code.”

The ensuing rewards, they mentioned, can be utilized to accumulate complicated expertise by way of reinforcement studying. “With none task-specific prompting or pre-defined reward templates, Eureka generates reward capabilities that outperform knowledgeable human-engineered rewards. In a various suite of 29 open-source RL environments that embody 10 distinct robotic morphologies, Eureka outperforms human specialists on 83% of the duties, resulting in a median normalized enchancment of 52%.”

“Eureka is a novel mixture of huge language fashions and Nvidia’s GPU-accelerated simulation applied sciences,” mentioned Jim Fan, senior analysis scientist at NVIDIA, who’s one of many mission’s contributors, within the weblog put up. “We consider that Eureka will allow dexterous robotic management and supply a brand new technique to produce bodily lifelike animations for artists.”

Source link

Artificial Intelligence
in Action

Top Stories

How Meta’s CyberSecEval 3 can help combat weaponized LLMs

Forrester’s CISO budget priorities include API, supply chain security

Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL

New Nvidia AI agent, powered by GPT-4, can train robots

Work builds on earlier Nvidia work on AI brokers

Outperforms knowledgeable human-engineered rewards

Leave a Reply Cancel reply

Related Strories

Any AI Agent Can Talk. Few Can Be Trusted

Spot AI introduces the world’s first universal AI agent builder for security cameras

Manus AI, Know the Use of General AI Agent, Capabilities & Examples

From Evo 1 to Evo 2: How NVIDIA is Redefining Genomic Research and AI-Driven Biological Innovations

Quick links

Popular Categories

Follow Socials

Artificial Intelligence in Action

Top Stories

How Meta’s CyberSecEval 3 can help combat weaponized LLMs

Forrester’s CISO budget priorities include API, supply chain security

Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL

New Nvidia AI agent, powered by GPT-4, can train robots

Work builds on earlier Nvidia work on AI brokers

Outperforms knowledgeable human-engineered rewards

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.

Leave a Reply Cancel reply

Any AI Agent Can Talk. Few Can Be Trusted

Spot AI introduces the world’s first universal AI agent builder for security cameras

Manus AI, Know the Use of General AI Agent, Capabilities & Examples

From Evo 1 to Evo 2: How NVIDIA is Redefining Genomic Research and AI-Driven Biological Innovations

Get Insider Tips and Tricks in Our Newsletter!

Artificial Intelligence
in Action