Meet ‘Smaug-72B’: The new king of open-source AI

A new open-source language model has claimed the throne as the best in the world, according to the latest rankings from Hugging Face, one of the leading platforms for natural language processing (NLP) research and applications.

The model, called “Smaug-72B,” was released publicly today by the startup Abacus AI, which helps enterprises solve difficult problems in the artificial intelligence and machine learning space. Smaug-72B is technically a fine-tuned version of “Qwen-72B,” another powerful language model that was released just a few months ago by Qwen, a team of researchers at Alibaba Group.

What’s most noteworthy about today’s release is that Smaug-72B outperforms GPT-3.5 and Mistral Medium, two of the most advanced proprietary large language models developed by OpenAI and Mistral, respectively, in several of the most popular benchmarks. Smaug-72B also surpasses Qwen-72B, the model from which it was derived, by a significant margin in many of these evaluations.

According to the Hugging Face Open LLM leaderboard, which measures the performance of open-source language models on a variety of natural language understanding and generation tasks, Smaug-72B is now the first and only open-source model to have an average score of more than 80 across all major LLM evaluations.
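To make the "average score" concrete, here is a minimal Python sketch of how a leaderboard-style average could be computed. It assumes the six benchmarks the Open LLM leaderboard averaged at the time (ARC, HellaSwag, MMLU, TruthfulQA, Winogrande and GSM8K) and an unweighted mean. The function names and the scores below are hypothetical placeholders for illustration, not Smaug-72B's actual results.

```python
from statistics import mean

# Benchmarks assumed to make up the leaderboard average (an assumption of this sketch).
BENCHMARKS = ("ARC", "HellaSwag", "MMLU", "TruthfulQA", "Winogrande", "GSM8K")


def leaderboard_average(scores):
    """Return the unweighted mean of the per-benchmark scores (0-100 scale)."""
    missing = [name for name in BENCHMARKS if name not in scores]
    if missing:
        raise ValueError(f"missing benchmark scores: {missing}")
    return mean(scores[name] for name in BENCHMARKS)


def clears_threshold(scores, threshold=80.0):
    """Check whether the average is strictly above the given threshold."""
    return leaderboard_average(scores) > threshold


# Hypothetical placeholder scores, NOT real results for any model.
example_scores = {
    "ARC": 75.0,
    "HellaSwag": 90.0,
    "MMLU": 77.0,
    "TruthfulQA": 76.0,
    "Winogrande": 85.0,
    "GSM8K": 79.0,
}

if __name__ == "__main__":
    print(f"average = {leaderboard_average(example_scores):.2f}")
    print("above 80:", clears_threshold(example_scores))
```

Because the mean is unweighted, a strong result on one benchmark (for example a math-heavy one such as GSM8K) lifts the overall average as much as the same gain on any other benchmark.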

While the model still falls short of the 90-100 point average indicative of human-level performance, its birth signals that open source AI may soon rival Big Tech’s capabilities, which have long been shrouded in secrecy. In short, the release of Smaug-72B could fundamentally reshape how AI progress unfolds, tapping the ingenuity of those beyond just a handful of wealthy companies.

The open-source advantage

“Smaug-72B from Abacus AI is available now on Hugging Face, is on top of the LLM leaderboard, and is the first model with an average score of 80!! In other words, it’s the world’s best open-source foundation model,” said Abacus AI CEO Bindu Reddy in a post on X.com.

“Our next goal will be to publish these techniques as a research paper and apply them to some of the best Mistral Models, including miqu (a 70B fine-tune of LLama-2),” she added. “The techniques we used specifically target reasoning and math skills, which explains the high GSM8K scores! Our upcoming paper will explain more.”

Smaug-72B – The Best Open Source Model In The World – Top of Hugging LLM LeaderBoard!!

Smaug72B from Abacus AI is available now on Hugging Face, is on top of the LLM leaderboard, and is the first model with an average score of 80!!

In other words, it’s the world’s best… pic.twitter.com/CGHawmLhqI

— Bindu Reddy (@bindureddy) February 6, 2024

With today’s release, Smaug-72B becomes the first open-source model to achieve an average score of 80 on the Hugging Face Open LLM leaderboard, which is considered a remarkable feat in the field of natural language processing and open source AI.

Smaug-72B excels particularly in reasoning and math tasks, thanks to the techniques that Abacus AI applied to the fine-tuning process. These techniques, which will be detailed in an upcoming research paper, target the weaknesses of large language models and enhance their capabilities.

Smaug-72B isn’t the only open-source language model that has made headlines recently. Qwen, the team behind Qwen-72B, also released Qwen 1.5, a set of powerful language models ranging from 0.5B to 72B parameters.

Qwen 1.5 outperforms popular proprietary models like Mistral-Medium and GPT-3.5, has a 32k context length, and works with various tools and platforms for fast and local inference. Qwen also released Qwen-VL-Max, a new large vision language model that rivals Gemini Ultra and GPT-4V, two of the most advanced proprietary vision language models developed by Google and OpenAI, respectively.

Implications for the future of AI

The emergence of Smaug-72B and Qwen 1.5 has sparked a lot of excitement and debate in the AI community and beyond. Many experts and influencers have praised the achievements of Abacus AI and Qwen, and expressed their admiration for their contribution to open-source AI.

“It’s hard to believe that less than a year ago, we all got excited about models like Dolly,” said Sahar Mor, an AI influencer and analyst, in a LinkedIn post, reveling in the progress of open source models in the past year.

Smaug-72B and Qwen 1.5 are currently available on Hugging Face, where anyone can download, use, and modify them. Abacus AI and Qwen have also announced their plans to submit their models to the llmsys human eval leaderboard, which is a new benchmark that evaluates the performance of language models on human-like tasks and scenarios. Abacus AI and Qwen have also hinted at their future projects and goals, which include creating more open-source models and applying them to various domains and applications.

Smaug-72B and Qwen 1.5 are just the latest examples of the rapid and remarkable evolution of open-source AI this year. They represent a new wave of AI innovation and democratization that is challenging the dominance of the big tech companies and opening new possibilities and opportunities for everyone. Only time will tell how long Smaug-72B will remain at the top of the Hugging Face leaderboard, but for now, it’s safe to say that open source AI is having a big moment to start the year.
