Google's and Microsoft's chatbots are making up Super Bowl stats

If you happen to wanted extra proof that GenAI is inclined to creating stuff up, Google’s Gemini chatbot, previously Bard, thinks that the 2024 Tremendous Bowl already occurred. It even has the (fictional) statistics to again it up.

Per a Reddit thread, Gemini, powered by Google’s GenAI fashions of the identical title, is answering questions on Tremendous Bowl LVIII as if the sport wrapped up yesterday — or weeks earlier than. Like many bookmakers, it appears to favor the Chiefs over the 49ers (sorry, San Francisco followers).

Gemini adorns fairly creatively, in not less than one case giving a participant stats breakdown suggesting Kansas Chief quarterback Patrick Mahomes ran 286 yards for 2 touchdowns and an interception versus Brock Purdy’s 253 working yards and one landing.

Picture Credit: /r/smellymonster (opens in a new window)

It’s not simply Gemini. Microsoft’s Copilot chatbot, too, insists the sport ended and supplies misguided citations to again up the declare. However — maybe reflecting a San Francisco bias! — it says the 49ers, not the Chiefs, emerged victorious “with a last rating of 24-21.”

Picture Credit: Kyle Wiggers / TechCrunch

Copilot is powered by a GenAI mannequin related, if not equivalent, to the mannequin underpinning OpenAI’s ChatGPT (GPT-4). However in my testing, ChatGPT was loath to make the identical mistake.

Picture Credit: Kyle Wiggers / TechCrunch

It’s all somewhat foolish — and probably resolved by now, provided that this reporter had no luck replicating the Gemini responses within the Reddit thread. (I’d be shocked if Microsoft wasn’t engaged on a repair as effectively.) But it surely additionally illustrates the key limitations of at this time’s GenAI — and the risks of inserting an excessive amount of belief in it.

GenAI fashions haven’t any actual intelligence. Fed an infinite variety of examples normally sourced from the general public net, AI fashions learn the way seemingly information (e.g. textual content) is to happen primarily based on patterns, together with the context of any surrounding information.

This probability-based method works remarkably effectively at scale. However whereas the vary of phrases and their chances are seemingly to lead to textual content that is smart, it’s removed from sure. LLMs can generate one thing that’s grammatically right however nonsensical, for example — just like the declare concerning the Golden Gate. Or they will spout mistruths, propagating inaccuracies of their coaching information.

It’s not malicious on the LLMs’ half. They don’t have malice, and the ideas of true and false are meaningless to them. They’ve merely discovered to affiliate sure phrases or phrases with sure ideas, even when these associations aren’t correct.

Therefore Gemini’s and Copilot’s Tremendous Bowl falsehoods.

Google and Microsoft, like most GenAI distributors, readily acknowledge that their GenAI apps aren’t excellent and are, actually, inclined to creating errors. However these acknowledgements come within the type of small print I’d argue might simply be missed.

Tremendous Bowl disinformation actually isn’t essentially the most dangerous instance of GenAI going off the rails. That distinction most likely lies with endorsing torture, reinforcing ethnic and racial stereotypes or writing convincingly about conspiracy theories. It’s, nevertheless, a helpful reminder to double-check statements from GenAI bots. There’s a good likelihood they’re not true.

Source link

Artificial Intelligence
in Action

Top Stories

Silicon Valley shaken as open-source AI models Llama 3.1 and Mistral Large 2 match industry leaders

OpenAI Unveils SearchGPT: A New AI-Powered Search Engine

How Salesforce’s MINT-1T dataset could disrupt the AI industry

Google’s and Microsoft’s chatbots are making up Super Bowl stats

Leave a Reply Cancel reply

Related Strories

Silicon Valley shaken as open-source AI models Llama 3.1 and Mistral Large 2 match industry leaders

OpenAI Unveils SearchGPT: A New AI-Powered Search Engine

How Salesforce’s MINT-1T dataset could disrupt the AI industry

AI’s Growing Power Needs: Tech Industry’s Move Towards Nuclear Power

Quick links

Popular Categories

Follow Socials

Artificial Intelligence in Action

Top Stories

Silicon Valley shaken as open-source AI models Llama 3.1 and Mistral Large 2 match industry leaders

OpenAI Unveils SearchGPT: A New AI-Powered Search Engine

How Salesforce’s MINT-1T dataset could disrupt the AI industry

Google’s and Microsoft’s chatbots are making up Super Bowl stats

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.

Leave a Reply Cancel reply

Silicon Valley shaken as open-source AI models Llama 3.1 and Mistral Large 2 match industry leaders

OpenAI Unveils SearchGPT: A New AI-Powered Search Engine

How Salesforce’s MINT-1T dataset could disrupt the AI industry

AI’s Growing Power Needs: Tech Industry’s Move Towards Nuclear Power

Get Insider Tips and Tricks in Our Newsletter!

Artificial Intelligence
in Action