Google Bard gets image generation and a more capable Gemini Pro to take on ChatGPT

Google is updating its Bard AI chatbot to step up its competitors with rival OpenAI’s ChatGPT. The Sundar Pichai-led web large right this moment introduced it’s increasing Bard to now embrace picture era capabilities, powered by its personal Imagen 2 AI mannequin, in addition to a extra succesful model of Gemini Professional.

Contents

Gemini Professional with multi-lingual help Imagen-2 on Bard to tackle ChatGPT Plus with DALL-E 3 A brand new strategy to iterate on AI pictures

The transfer provides extra individuals entry to Bard’s AI smarts, together with a brand new free instrument to create AI pictures.

“These updates make Bard an much more useful and globally accessible AI collaborator for every little thing from large, inventive initiatives to smaller, on a regular basis duties,” Jack Krawczyk, product lead for Bard, famous in a blog post.

Individually, the corporate additionally introduced it’s experimenting with one other picture generator, dubbed ImageFX, beginning right this moment.

Gemini Professional with multi-lingual help

Over a month in the past, Google introduced Gemini in three sizes: Nano for cell gadgets, Professional for extra intermediate use instances, and Extremely, what it claimed to be essentially the most highly effective and succesful giant language mannequin (LLM) but developed by any firm — much more highly effective than GPT-4 — although this one just isn’t due out till later this 12 months.

Third-party comparisons between Gemini Professional, essentially the most highly effective LLM presently obtainable from Google, and different fashions discovered that it really lags behind even OpenAI’s older GPT-3.5 Turbo, a worrying signal for Google because it seeks to indicate the world it has the juice to tackle the brand new insurgents within the generative AI race. Google did launch a fine-tuned model of Gemini Professional on Bard final month, however solely in English.

However right this moment’s flurry of recent consumer-facing AI bulletins ought to assist Google shut the hole. The most recent replace for Bard, Gemini Professional shall be obtainable in over 40 languages — together with Korean, Spanish, Tamil, Italian and Russian — throughout greater than 230 international locations and territories.

This not solely provides extra individuals entry to Gemini Professional’s superior understanding, summarizing, reasoning and coding capabilities but in addition Bard’s double-check function, which validates a response by looking throughout the net.

Imagen-2 on Bard to tackle ChatGPT Plus with DALL-E 3

Most significantly, the long-awaited AI picture era capabilities are additionally coming in. That is being delivered with the assistance of the Imagen 2 mannequin, which, Google says, can produce high-quality, photorealistic outputs from textual content inputs, turning Bard into extra of a direct and succesful competitor to OpenAI’s ChatGPT Plus with DALL-E 3 picture generator mannequin, which has been obtainable to customers of OpenAI’s subscription tiers since October 2023.

“Simply kind in an outline — like “create a picture of a canine using a surfboard” — and Bard will generate customized, wide-ranging visuals to assist deliver your thought to life,” Krawczyk famous.

Imagen 2 in action on Bard — *Imagen 2 in motion on Bard*

We examined picture era on Bard and located that it produces outputs in about 30-40 seconds with good consistency. In some instances, nevertheless, it didn’t generate the picture altogether – even when it didn’t contain any famed particular person, which Google filters out (more likely to keep away from scandalous deepfakes just like what occurred with the musician Taylor Swift and customers of Microsoft’s Designer AI picture generator powered by OpenAI’s DALL-E 3).

There’s additionally no help to vary the side ratio of outputs or any immediate in another language other than English at this stage — a minimum of not from our preliminary utilization of the instrument.

Nevertheless, what’s good is that given the copyright infringement considerations round AI-generated media, Google Bard is giving customers the choice to report authorized points beneath knowledge safety, copyright and different legal guidelines for all generated media.

The corporate additionally famous that it limits the manufacturing of violent, offensive or sexually express content material and has used Deepmind-developed SynthID to embed digitally identifiable watermarks into the pixels of generated pictures. This may also help individuals differentiate if a visible has been generated with Google’s AI or an precise human artist.

A brand new strategy to iterate on AI pictures

Past updates for Bard, Google additionally introduced that it’s experimenting with ImageFX, a brand new instrument for picture era powered by Imagen 2.

Out there beginning right this moment in AI Test Kitchen, Google’s app for experimental AI initiatives, ImageFX tries to spur inventive concepts with “expressive chips” that give customers adjoining dimensions and recommendations to iterate on their immediate. This sort of function can be obtainable on aggressive instruments, together with Ideogram.

The AI Check Kitchen additionally contains different attention-grabbing experimental initiatives from Google, together with MusicFX, which might now create tunes as much as 70 seconds in size with textual content prompts and expressive chips, and TextFX, a generative AI experiment for lyricists, wordsmiths and different inventive artists.

Source link

Registro says:

December 26, 2025 at 6:59 pm

Thank you for your sharing. I am worried that I lack creative ideas. It is your article that makes me full of hope. Thank you. But, I have a question, can you help me? https://accounts.binance.info/ru-UA/register-person?ref=JVDCDCK4

Artificial Intelligence
in Action

Top Stories

How Meta’s CyberSecEval 3 can help combat weaponized LLMs

Forrester’s CISO budget priorities include API, supply chain security

Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL

Google Bard gets image generation and a more capable Gemini Pro to take on ChatGPT

Gemini Professional with multi-lingual help

Imagen-2 on Bard to tackle ChatGPT Plus with DALL-E 3

A brand new strategy to iterate on AI pictures

Leave a Reply Cancel reply

Related Strories

Simple Guide to Training Your Team to Use ChatGPT Effectively

How to Use ChatGPT to Review and Shortlist Resumes Efficiently

Mastering ChatGPT Prompt Patterns: Templates for Every Use

How to Write Smarter ChatGPT Prompts: Techniques & Examples

Quick links

Popular Categories

Follow Socials

Artificial Intelligence in Action

Top Stories

How Meta’s CyberSecEval 3 can help combat weaponized LLMs

Forrester’s CISO budget priorities include API, supply chain security

Table-augmented generation shows promise for complex dataset querying, outperforms text-to-SQL

Google Bard gets image generation and a more capable Gemini Pro to take on ChatGPT

Gemini Professional with multi-lingual help

Imagen-2 on Bard to tackle ChatGPT Plus with DALL-E 3

A brand new strategy to iterate on AI pictures

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.

Leave a Reply Cancel reply

Simple Guide to Training Your Team to Use ChatGPT Effectively

How to Use ChatGPT to Review and Shortlist Resumes Efficiently

Mastering ChatGPT Prompt Patterns: Templates for Every Use

How to Write Smarter ChatGPT Prompts: Techniques & Examples

Get Insider Tips and Tricks in Our Newsletter!

Artificial Intelligence
in Action