How Anthropic found a trick to get AI to give you answers it’s not supposed to


If you build it, people will try to break it. Sometimes the people building a thing are the very ones breaking it. Such is the case with Anthropic and its latest research, which demonstrates an interesting vulnerability in current LLM technology. Roughly speaking, if you keep at a question, you can break through guardrails and wind up with large language models telling you things they're designed not to. Like how to build a bomb.

Of course, given progress in open-source AI technology, you can spin up your own LLM locally and simply ask it whatever you want, but for more consumer-grade products this is an issue worth pondering. What's fun about AI today is the rapid pace at which it's advancing, and how well (or not) we're doing as a species at understanding what we're building.

If you'll allow me the thought, I wonder whether we're going to see more questions and problems of the sort Anthropic outlines as LLMs and other new AI model types get smarter and larger. Which is perhaps repeating myself. But the closer we get to more generalized AI intelligence, the more it should resemble a thinking entity rather than a computer we can simply program, right? In that case, we'd have a harder time nailing down edge cases, perhaps to the point where that work becomes unfeasible. Anyway, let's talk about what Anthropic recently shared.

