Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

“Pokémon Pokopia” is a game about restoring a broken world — and I love it

DOGE employee stole Social Security data and thumbed it, report says

Mandiant founder just raised $190 million for autonomous AI security agent startup

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Amazon is launching its AI health assistant on its website and app

    11 March 2026

    Sandbar secures $23M Series A for AI note-taking ring

    10 March 2026

    OpenAI and Google employees are quick to defend Anthropic in the DOD lawsuit

    10 March 2026

    OpenAI hardware executive Caitlin Kalinowski resigns in response to Pentagon deal

    9 March 2026

    Will Pentagon standoff over Anthropic scare startups out of defense work?

    9 March 2026
  • Apps

    YouTube surpasses Disney, Paramount, WBD in ad revenue in 2025

    11 March 2026

    X says it will suspend creators from revenue sharing program for AI posts without ‘armed conflict’ tag

    10 March 2026

    Periwinkle makes it even easier to host social media on Bluesky’s AT Protocol

    10 March 2026

    Meta will enable competing AI chatbots on WhatsApp in Europe, but for a fee

    9 March 2026

    Match Group COO out as dating apps struggle to connect with Gen Z

    9 March 2026
  • Crypto

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025

    MoviePass opens Mogul fantasy league game to the public

    29 October 2025
  • Fintech

    X taps William Shatner to give invitations to his payment service, X Money

    4 March 2026

    Stripe wants to turn your AI costs into a profit center

    3 March 2026

    3 days left: Save up to $680 on your ticket to Disrupt 2026

    25 February 2026

    More startups surpass $10M ARR in 3 months than ever before

    24 February 2026

    Stripe, PayPal Ventures Bet on India’s Xflow to Fix Cross-Border B2B Payments

    24 February 2026
  • Hardware

    Hyperscale Power is the latest startup to challenge 140-year-old transformer technology

    10 March 2026

    Whoop is launching a new blood test focused on women’s health

    10 March 2026

    Honor says its ‘Robot phone’ with moving camera can dance to music

    8 March 2026

    Apple unveils M5 Pro and M5 Max chips with new ‘Fusion Architecture’

    8 March 2026

    Eight Sleep raises $50 million at $1.5 billion valuation

    7 March 2026
  • Media & Entertainment

    “Pokémon Pokopia” is a game about restoring a broken world — and I love it

    11 March 2026

    YouTube extends fake AI detection to politicians, government officials and journalists

    10 March 2026

    Xprize Founder Peter Diamandis Launches New Contest To Announce New ‘Star Trek’

    10 March 2026

    It looks like the DOJ isn’t going to break up Live Nation and Ticketmaster

    9 March 2026

    PopSockets founder David Barnett talks about building a viral business

    7 March 2026
  • Security

    DOGE employee stole Social Security data and thumbed it, report says

    11 March 2026

    US military contractor likely built iPhone hacking tools used by Russian spies in Ukraine

    10 March 2026

    An iPhone hacking toolkit used by Russian spies likely came from a US military contractor

    10 March 2026

    Russian government hackers are targeting Signal and WhatsApp users, Dutch spies warn

    9 March 2026

    The Ring’s Jamie Siminoff tries to calm privacy fears from the Super Bowl, but his answers may not help

    9 March 2026
  • Startups

    Mandiant founder just raised $190 million for autonomous AI security agent startup

    11 March 2026

    AI networking startup Eridu emerges from stealth with hefty $200M Series A

    10 March 2026

    Bluesky CEO Jay Graber is stepping down

    10 March 2026

    Science Corp. raises $230 million as it races to bring its brain implant to market

    6 March 2026

    EXCLUSIVE: Luma Launches Creative AI Agents Powered by New ‘Unified Intelligence’ Models

    6 March 2026
  • Transportation

    GM figured out how to deal with EV uncertainty with the Chevy Bolt

    11 March 2026

    Electric air taxi maker Archer hits back at Joby alleging hidden Chinese ties

    10 March 2026

    Electric air taxis are set to fly in 26 states

    10 March 2026

    The 2027 Chevy Bolt is the McRib of the automotive world

    9 March 2026

    TechCrunch Mobility: Rivian’s R2 game

    9 March 2026
  • Venture

    This SpaceX Veteran Says The Next Big Thing In Space Is Satellites Returning To Earth

    10 March 2026

    Founders Fund is approaching $6 billion for its latest growth fund, sources say

    10 March 2026

    Robinhood’s startup fund stumbles in its NYSE debut

    7 March 2026

    City Detect, which uses artificial intelligence to help cities stay safe and clean, raises $13M Series A

    7 March 2026

    Lio raises $30 million from Andreessen Horowitz and others to automate business procurement

    5 March 2026
  • Recommended Essentials
TechTost
You are at:Home»Startups»Anthropic says some Claude models can now end ‘harmful or abusive’ conversations
Startups

Anthropic says some Claude models can now end ‘harmful or abusive’ conversations

techtost.comBy techtost.com16 August 202502 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Anthropic Says Some Claude Models Can Now End 'harmful Or
Share
Facebook Twitter LinkedIn Pinterest Email

The man has announced new possibilities This will allow some of its newer, larger models to terminate conversations in what the company describes as “rare, extreme cases of persistently harmful or abusive user interactions”. Impressively, Anthropic says he does not protect the human user, but the AI model itself.

To make it clear, the company does not claim that Claude AI models are feeling or can harm their conversations with users. In his own words, the humanity remains “very uncertain about the possible moral condition of Claude and others LLM, now or in the future”.

However, his announcement points out a recent program created to study what he calls “model prosperity” and says that Anthropic is essentially taking a just-in-case approach, “working to identify and implement low-cost interventions to alleviate the risk of the model”.

This last change is currently limited to Claude Opus 4 and 4.1. Again, it is assumed that it will only occur in “extreme cases of limbs”, such as “requests from users for sexual content that includes minors and efforts to request information that would allow for violence on large -scale or acts of terrorism”.

While these types of applications could possibly create legal or public problems for humanity itself (witnesses recent reports on how Chatgpt may potentially enhance or contribute to the delusional thinking of its users), the company says that during the test before the installation, Claude 4 and a model of obvious dysfunction “when he did.

As for these new opportunities that the conversations end, the company says: “In all cases, Claude is only to use the ability that ends the discussion as a last resort when multiple redirect efforts have been exhausted and the hope of a productive interaction is exhausted or when a user is explicitly demanding.

Anthropic also says that Claude has “he is directed not to use this ability in cases where users may be at impending risk of harming themselves or others”.

TechCrunch event

Francisco
|
27-29 October 2025

When Claude ends a discussion, Anthropic says users will still be able to start new conversations from the same account and create new branches of the annoying conversation by editing their answers.

“We treat this feature as an ongoing experiment and will continue to improve our approach,” the company says.

abusive Anthropic Classical Claude conversations harmful Human models
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleSenator Hawley to investigate meta after reporting finds AI Chatbots Flirt with children
Next Article The judge says FTC’s investigation into media issues should be worried about all Americans ”
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Mandiant founder just raised $190 million for autonomous AI security agent startup

11 March 2026

AI networking startup Eridu emerges from stealth with hefty $200M Series A

10 March 2026

Bluesky CEO Jay Graber is stepping down

10 March 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

“Pokémon Pokopia” is a game about restoring a broken world — and I love it

11 March 2026

DOGE employee stole Social Security data and thumbed it, report says

11 March 2026

Mandiant founder just raised $190 million for autonomous AI security agent startup

11 March 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

X taps William Shatner to give invitations to his payment service, X Money

4 March 2026

Stripe wants to turn your AI costs into a profit center

3 March 2026

3 days left: Save up to $680 on your ticket to Disrupt 2026

25 February 2026
Startups

Mandiant founder just raised $190 million for autonomous AI security agent startup

AI networking startup Eridu emerges from stealth with hefty $200M Series A

Bluesky CEO Jay Graber is stepping down

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.