TechTost
Anthropic says some Claude models can now end ‘harmful or abusive’ conversations

By techtost.com | 16 August 2025 | 2 Mins Read

Anthropic has announced new capabilities that will allow some of its newest, largest models to end conversations in what the company describes as “rare, extreme cases of persistently harmful or abusive user interactions”. Strikingly, Anthropic says it is doing this not to protect the human user, but rather the AI model itself.

To be clear, the company is not claiming that its Claude AI models are sentient or can be harmed by their conversations with users. In its own words, Anthropic remains “highly uncertain about the potential moral status of Claude and other LLMs, now or in the future”.

However, its announcement points to a recent program created to study what it calls “model welfare” and says that Anthropic is essentially taking a just-in-case approach, “working to identify and implement low-cost interventions to mitigate risks to model welfare”.

This latest change is currently limited to Claude Opus 4 and 4.1. And again, it is only supposed to happen in “extreme edge cases”, such as “requests from users for sexual content involving minors and attempts to solicit information that would enable large-scale violence or acts of terror”.

While these types of requests could potentially create legal or publicity problems for Anthropic itself (witness recent reporting on how ChatGPT can potentially reinforce or contribute to its users’ delusional thinking), the company says that in pre-deployment testing, Claude Opus 4 showed a “strong preference against” responding to these requests and a “pattern of apparent distress” when it did so.

As for these new conversation-ending capabilities, the company says: “In all cases, Claude is only to use its conversation-ending ability as a last resort when multiple attempts at redirection have failed and hope of a productive interaction has been exhausted, or when a user explicitly asks Claude to end a chat”.

Anthropic also says that Claude has been “directed not to use this ability in cases where users might be at imminent risk of harming themselves or others”.

When Claude does end a conversation, Anthropic says users will still be able to start new conversations from the same account and to create new branches of the troublesome conversation by editing their responses.

“We’re treating this feature as an ongoing experiment and will continue refining our approach,” the company says.

bhanuprakash.cg
techtost.com