Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

All we like is soulfulness

Two Americans convicted of helping North Korea steal $5 million in fake IT worker scheme

This energy startup’s bet on 100-year-old grid technology is paying off

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Runway’s CEO Says AI Could Help Hollywood Make 50 Movies Instead of One $100 Million Blockbuster

    16 April 2026

    OpenAI updates its Agents SDK to help enterprises build safer, more capable agents

    16 April 2026

    Reid Hoffman weighs in on the ‘tokenmaxxing’ debate.

    15 April 2026

    Anthropic’s co-founder confirms the company briefed the Trump administration on Mythos

    15 April 2026

    Microsoft is working on yet another OpenClaw-like agent

    14 April 2026
  • Apps

    Canva’s AI assistant can now call on various tools to make designs for you

    16 April 2026

    AI learning app Gizmo soars with 13 million users and $22 million in investment

    16 April 2026

    Adobe’s new Firefly AI assistant can use Creative Cloud apps to complete tasks

    15 April 2026

    How the Freecash rewards app made it to the top of the app stores

    15 April 2026

    X brings voice memos back to X Chat

    14 April 2026
  • Crypto

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025
  • Fintech

    Airwallex is set to take on Stripe and the rest of the payments industry — in the physical world

    16 April 2026

    Cash app launches ‘pay later’ feature for P2P transfers

    3 April 2026

    Doss raises $55 million for AI inventory management that connects to ERP

    24 March 2026

    Despite stiff competition, Kalshi, Polymarket CEOs back $35m VC fund projections

    23 March 2026

    Amid legal turmoil, Kalshi is temporarily banned in Nevada

    20 March 2026
  • Hardware

    Amazon Unveils Slimmer Fire TV Stick HD, Opens Ember Artline TVs for Pre-Order

    16 April 2026

    Motorola is suing social platforms and creators over posts raising concerns about speech in India

    16 April 2026

    AI data center startup Fluidstack is in talks for a $1 billion round at an $18 billion valuation months after raising $7.5 billion, report says

    15 April 2026

    Amazon is ending support for older Kindle devices

    9 April 2026

    Intel signs Elon Musk’s Terafab chip project

    8 April 2026
  • Media & Entertainment

    All we like is soulfulness

    16 April 2026

    Wait, could they still break up Live Nation?

    16 April 2026

    HBO Max is coming to India through an exclusive JioHotstar deal

    15 April 2026

    YouTube Live Streams will now withhold ads during peak engagement to protect the atmosphere

    14 April 2026

    X says he’s reducing payouts to clickbait accounts

    12 April 2026
  • Security

    Two Americans convicted of helping North Korea steal $5 million in fake IT worker scheme

    16 April 2026

    Sweden blames Russian hackers for attempted ‘catastrophic’ cyberattack on thermal plant

    15 April 2026

    Adobe fixes PDF zero-day security flaw that hackers have been exploiting for months

    15 April 2026

    Someone planted backdoors in dozens of WordPress plugins used on thousands of websites

    14 April 2026

    Anodot hack leaves over a dozen compromised companies facing extortion

    14 April 2026
  • Startups

    This energy startup’s bet on 100-year-old grid technology is paying off

    16 April 2026

    Hightouch reaches $100M ARR powered by AI-powered marketing tools

    16 April 2026

    StrictlyVC San Francisco is less than a month away

    15 April 2026

    Walmart-owned Flipkart, Amazon are squeezing India’s e-commerce startups

    12 April 2026

    This founder helped build SpaceX’s most powerful rocket engine. Now he’s building a “fighter for orbit.”

    12 April 2026
  • Transportation

    Monarch Tractor collapse ends with takeover by Caterpillar

    16 April 2026

    Ford EV and chief technology officer are leaving the auto industry

    16 April 2026

    Chipmakers AMD, Arm and Qualcomm are investing in this buzzing self-driving technology startup

    15 April 2026

    London is closing in on its first robotaxi service as Waymo begins trials

    15 April 2026

    Tesla adds ‘ribs’, other stats to track how often drivers use Full Self-Driving software

    14 April 2026
  • Venture

    Anthropic rejects VC funding that values ​​it at $800B+, for now

    16 April 2026

    Financial risk management platform Pillar raises $20 million in rounds led by a16z

    15 April 2026

    Vercel CEO Guillermo Rauch signals IPO readiness as AI agents drive revenue

    14 April 2026

    Nvidia-backed SiFive hits $3.65 billion valuation for open AI chips

    11 April 2026

    How to make the Startup Battlefield Top 20 — and what each company gets regardless

    10 April 2026
  • Recommended Essentials
TechTost
You are at:Home»Startups»Anthropic says some Claude models can now end ‘harmful or abusive’ conversations
Startups

Anthropic says some Claude models can now end ‘harmful or abusive’ conversations

techtost.comBy techtost.com16 August 202502 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Anthropic Says Some Claude Models Can Now End 'harmful Or
Share
Facebook Twitter LinkedIn Pinterest Email

The man has announced new possibilities This will allow some of its newer, larger models to terminate conversations in what the company describes as “rare, extreme cases of persistently harmful or abusive user interactions”. Impressively, Anthropic says he does not protect the human user, but the AI model itself.

To make it clear, the company does not claim that Claude AI models are feeling or can harm their conversations with users. In his own words, the humanity remains “very uncertain about the possible moral condition of Claude and others LLM, now or in the future”.

However, his announcement points out a recent program created to study what he calls “model prosperity” and says that Anthropic is essentially taking a just-in-case approach, “working to identify and implement low-cost interventions to alleviate the risk of the model”.

This last change is currently limited to Claude Opus 4 and 4.1. Again, it is assumed that it will only occur in “extreme cases of limbs”, such as “requests from users for sexual content that includes minors and efforts to request information that would allow for violence on large -scale or acts of terrorism”.

While these types of applications could possibly create legal or public problems for humanity itself (witnesses recent reports on how Chatgpt may potentially enhance or contribute to the delusional thinking of its users), the company says that during the test before the installation, Claude 4 and a model of obvious dysfunction “when he did.

As for these new opportunities that the conversations end, the company says: “In all cases, Claude is only to use the ability that ends the discussion as a last resort when multiple redirect efforts have been exhausted and the hope of a productive interaction is exhausted or when a user is explicitly demanding.

Anthropic also says that Claude has “he is directed not to use this ability in cases where users may be at impending risk of harming themselves or others”.

TechCrunch event

Francisco
|
27-29 October 2025

When Claude ends a discussion, Anthropic says users will still be able to start new conversations from the same account and create new branches of the annoying conversation by editing their answers.

“We treat this feature as an ongoing experiment and will continue to improve our approach,” the company says.

abusive Anthropic Classical Claude conversations harmful Human models
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleSenator Hawley to investigate meta after reporting finds AI Chatbots Flirt with children
Next Article The judge says FTC’s investigation into media issues should be worried about all Americans ”
bhanuprakash.cg
techtost.com
  • Website

Related Posts

This energy startup’s bet on 100-year-old grid technology is paying off

16 April 2026

Hightouch reaches $100M ARR powered by AI-powered marketing tools

16 April 2026

Anthropic rejects VC funding that values ​​it at $800B+, for now

16 April 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

All we like is soulfulness

16 April 2026

Two Americans convicted of helping North Korea steal $5 million in fake IT worker scheme

16 April 2026

This energy startup’s bet on 100-year-old grid technology is paying off

16 April 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Airwallex is set to take on Stripe and the rest of the payments industry — in the physical world

16 April 2026

Cash app launches ‘pay later’ feature for P2P transfers

3 April 2026

Doss raises $55 million for AI inventory management that connects to ERP

24 March 2026
Startups

This energy startup’s bet on 100-year-old grid technology is paying off

Hightouch reaches $100M ARR powered by AI-powered marketing tools

StrictlyVC San Francisco is less than a month away

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.