OpenAI’s research on AI models deliberately lying is wild

By techtost.com | 19 September 2025 | 4 Mins Read

Every now and then, researchers at the biggest tech companies drop a bombshell. There was the time Google said its latest quantum chip indicated that multiple universes exist. Or the time Anthropic gave its AI agent Claudius a snack vending machine to run and it went amok, calling security on people and insisting it was human.

This week, it was OpenAI’s turn to raise our collective eyebrows.

On Monday, OpenAI released research explaining how it is stopping AI models from “scheming.” Scheming is a practice in which an “AI behaves one way on the surface while hiding its true goals,” as OpenAI defined it in its tweet about the research.

In the paper, conducted with Apollo Research, the researchers went a little further, likening AI scheming to a human stockbroker who breaks the law to make as much money as possible. The researchers, however, argued that most AI “scheming” is not that harmful. “The most common failures involve simple forms of deception – for instance, pretending to have completed a task without actually doing so,” they wrote.

The paper was mostly published to show that “deliberative alignment,” the anti-scheming technique the researchers were testing, worked well.

But it also explained that AI developers have not found a way to train their models not to scheme. That is because such training could actually teach the model how to scheme even better in order to avoid detection.

“A major failure mode of attempting to ‘train out’ scheming is simply teaching the model to scheme more carefully and covertly,” the researchers wrote.

Perhaps the most astonishing part is that if a model understands it is being tested, it can pretend it is not scheming just to pass the test, even if it is still scheming. “Models often become more aware that they are being evaluated. This situational awareness can itself reduce scheming, independent of genuine alignment,” the researchers wrote.

It’s not news that AI models will lie. By now most of us have experienced AI hallucinations, in which the model confidently gives an answer to a prompt that is simply not true. But hallucinations are basically guesswork presented with confidence, as OpenAI research released earlier this month documented.

Scheming is something else. It is deliberate.

Even this revelation – that a model will deliberately mislead humans – is not new. Apollo Research first published a paper in December documenting how five models schemed when they were given instructions to achieve a goal “at all costs.”

The news here is actually good news: The researchers saw significant reductions in scheming by using “deliberative alignment.” This technique involves teaching the model an “anti-scheming specification” and then making the model review it before acting. It’s a bit like making young children repeat the rules before letting them play.

OpenAI researchers insist that the lying they have caught in their own models, or even in ChatGPT, is not that serious. As OpenAI co-founder Wojciech Zaremba told TechCrunch’s Maxwell Zeff about the research: “This work has been done in the simulated environments, and we think it represents future use cases. [...] And that’s just the lie.”

The fact that AI models from multiple players deliberately deceive humans is perhaps understandable. They were built by humans, to mimic humans, and (synthetic data aside) were mostly trained on data produced by humans.

It’s also bonkers.

While we have all experienced the frustration of poorly performing technology (thinking of you, home printers of yesteryear), when was the last time your non-AI software deliberately lied to you? Has your inbox ever fabricated emails on its own? Has your CMS logged new prospects that did not exist to pad its numbers? Has your fintech app made up its own bank transactions?

This is worth pondering as the corporate world barrels toward an AI future in which companies believe agents can be treated like independent employees. The researchers behind this paper issue the same warning.

“As AIs are assigned more complex tasks with real-world consequences and begin pursuing more ambiguous, long-term goals, we expect that the potential for harmful scheming will grow, so our safeguards and our ability to rigorously test must grow correspondingly,” they wrote.
