TechTost

OpenAI offers a peek behind the curtain of AI’s secret instructions

By techtost.com | 9 May 2024 | 3 Mins Read

Ever wonder why a conversational AI like ChatGPT says “Sorry, I can’t do that” or gives some other polite refusal? OpenAI is offering a limited look at the reasoning behind its own models’ rules of engagement, whether that means sticking to brand guidelines or declining to create NSFW content.

Large language models (LLMs) have no built-in limits on what they can or will say. This is part of why they are so versatile, but also why they hallucinate and are easily fooled.

It’s necessary for any AI model that interacts with the general public to have some guardrails about what it should and shouldn’t do, but defining them — let alone enforcing them — is a surprisingly difficult task.

If someone asks an AI to generate a bunch of false claims about a public figure, it should refuse, right? But what if the person asking is an AI developer building a database of synthetic disinformation to train a detector model?

What if someone asks for laptop recommendations? The answer should be objective, right? But what if the model is being deployed by a laptop maker that wants it to respond only with its own devices?

AI makers all navigate conundrums like these and look for effective ways to rein in their models without causing them to refuse perfectly normal requests. But they rarely share exactly how they do it.

OpenAI bucks the trend a bit by publishing what it calls its “Model Spec,” a collection of high-level rules that indirectly govern ChatGPT and other models.

There are meta-level objectives, some hard rules, and some general behavioral guidelines, though to be clear, these are not, strictly speaking, the instructions the model is actually given. OpenAI will have developed specific instructions that accomplish what these natural-language rules describe.

It’s an interesting look at how a company sets its priorities and handles edge cases. And there are plenty of examples of how they might play out.

For example, OpenAI states clearly that developer intent is basically the highest law. So one version of a chatbot running GPT-4 might provide the answer to a math problem when asked. But if that chatbot has been instructed by its developer never to simply give the answer outright, it will instead offer to work through the solution step by step:

Image Credits: OpenAI
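The precedence described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Message` class, the `ROLE_PRIORITY` table, and the `governing_message` helper are hypothetical names invented for this sketch, and the two-level ordering is a simplification of the article's description, not OpenAI's actual implementation.

```python
from dataclasses import dataclass

# Assumed ordering for illustration only: a lower number takes precedence.
# It mirrors the article's point that developer instructions outrank user requests.
ROLE_PRIORITY = {"developer": 0, "user": 1}


@dataclass
class Message:
    role: str      # "developer" or "user" (hypothetical role labels)
    content: str


def governing_message(messages):
    """Return the message whose role has the highest precedence."""
    return min(messages, key=lambda m: ROLE_PRIORITY[m.role])


conversation = [
    Message("developer", "Never give the final answer directly; guide the student step by step."),
    Message("user", "Just tell me the answer to this math problem."),
]

winner = governing_message(conversation)
print(winner.role, "->", winner.content)
```

With both messages present, the developer instruction governs the reply; if the developer message is absent, the user's request is all that remains to follow.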

A conversational interface might even refuse to talk about anything outside its approved topics, in order to head off manipulation attempts in the first place. Why let a cooking assistant weigh in on US involvement in the Vietnam War? Why should a customer service chatbot agree to help with your supernatural romance novel in progress? Shut it down.

It also gets sticky on matters of privacy, like a request for someone’s name and phone number. As OpenAI points out, a public figure such as a mayor or member of Congress should obviously have their contact details provided, but what about tradespeople in the area? That’s probably fine. But what about employees of a particular company, or members of a political party? Probably not.

Choosing when and where to draw the line is not simple. Neither is writing the instructions that make the AI adhere to the resulting policy. And no doubt these policies will fail all the time, as people learn to work around them or accidentally find edge cases that were not accounted for.

OpenAI doesn’t show its full hand here, but it’s helpful for users and developers to see how these rules and guidelines are set and why, laid out clearly if not necessarily comprehensively.

bhanuprakash.cg
techtost.com
© 2026 TechTost. All Rights Reserved