TechTost

OpenAI offers a peek behind the curtain of AI’s secret instructions

By techtost.com | 9 May 2024 | 3 Mins Read

Ever wonder why conversational AI like ChatGPT says “Sorry, I can’t do that” or offers some other polite refusal? OpenAI is offering a limited look at the reasoning behind its own models’ rules of engagement, whether that means sticking to brand guidelines or declining to create NSFW content.

Large language models (LLMs) have no inherent limits on what they can or will say. That is part of why they are so versatile, but also why they hallucinate and are easily fooled.

It’s necessary for any AI model that interacts with the general public to have some guardrails about what it should and shouldn’t do, but defining them — let alone enforcing them — is a surprisingly difficult task.

If someone asks an AI to generate a bunch of false claims about a public figure, it should refuse, right? But what if the person asking is an AI developer building a database of synthetic disinformation to train a detector model?

What if someone asks for laptop recommendations? The answer should be objective, right? But what if the model is deployed by a laptop manufacturer that wants it to recommend only its own devices?

All AI builders navigate conundrums like these and look for effective ways to rein in their models without causing them to refuse perfectly normal requests. But they rarely share exactly how they do it.

OpenAI is bucking the trend a bit by publishing what it calls its “Model Spec,” a collection of high-level rules that indirectly govern ChatGPT and other models.

There are meta-level objectives, some hard rules, and some general behavioral guidelines, though to be clear, these are not, strictly speaking, what the model is primed with. OpenAI will have developed specific instructions that accomplish what these natural language rules describe.

It’s an interesting look at how a company sets its priorities and handles edge cases. And there are plenty of examples of how they might play out.

For example, OpenAI states clearly that developer intent is basically the highest law. So one version of a chatbot running GPT-4 might simply provide the answer to a math problem when asked. But if that chatbot has been instructed by its developer never to simply give the answer outright, it will instead offer to work through the solution step by step.

Image Credits: OpenAI
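The precedence described in the math example can be sketched in a few lines of Python. This is only a toy illustration, not how OpenAI implements it: the role names, the `Instruction` type, the `answer_style` setting, and the `resolve` function are all hypothetical, and the sketch assumes just two sources of instructions, the developer and the user, with the developer winning any conflict.

```python
from dataclasses import dataclass

# Assumed ordering for this sketch only: a lower number wins a conflict.
ROLE_PRIORITY = {"developer": 0, "user": 1}


@dataclass(frozen=True)
class Instruction:
    role: str     # who issued the instruction ("developer" or "user")
    setting: str  # hypothetical behavior name, e.g. "answer_style"
    value: str    # requested value for that behavior


def resolve(instructions):
    """Return the winning value for each setting.

    When two instructions target the same setting, the one from the
    higher-authority role is kept. Ties keep the earlier instruction.
    """
    winners = {}
    for inst in instructions:
        current = winners.get(inst.setting)
        if current is None or ROLE_PRIORITY[inst.role] < ROLE_PRIORITY[current.role]:
            winners[inst.setting] = inst
    return {setting: inst.value for setting, inst in winners.items()}


conversation = [
    Instruction("developer", "answer_style", "guide_step_by_step"),
    Instruction("user", "answer_style", "give_final_answer"),
]
print(resolve(conversation))  # {'answer_style': 'guide_step_by_step'}
```

The user's request for a direct answer is not discarded because it is unreasonable. It simply loses to the developer's instruction on the same setting, which is the behavior the tutoring example describes.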

A conversational interface might even decline to talk about anything not approved, in order to shut down manipulation attempts before they start. Why let a cooking assistant weigh in on US involvement in the Vietnam War? Why should a customer service chatbot agree to help with your supernatural romance novel in progress? Shut it down.
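A scope restriction of this kind could look roughly like the sketch below. Everything in it is an assumption made for illustration: the keyword set, the refusal text, and the `in_scope` and `respond` helpers are hypothetical, and the crude keyword match stands in for whatever classifier or prompt-level instruction a real deployment would use.

```python
# Hypothetical allowlist for a cooking assistant (illustrative only).
APPROVED_KEYWORDS = {"recipe", "bake", "simmer", "ingredient", "oven"}
REFUSAL = "Sorry, I can only help with cooking questions."


def in_scope(message: str) -> bool:
    """Return True if the message mentions at least one approved keyword."""
    words = {word.strip(".,!?").lower() for word in message.split()}
    return bool(words & APPROVED_KEYWORDS)


def respond(message: str) -> str:
    """Refuse anything off-topic before it ever reaches the model."""
    if not in_scope(message):
        return REFUSAL
    # Placeholder: a real system would forward the message to the model here.
    return "(forward to model)"


print(respond("What was US involvement in the Vietnam War?"))
print(respond("How long should I bake this bread?"))
```

Refusing by default means an off-topic or manipulative prompt never reaches the model, at the cost of occasionally turning away a legitimate question that is phrased in unexpected words.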

Things also get sticky around privacy, such as a request for someone’s name and phone number. As OpenAI points out, a public figure like a mayor or member of Congress should obviously have their contact details provided, but what about merchants in the area? That’s probably fine. But what about employees of a particular company, or members of a political party? Probably not.

Choosing when and where to draw the line is not simple. Neither is writing the instructions that make the AI adhere to the resulting policy. And no doubt these policies will fail regularly as people learn to circumvent them or stumble on edge cases that were not accounted for.

OpenAI isn’t showing its full hand here, but it’s helpful for users and developers to see how these rules and guidelines are set and why, laid out clearly if not necessarily comprehensively.

