OpenAI offers a peek behind the curtain of AI’s secret instructions

By techtost.com | 9 May 2024 | 3 Mins Read

Ever wonder why a conversational AI like ChatGPT says “Sorry, I can’t do that” or gives some other polite refusal? OpenAI is offering a limited look at the reasoning behind its own models’ rules of engagement, whether that means sticking to brand guidelines or declining to create NSFW content.

Large language models (LLMs) have no inherent limits on what they can or will say. That is part of why they are so flexible, but also why they hallucinate and are easily fooled.

It’s necessary for any AI model that interacts with the general public to have some guardrails about what it should and shouldn’t do, but defining them — let alone enforcing them — is a surprisingly difficult task.

If someone asks an AI to generate a bunch of false claims about a public figure, it should say no, right? But what if the person asking is an AI developer building a database of synthetic disinformation to train a detector model?

What if someone asks for laptop recommendations? The answer should be objective, right? But what if the model is being deployed by a laptop manufacturer that wants it to recommend only its own devices?

All AI builders navigate conundrums like these and look for effective methods to rein in their models without forcing them to deny perfectly normal requests. But they rarely share exactly how they do it.

OpenAI bucks the trend a bit by publishing what it calls its “Model Spec,” a collection of high-level rules that indirectly govern ChatGPT and other models.

There are meta-level objectives, some hard rules, and some general behavioral guidelines, though to be clear, these are not, strictly speaking, what the model is actually primed with. OpenAI will have developed specific instructions that accomplish what these natural language rules describe.

It’s an interesting look at how a company sets its priorities and handles edge cases. And there are plenty of examples of how they might play out.

For example, OpenAI states clearly that developer intent is basically the highest law. So one version of a chatbot running GPT-4 might provide the answer to a math problem when asked. But if that chatbot has been instructed by its developer never to simply provide an answer outright, it will instead offer to work through the solution step by step:

Image Credits: OpenAI

A chat interface might even decline to talk about anything that isn’t approved, in order to shut down manipulation attempts before they start. Why let a cooking assistant weigh in on US involvement in the Vietnam War? Why should a customer service chatbot agree to help with your supernatural romance novel in progress? Shut it down.
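
As a rough illustration of the two behaviors described above (a developer rule that overrides a request for a direct answer, and a refusal to go off topic), here is a minimal Python sketch. Every name in it (`DeveloperPolicy`, `plan_response`, the topic labels) is hypothetical and invented for this example. It is not how OpenAI implements its rules, which the company has not published at that level of detail. It only shows the ordering idea: developer settings are checked before the user's request is honored.

```python
from dataclasses import dataclass, field


@dataclass
class DeveloperPolicy:
    """Hypothetical developer-level settings (not an OpenAI data structure)."""
    # An empty set means the developer has not restricted topics.
    allowed_topics: set[str] = field(default_factory=set)
    # When False, the assistant should coach instead of handing over answers.
    give_direct_answers: bool = True


def plan_response(policy: DeveloperPolicy, topic: str, wants_direct_answer: bool) -> str:
    """Return a coarse label for how the assistant should respond.

    Developer settings are checked first, so they take priority over
    what the user asked for.
    """
    if policy.allowed_topics and topic not in policy.allowed_topics:
        return "refuse_off_topic"
    if wants_direct_answer and not policy.give_direct_answers:
        return "guide_step_by_step"
    return "answer_directly"


# A default chatbot versus a math tutor configured by its developer.
default_bot = DeveloperPolicy()
tutor = DeveloperPolicy(allowed_topics={"math"}, give_direct_answers=False)

print(plan_response(default_bot, "math", wants_direct_answer=True))   # answer_directly
print(plan_response(tutor, "math", wants_direct_answer=True))         # guide_step_by_step
print(plan_response(tutor, "vietnam_war", wants_direct_answer=True))  # refuse_off_topic
```

In a real deployment the developer's wishes would typically be expressed in natural language instructions rather than flags, and the model would have to interpret them, which is exactly where the edge cases discussed in this article come from.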

Things also get sticky on privacy matters, such as a request for someone’s name and phone number. As OpenAI points out, a public figure like a mayor or member of Congress should obviously have their contact information provided, but what about the merchants in the area? That’s probably fine, but what about employees of a particular company or members of a political party? Probably not.

Choosing when and where to draw the line is not simple. Neither is creating the directives that force the AI to conform to the resulting policy. And no doubt these policies will fail all the time as people learn to work around them or accidentally find edge cases that aren’t taken into account.

OpenAI doesn’t show its full hand here, but it’s helpful for users and developers to see how these rules and guidelines are set and why, laid out clearly if not necessarily comprehensively.

bhanuprakash.cg