TechTost

OpenAI’s latest AI models have a new safeguard to prevent biorisks

By techtost.com, 17 April 2025, 3 Mins Read

OpenAI says it has developed a new system to monitor its latest AI reasoning models, o3 and o4-mini, for prompts related to biological and chemical threats. The system aims to prevent the models from offering advice that could instruct someone on carrying out potentially harmful attacks, according to OpenAI’s safety report.

o3 and o4-mini represent a meaningful increase in capability over OpenAI’s previous models, the company says, and thus pose new risks in the hands of bad actors. According to OpenAI’s internal benchmarks, o3 is more skilled at answering questions about creating certain types of biological threats in particular. For this reason, and to mitigate other risks, OpenAI created the new monitoring system, which the company describes as a “safety-focused reasoning monitor.”

The monitor, which has been custom-trained to reason about OpenAI’s content policies, runs on top of o3 and o4-mini. It is designed to identify prompts related to biological and chemical risk and to instruct the models to refuse to offer advice on those topics.
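
As a rough illustration of the pattern described above, and not OpenAI’s actual implementation, the following Python sketch places a monitor in front of a model call. The names `monitor_flags_risk`, `guarded_reply`, `echo_model`, the term list, and the refusal text are all hypothetical. A real monitor is a trained reasoning model rather than a keyword match, and the sketch simplifies "instructing the model to refuse" into returning a fixed refusal.

```python
from typing import Callable

# Hypothetical placeholder for a trained monitor: a real system would use a
# model that reasons about content policy, not a list of phrases.
RISK_TERMS = ("biological weapon", "chemical weapon")

REFUSAL = "I can't help with that request."


def monitor_flags_risk(prompt: str) -> bool:
    """Return True when the prompt contains any placeholder risk term."""
    lowered = prompt.lower()
    return any(term in lowered for term in RISK_TERMS)


def guarded_reply(prompt: str, model: Callable[[str], str]) -> str:
    """Run the monitor first; call the model only when nothing is flagged."""
    if monitor_flags_risk(prompt):
        return REFUSAL
    return model(prompt)


def echo_model(prompt: str) -> str:
    """Toy model used only to make the sketch runnable."""
    return f"echo: {prompt}"
```

The monitor sits outside the model, so in this sketch it can be updated or swapped without changing the model itself. Whether OpenAI’s system is structured that way is not stated in the report as summarized here.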

To establish a baseline, OpenAI had red teamers spend about 1,000 hours flagging “unsafe” biorisk-related conversations from o3 and o4-mini. During a test in which OpenAI simulated the “blocking logic” of its safety monitor, the models declined to respond to risky prompts 98.7% of the time, according to OpenAI.
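
To make the 98.7% figure concrete, here is a minimal sketch of how a refusal rate of this kind could be computed. The function name `refusal_rate` and the sample data are assumptions for illustration: the article does not say how many prompts were tested, so the 2,000 outcomes below are invented purely to show the arithmetic.

```python
def refusal_rate(refused: list[bool]) -> float:
    """Fraction of risky test prompts that the model declined to answer."""
    if not refused:
        raise ValueError("need at least one test prompt")
    return sum(refused) / len(refused)


# Illustrative data only: 2,000 hypothetical risky prompts, 1,974 refused.
outcomes = [True] * 1974 + [False] * 26
rate = refusal_rate(outcomes)
print(f"{rate:.1%}")  # prints 98.7%
```

At that rate, roughly 13 in every 1,000 risky prompts would still receive a response.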

OpenAI acknowledges that its test did not account for people who might try new prompts after being blocked by the monitor, which is why the company says it will continue to rely in part on human monitoring.

o3 and o4-mini do not cross OpenAI’s “high risk” threshold for biorisks, according to the company. However, compared to o1 and GPT-4, OpenAI says that early versions of o3 and o4-mini proved more helpful at answering questions about developing biological weapons.

Chart from the o3 and o4-mini system card (Screenshot: OpenAI)

The company is actively tracking how its models could make it easier for malicious users to develop chemical and biological threats, according to OpenAI’s recently updated Preparedness Framework.

OpenAI is increasingly relying on automated systems to mitigate the risks from its models. For example, to prevent GPT-4o’s image generator from creating child sexual abuse material (CSAM), OpenAI says it uses a reasoning monitor similar to the one the company developed for o3 and o4-mini.

However, several researchers have raised concerns that OpenAI is not prioritizing safety as much as it should. One of the company’s partners, Metr, said it had relatively little time to test o3 on a benchmark for deceptive behavior. Meanwhile, OpenAI has decided not to release a safety report for its GPT-4.1 model, which launched earlier this week.

bhanuprakash.cg
techtost.com

© 2026 TechTost. All Rights Reserved