Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

The biggest AI stories of the year (so far)

Travis Kalanick is launching a new company called Atoms that focuses on robotics

Founded by a father-son duo, Nyne gives AI agents the human context they’ve been missing

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    ‘It wasn’t built right the first time’ — Musk’s xAI starts again, again

    14 March 2026

    Before quantum computing arrives, this startup wants businesses that are already working on it

    13 March 2026

    How to watch Jensen Huang’s Nvidia GTC 2026 keynote

    13 March 2026

    Ford’s new AI assistant will help fleet owners know if seat belts are being used

    12 March 2026

    AI ‘Actress’ Tilly Norwood Releases Worst Song I’ve Ever Heard

    12 March 2026
  • Apps

    Digg is laying off staff and shutting down the app as well as the company’s tools

    14 March 2026

    Truecaller now lets you hang up on scammers — on behalf of your family

    13 March 2026

    Channel Surfer lets you watch YouTube like it’s old-school cable TV

    13 March 2026

    Google Maps is getting an AI ‘Ask Maps’ feature and upgraded ‘immersive’ navigation

    12 March 2026

    Google Play adds new paid and PC games, game tests, community posts and more

    12 March 2026
  • Crypto

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025

    MoviePass opens Mogul fantasy league game to the public

    29 October 2025
  • Fintech

    India neobank Fi removes banking services on its platform

    11 March 2026

    X taps William Shatner to give invitations to his payment service, X Money

    4 March 2026

    Stripe wants to turn your AI costs into a profit center

    3 March 2026

    3 days left: Save up to $680 on your ticket to Disrupt 2026

    25 February 2026

    More startups surpass $10M ARR in 3 months than ever before

    24 February 2026
  • Hardware

    Ex-Apple Engineer Raises $5M for Note-Taking Locket That Only Records Your Voice

    12 March 2026

    Canopii seems to succeed where the old indoor farms failed

    11 March 2026

    Hyperscale Power is the latest startup to challenge 140-year-old transformer technology

    10 March 2026

    Whoop is launching a new blood test focused on women’s health

    10 March 2026

    Honor says its ‘Robot phone’ with moving camera can dance to music

    8 March 2026
  • Media & Entertainment

    Spotify will let you edit your taste profile to control your recommendations

    13 March 2026

    Disney+ launches TikTok-style short-form video stream ‘Verts’

    13 March 2026

    Substack launches an embedded recording studio

    12 March 2026

    TikTok now allows Apple Music subscribers to play entire songs without leaving the app

    12 March 2026

    WordPress debuts a private workspace that runs in your browser via a new service, my.WordPress.net

    11 March 2026
  • Security

    Law enforcement shuts down botnet consisting of tens of thousands of hacked routers

    12 March 2026

    The pro-Iranian hacktivist group says it is behind the attack on medical technology giant Stryker

    12 March 2026

    Salt Typhoon hacks the world’s phone and internet giants — here’s where they’ve been hit

    11 March 2026

    DOGE employee stole Social Security data and thumbed it, report says

    11 March 2026

    US military contractor likely built iPhone hacking tools used by Russian spies in Ukraine

    10 March 2026
  • Startups

    The biggest AI stories of the year (so far)

    14 March 2026

    Chinese brain interface startup Gestala raises $21 million just two months after launching

    13 March 2026

    Sales automation startup Rox AI hits $1.2 billion valuation, sources say

    13 March 2026

    When startups become a family business

    12 March 2026

    Ride-hailing inDrive acquires Pakistan’s Krave Mart to boost grocery delivery

    12 March 2026
  • Transportation

    Travis Kalanick is launching a new company called Atoms that focuses on robotics

    14 March 2026

    Kinetic robotics joins Uber’s Vegas app two years after major reset

    13 March 2026

    Why Rivian is holding onto the $45,000 R2 base model until ‘late 2027’

    13 March 2026

    Group14 opens factory to produce flash charge battery materials for EVs

    12 March 2026

    Nuro is testing its autonomous vehicle technology on the streets of Tokyo

    12 March 2026
  • Venture

    Founded by a father-son duo, Nyne gives AI agents the human context they’ve been missing

    14 March 2026

    Gumloop gets $50M from Benchmark to turn every worker into an AI agent builder

    13 March 2026

    This SpaceX Veteran Says The Next Big Thing In Space Is Satellites Returning To Earth

    10 March 2026

    Founders Fund is approaching $6 billion for its latest growth fund, sources say

    10 March 2026

    Robinhood’s startup fund stumbles in its NYSE debut

    7 March 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»One of Google’s recent Gemini AI models rates worse in security
AI

One of Google’s recent Gemini AI models rates worse in security

techtost.comBy techtost.com2 May 202503 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
One Of Google's Recent Gemini Ai Models Rates Worse In
Share
Facebook Twitter LinkedIn Pinterest Email

A recently released Google AI model, worse in some security tests than its predecessor, according to the company’s internal comparative assessment.

To one technical report Published this week, Google reveals that the Flash Gemini 2.5 model is more likely to create text that violates security instructions from Gemini 2.0 Flash. In two measurements, “text security in text” and “text security in text”, Gemini 2.5 Flash Reges 4.1% and 9.6% respectively.

The text in text counts how often a model violates Google’s instructions given the exhortation, while image security in text assesses how closely the model clings to these limits when requested by using an image. Both tests are automated, not anthropogenic.

In an e -mail statement, a Google spokesman confirmed that Gemini 2.5 Flash “worsens worse in text security in text and image in text”.

These amazing reference results come as AI companies move to make their models more permissible – in other words, less likely to refuse to respond to controversial or sensitive issues. For the latest harvest of Lama models, Meta said it is coordinating models that do not support “some views for others” and respond to more “discussed” political prompts. Openai said earlier this year that it would modify future models so as not to take a editorial stance and provide multiple perspectives on controversial issues.

Sometimes these efforts to be permanent have been restored. TechCrunch said Monday that the default model supplying Openai’s chatgpt allowed minors to create erotic conversations. Openai blamed the behavior of a “error”.

According to Google’s technical report, the Gemini 2.5 Flash, which is still in the preview, follows the instructions more faithfully than the Gemini 2.0 Flash, including the instructions that cross problematic lines. The company claims that regressions can be partially attributed to false positives, but also admits that Gemini 2.5 Flash sometimes creates “violated content” when explicitly requested.

TechCrunch event

Berkeley, ca
|
June 5

Book now

“Of course there is tension between [instruction following] On sensitive issues and violations of security policy, which are reflected in all our evaluations, “the report said.

The scores from SpeechMap, a reference point that detects how models respond to sensitive and controversial prompts, also suggest that Gemini 2.5 Flash is much less likely to refuse to answer questionable questions from Flash Gemini 2.0. Testing the model by TechCrunch via the AI ​​Openrouter platform has found that it would write unexpected essays to support the replacement of human judges with AI, weakening the protection of procedures in the US and implementing widely widespread government programs.

Thomas Woodside, co -founder of the Secure AI Project, said that Google’s limited details in his technical report prove the need for greater transparency in model tests.

“There is a compromise between monitoring of teaching and the policy that follows, because some users can request content that would violate policies,” Woodside told TechCrunch. “In this case, Google’s latest Flash model complies with instructions more, while violating policies more. Google does not provide much details about the specific cases where policies were violated, although they say they are not serious.

Google has undergone a fire on the pre -model security reports.

It took weeks of the company to publish a technical report for its most capable model, Gemini 2.5 Pro. When the report was finally published, initially skip the security test details.

On Monday, Google published a more detailed report with additional security information.

Gemini Google Googles models rates security worse
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleApple approves the Spotify app that allows our users to access pricing information, external payment links
Next Article Aurora inaugurates the Driver’s non -driver truck service and a surprise candidate joins the bankruptcy of Canoo
bhanuprakash.cg
techtost.com
  • Website

Related Posts

‘It wasn’t built right the first time’ — Musk’s xAI starts again, again

14 March 2026

Before quantum computing arrives, this startup wants businesses that are already working on it

13 March 2026

How to watch Jensen Huang’s Nvidia GTC 2026 keynote

13 March 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

The biggest AI stories of the year (so far)

14 March 2026

Travis Kalanick is launching a new company called Atoms that focuses on robotics

14 March 2026

Founded by a father-son duo, Nyne gives AI agents the human context they’ve been missing

14 March 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

India neobank Fi removes banking services on its platform

11 March 2026

X taps William Shatner to give invitations to his payment service, X Money

4 March 2026

Stripe wants to turn your AI costs into a profit center

3 March 2026
Startups

The biggest AI stories of the year (so far)

Chinese brain interface startup Gestala raises $21 million just two months after launching

Sales automation startup Rox AI hits $1.2 billion valuation, sources say

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.