Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Exaforce Raises $125M Series B to Build AI to Catch and Stop Cyberattacks as They Happen

Potholes are costing cities millions: This company uses artificial intelligence and trucks to fix them

Anthropic warns investors against secondary platforms offering access to its shares

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Medicare’s new payment model is designed for artificial intelligence, and most of the tech world has no idea

    13 May 2026

    Dessn raises $6 million for production-focused design tool

    12 May 2026

    Riding on an AI rally, Robinhood is preparing its second retail IPO

    12 May 2026

    There aren’t enough rockets for space data centers. Cowboy Space raised $275 million to build them.

    11 May 2026

    We’re feeling cynical about xAI’s big deal with Anthropic

    11 May 2026
  • Apps

    Everything Google announced at its Android Expo, from Googlebooks to vibe-encoded widgets

    13 May 2026

    TikTok now wants to be the place where you book that trip you just saw on TikTok

    12 May 2026

    Discord Launches Nitro Rewards, Giving Subscribers Access to Xbox Game Pass Base Level at No Extra Cost

    11 May 2026

    Etsy launches its ChatGPT app as it continues its AI push

    10 May 2026

    Tinder Match Group owner slows hiring to pay for increased use of AI tools

    10 May 2026
  • Crypto

    As crypto cools, a16z crypto raises $2.2 billion in capital

    6 May 2026

    Coinbase to lay off 14% of staff as part of broader restructuring

    5 May 2026

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025
  • Fintech

    Venmo’s biggest makeover in years comes at a very interesting time

    11 May 2026

    Fintech startup Parker files for bankruptcy

    10 May 2026

    Robinhood’s venture fund IPO attracted 150,000+ private investors, CEO says

    7 May 2026

    PayPal says it’s “becoming a tech company again” — that’s AI

    6 May 2026

    Stripe introduces Link, a digital wallet that autonomous AI agents can also use

    1 May 2026
  • Hardware

    Google unveils Googlebook, a new line of laptops with native artificial intelligence

    13 May 2026

    The Instax Wide 400 takes the simplicity of instant photography and expands it, literally

    10 May 2026

    Google Unveils Fitbit Air Without Whoop-like Display

    8 May 2026

    Google’s $9.99 per month AI health plan launches on May 19

    8 May 2026

    Apple to pay $250 million to settle lawsuit over Siri’s lagging AI features

    7 May 2026
  • Media & Entertainment

    Digg is trying again, this time as an AI news aggregator

    12 May 2026

    Bravo creates unscripted mini-dramas for the Peacock app

    11 May 2026

    The hottest place for startups to strike a deal? The F1 mantra

    10 May 2026

    Netflix delays Greta Gerwig’s ‘Narnia’ for big theatrical push to 2027

    2 May 2026

    Roku’s $3 streaming service Howdy hits 1 million subscribers, per recent report

    29 April 2026
  • Security

    Exaforce Raises $125M Series B to Build AI to Catch and Stop Cyberattacks as They Happen

    13 May 2026

    Google launches new Android security feature to help uncover spyware attacks

    12 May 2026

    US healthcare marketplaces shared citizenship and race data with ad tech giants

    11 May 2026

    Some kids bypass age verification checks with a fake moustache

    10 May 2026

    Police arrest crew that sent malicious messages to thousands across Toronto

    10 May 2026
  • Startups

    Korea’s biggest manufacturers support Config, TSMC robot data

    11 May 2026

    China’s Moonshot AI Raises $2B in $20B Valuation as Demand for Open Source AI Soars

    10 May 2026

    Could Lovable’s automatic 10% pay rise be the cure for toxic cultures?

    9 May 2026

    Gusto hits $1 billion in revenue, moves closer to public markets

    9 May 2026

    Learn what it takes to raise a Series A in 2027 at Disrupt 2026

    8 May 2026
  • Transportation

    Potholes are costing cities millions: This company uses artificial intelligence and trucks to fix them

    13 May 2026

    Waymo issues recall to address a flooding issue

    12 May 2026

    GM just laid off hundreds of IT workers to hire people with stronger AI skills

    12 May 2026

    TechCrunch Mobility: Lime’s IPO bet

    11 May 2026

    Uber always wanted to be more than a ride. now he has reason to hurry

    11 May 2026
  • Venture

    Anthropic warns investors against secondary platforms offering access to its shares

    13 May 2026

    Mother Ventures looks at moms as the ‘economic engine’

    9 May 2026

    2 days left: Get 50% off a second Disrupt 2026 pass

    7 May 2026

    All your M&A questions will be answered at Disrupt 2026

    6 May 2026

    ElevenLabs lists BlackRock, Jamie Foxx and Eva Longoria as new investors

    6 May 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»One of Google’s recent Gemini AI models rates worse in security
AI

One of Google’s recent Gemini AI models rates worse in security

techtost.comBy techtost.com2 May 202503 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
One Of Google's Recent Gemini Ai Models Rates Worse In
Share
Facebook Twitter LinkedIn Pinterest Email

A recently released Google AI model, worse in some security tests than its predecessor, according to the company’s internal comparative assessment.

To one technical report Published this week, Google reveals that the Flash Gemini 2.5 model is more likely to create text that violates security instructions from Gemini 2.0 Flash. In two measurements, “text security in text” and “text security in text”, Gemini 2.5 Flash Reges 4.1% and 9.6% respectively.

The text in text counts how often a model violates Google’s instructions given the exhortation, while image security in text assesses how closely the model clings to these limits when requested by using an image. Both tests are automated, not anthropogenic.

In an e -mail statement, a Google spokesman confirmed that Gemini 2.5 Flash “worsens worse in text security in text and image in text”.

These amazing reference results come as AI companies move to make their models more permissible – in other words, less likely to refuse to respond to controversial or sensitive issues. For the latest harvest of Lama models, Meta said it is coordinating models that do not support “some views for others” and respond to more “discussed” political prompts. Openai said earlier this year that it would modify future models so as not to take a editorial stance and provide multiple perspectives on controversial issues.

Sometimes these efforts to be permanent have been restored. TechCrunch said Monday that the default model supplying Openai’s chatgpt allowed minors to create erotic conversations. Openai blamed the behavior of a “error”.

According to Google’s technical report, the Gemini 2.5 Flash, which is still in the preview, follows the instructions more faithfully than the Gemini 2.0 Flash, including the instructions that cross problematic lines. The company claims that regressions can be partially attributed to false positives, but also admits that Gemini 2.5 Flash sometimes creates “violated content” when explicitly requested.

TechCrunch event

Berkeley, ca
|
June 5

Book now

“Of course there is tension between [instruction following] On sensitive issues and violations of security policy, which are reflected in all our evaluations, “the report said.

The scores from SpeechMap, a reference point that detects how models respond to sensitive and controversial prompts, also suggest that Gemini 2.5 Flash is much less likely to refuse to answer questionable questions from Flash Gemini 2.0. Testing the model by TechCrunch via the AI ​​Openrouter platform has found that it would write unexpected essays to support the replacement of human judges with AI, weakening the protection of procedures in the US and implementing widely widespread government programs.

Thomas Woodside, co -founder of the Secure AI Project, said that Google’s limited details in his technical report prove the need for greater transparency in model tests.

“There is a compromise between monitoring of teaching and the policy that follows, because some users can request content that would violate policies,” Woodside told TechCrunch. “In this case, Google’s latest Flash model complies with instructions more, while violating policies more. Google does not provide much details about the specific cases where policies were violated, although they say they are not serious.

Google has undergone a fire on the pre -model security reports.

It took weeks of the company to publish a technical report for its most capable model, Gemini 2.5 Pro. When the report was finally published, initially skip the security test details.

On Monday, Google published a more detailed report with additional security information.

Gemini Google Googles models rates security worse
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleApple approves the Spotify app that allows our users to access pricing information, external payment links
Next Article Aurora inaugurates the Driver’s non -driver truck service and a surprise candidate joins the bankruptcy of Canoo
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Medicare’s new payment model is designed for artificial intelligence, and most of the tech world has no idea

13 May 2026

Everything Google announced at its Android Expo, from Googlebooks to vibe-encoded widgets

13 May 2026

Google unveils Googlebook, a new line of laptops with native artificial intelligence

13 May 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Exaforce Raises $125M Series B to Build AI to Catch and Stop Cyberattacks as They Happen

13 May 2026

Potholes are costing cities millions: This company uses artificial intelligence and trucks to fix them

13 May 2026

Anthropic warns investors against secondary platforms offering access to its shares

13 May 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Venmo’s biggest makeover in years comes at a very interesting time

11 May 2026

Fintech startup Parker files for bankruptcy

10 May 2026

Robinhood’s venture fund IPO attracted 150,000+ private investors, CEO says

7 May 2026
Startups

Korea’s biggest manufacturers support Config, TSMC robot data

China’s Moonshot AI Raises $2B in $20B Valuation as Demand for Open Source AI Soars

Could Lovable’s automatic 10% pay rise be the cure for toxic cultures?

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.