Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Cyberdecks are having a moment, rejecting big tech surveillance with style and substance

A startup, Everand, is now bringing together e-books, audiobooks and book clubs as a challenge to Amazon

Password manager Dashlane says hackers stole some customers’ password vaults

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Anthropic scales Claude Mythos to critical infrastructure in 15+ countries

    2 June 2026

    Florida sues OpenAI’s Sam Altman in first-of-its-kind violent crime lawsuit

    2 June 2026

    The internet is being remade for machines

    1 June 2026

    Understanding the AI ​​psychosis debate

    31 May 2026

    ‘What a joke’: Github Copilot’s new token-based pricing upsets developers

    31 May 2026
  • Apps

    Meta is testing ‘Series’ for episodic Reels on Instagram and Facebook

    2 June 2026

    A new app, The Mall, creates a universal flow for online shopping

    2 June 2026

    DuckDuckGo makes its ‘AI-free’ search engine easier to access as traffic grows

    1 June 2026

    TikTok’s road to becoming a super app

    31 May 2026

    YouTube adds new podcast features, including an AI recommendation tool and ‘Auto Speed’

    30 May 2026
  • Crypto

    Startup Battlefield 200 applications close today

    27 May 2026

    5 days left: Save up to $410 on Disrupt 2026 passes

    25 May 2026

    As crypto cools, a16z crypto raises $2.2 billion in capital

    6 May 2026

    Coinbase to lay off 14% of staff as part of broader restructuring

    5 May 2026

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026
  • Fintech

    Last 24 hours to save up to $410 on your Disrupt 2026 ticket

    29 May 2026

    2 days left: Lock in up to $410 in ticket savings for Disrupt 2026

    28 May 2026

    Robinhood now allows your AI agents to trade stocks

    28 May 2026

    Disrupt 2026 Early Bird ticket savings expire in 3 days

    27 May 2026

    Disrupt 2026 Early Bird ticket prices end May 29

    26 May 2026
  • Hardware

    Cyberdecks are having a moment, rejecting big tech surveillance with style and substance

    3 June 2026

    Nvidia chases $200 billion CPU market with AI agent computing from Microsoft, Dell and HP

    2 June 2026

    This $300 Pizza Oven Can Easily Help Revive Your Summer Pizza Nights

    30 May 2026

    Kiwibit’s artificial intelligence bird feeder is my new backyard friend

    29 May 2026

    Vertu wants CEOs to run companies from a foldable AI starting at $6,880

    29 May 2026
  • Media & Entertainment

    A startup, Everand, is now bringing together e-books, audiobooks and book clubs as a challenge to Amazon

    2 June 2026

    The two biggest movies of this weekend were both directed by YouTubers

    31 May 2026

    The two biggest movies of this weekend were both directed by YouTubers

    30 May 2026

    YouTube will automatically flag videos with artificial intelligence

    28 May 2026

    Meta launches Instagram, Facebook and WhatsApp subscriptions, with more to follow, including AI plans

    27 May 2026
  • Security

    Password manager Dashlane says hackers stole some customers’ password vaults

    2 June 2026

    Hackers took over Instagram accounts by tricking the Meta AI support chatbot into granting access

    1 June 2026

    Iranian hackers blamed for breach of Los Angeles transit system that took weeks to recover

    30 May 2026

    Microsoft is under fire for threatening a security researcher with a criminal investigation

    29 May 2026

    A security flaw in prison payphone service Pay Tel exposed publicly the driver’s licenses of more than 300,000 callers

    29 May 2026
  • Startups

    Board, the new gaming startup from Mirror founder Brynn Putnam, raises $20 million, has already sold thousands

    2 June 2026

    From Stage to Future: Where Are Startup Battlefield Alumni Now?

    2 June 2026

    Revolut offers service to thousands of users in India ahead of wider rollout

    1 June 2026

    The deadline to submit applications for the Startup Battlefield 200 has been extended to June 8

    30 May 2026

    H1 secures $40M from CVS, proving SaaS startups can still attract investment

    30 May 2026
  • Transportation

    Defense tech darling Mach Industries hits $1.8 billion valuation, 4x jump in one year

    2 June 2026

    SpaceX says it may issue ‘significant’ equity in ‘future transactions’

    1 June 2026

    TechCrunch Mobility: It doesn’t matter that people hate the Ferrari Luce

    31 May 2026

    Rivian is under investigation for rear suspension failures on R1 models

    30 May 2026

    Waymo’s newest robotaxi is Chinese-made, built to make money, and is now accepting riders

    30 May 2026
  • Venture

    How Europe’s AI strategy diverges from Silicon Valley’s

    2 June 2026

    How to make the Startup Battlefield Top 20 — and what each company gets regardless

    2 June 2026

    Black founders raise highest quarterly funding since 2022, but there’s a catch

    31 May 2026

    Snap alums reveal Ghost Angels fund

    31 May 2026

    The groupthink explosion: what three top VCs really think about the AI ​​frenzy

    30 May 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»One of Google’s recent Gemini AI models rates worse in security
AI

One of Google’s recent Gemini AI models rates worse in security

techtost.comBy techtost.com2 May 202503 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
One Of Google's Recent Gemini Ai Models Rates Worse In
Share
Facebook Twitter LinkedIn Pinterest Email

A recently released Google AI model, worse in some security tests than its predecessor, according to the company’s internal comparative assessment.

To one technical report Published this week, Google reveals that the Flash Gemini 2.5 model is more likely to create text that violates security instructions from Gemini 2.0 Flash. In two measurements, “text security in text” and “text security in text”, Gemini 2.5 Flash Reges 4.1% and 9.6% respectively.

The text in text counts how often a model violates Google’s instructions given the exhortation, while image security in text assesses how closely the model clings to these limits when requested by using an image. Both tests are automated, not anthropogenic.

In an e -mail statement, a Google spokesman confirmed that Gemini 2.5 Flash “worsens worse in text security in text and image in text”.

These amazing reference results come as AI companies move to make their models more permissible – in other words, less likely to refuse to respond to controversial or sensitive issues. For the latest harvest of Lama models, Meta said it is coordinating models that do not support “some views for others” and respond to more “discussed” political prompts. Openai said earlier this year that it would modify future models so as not to take a editorial stance and provide multiple perspectives on controversial issues.

Sometimes these efforts to be permanent have been restored. TechCrunch said Monday that the default model supplying Openai’s chatgpt allowed minors to create erotic conversations. Openai blamed the behavior of a “error”.

According to Google’s technical report, the Gemini 2.5 Flash, which is still in the preview, follows the instructions more faithfully than the Gemini 2.0 Flash, including the instructions that cross problematic lines. The company claims that regressions can be partially attributed to false positives, but also admits that Gemini 2.5 Flash sometimes creates “violated content” when explicitly requested.

TechCrunch event

Berkeley, ca
|
June 5

Book now

“Of course there is tension between [instruction following] On sensitive issues and violations of security policy, which are reflected in all our evaluations, “the report said.

The scores from SpeechMap, a reference point that detects how models respond to sensitive and controversial prompts, also suggest that Gemini 2.5 Flash is much less likely to refuse to answer questionable questions from Flash Gemini 2.0. Testing the model by TechCrunch via the AI ​​Openrouter platform has found that it would write unexpected essays to support the replacement of human judges with AI, weakening the protection of procedures in the US and implementing widely widespread government programs.

Thomas Woodside, co -founder of the Secure AI Project, said that Google’s limited details in his technical report prove the need for greater transparency in model tests.

“There is a compromise between monitoring of teaching and the policy that follows, because some users can request content that would violate policies,” Woodside told TechCrunch. “In this case, Google’s latest Flash model complies with instructions more, while violating policies more. Google does not provide much details about the specific cases where policies were violated, although they say they are not serious.

Google has undergone a fire on the pre -model security reports.

It took weeks of the company to publish a technical report for its most capable model, Gemini 2.5 Pro. When the report was finally published, initially skip the security test details.

On Monday, Google published a more detailed report with additional security information.

Gemini Google Googles models rates security worse
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleApple approves the Spotify app that allows our users to access pricing information, external payment links
Next Article Aurora inaugurates the Driver’s non -driver truck service and a surprise candidate joins the bankruptcy of Canoo
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Anthropic scales Claude Mythos to critical infrastructure in 15+ countries

2 June 2026

Florida sues OpenAI’s Sam Altman in first-of-its-kind violent crime lawsuit

2 June 2026

The internet is being remade for machines

1 June 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Cyberdecks are having a moment, rejecting big tech surveillance with style and substance

3 June 2026

A startup, Everand, is now bringing together e-books, audiobooks and book clubs as a challenge to Amazon

2 June 2026

Password manager Dashlane says hackers stole some customers’ password vaults

2 June 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Last 24 hours to save up to $410 on your Disrupt 2026 ticket

29 May 2026

2 days left: Lock in up to $410 in ticket savings for Disrupt 2026

28 May 2026

Robinhood now allows your AI agents to trade stocks

28 May 2026
Startups

Board, the new gaming startup from Mirror founder Brynn Putnam, raises $20 million, has already sold thousands

From Stage to Future: Where Are Startup Battlefield Alumni Now?

Revolut offers service to thousands of users in India ahead of wider rollout

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.