Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Facebook makes it easy for creators to report copycats

The biggest AI stories of the year (so far)

Travis Kalanick is launching a new company called Atoms that focuses on robotics

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    ‘It wasn’t built right the first time’ — Musk’s xAI starts again, again

    14 March 2026

    Before quantum computing arrives, this startup wants businesses that are already working on it

    13 March 2026

    How to watch Jensen Huang’s Nvidia GTC 2026 keynote

    13 March 2026

    Ford’s new AI assistant will help fleet owners know if seat belts are being used

    12 March 2026

    AI ‘Actress’ Tilly Norwood Releases Worst Song I’ve Ever Heard

    12 March 2026
  • Apps

    Digg is laying off staff and shutting down the app as well as the company’s tools

    14 March 2026

    Truecaller now lets you hang up on scammers — on behalf of your family

    13 March 2026

    Channel Surfer lets you watch YouTube like it’s old-school cable TV

    13 March 2026

    Google Maps is getting an AI ‘Ask Maps’ feature and upgraded ‘immersive’ navigation

    12 March 2026

    Google Play adds new paid and PC games, game tests, community posts and more

    12 March 2026
  • Crypto

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025

    MoviePass opens Mogul fantasy league game to the public

    29 October 2025
  • Fintech

    India neobank Fi removes banking services on its platform

    11 March 2026

    X taps William Shatner to give invitations to his payment service, X Money

    4 March 2026

    Stripe wants to turn your AI costs into a profit center

    3 March 2026

    3 days left: Save up to $680 on your ticket to Disrupt 2026

    25 February 2026

    More startups surpass $10M ARR in 3 months than ever before

    24 February 2026
  • Hardware

    Ex-Apple Engineer Raises $5M for Note-Taking Locket That Only Records Your Voice

    12 March 2026

    Canopii seems to succeed where the old indoor farms failed

    11 March 2026

    Hyperscale Power is the latest startup to challenge 140-year-old transformer technology

    10 March 2026

    Whoop is launching a new blood test focused on women’s health

    10 March 2026

    Honor says its ‘Robot phone’ with moving camera can dance to music

    8 March 2026
  • Media & Entertainment

    Facebook makes it easy for creators to report copycats

    14 March 2026

    Spotify will let you edit your taste profile to control your recommendations

    13 March 2026

    Disney+ launches TikTok-style short-form video stream ‘Verts’

    13 March 2026

    Substack launches an embedded recording studio

    12 March 2026

    TikTok now allows Apple Music subscribers to play entire songs without leaving the app

    12 March 2026
  • Security

    Law enforcement shuts down botnet consisting of tens of thousands of hacked routers

    12 March 2026

    The pro-Iranian hacktivist group says it is behind the attack on medical technology giant Stryker

    12 March 2026

    Salt Typhoon hacks the world’s phone and internet giants — here’s where they’ve been hit

    11 March 2026

    DOGE employee stole Social Security data and thumbed it, report says

    11 March 2026

    US military contractor likely built iPhone hacking tools used by Russian spies in Ukraine

    10 March 2026
  • Startups

    The biggest AI stories of the year (so far)

    14 March 2026

    Chinese brain interface startup Gestala raises $21 million just two months after launching

    13 March 2026

    Sales automation startup Rox AI hits $1.2 billion valuation, sources say

    13 March 2026

    When startups become a family business

    12 March 2026

    Ride-hailing inDrive acquires Pakistan’s Krave Mart to boost grocery delivery

    12 March 2026
  • Transportation

    Travis Kalanick is launching a new company called Atoms that focuses on robotics

    14 March 2026

    Kinetic robotics joins Uber’s Vegas app two years after major reset

    13 March 2026

    Why Rivian is holding onto the $45,000 R2 base model until ‘late 2027’

    13 March 2026

    Group14 opens factory to produce flash charge battery materials for EVs

    12 March 2026

    Nuro is testing its autonomous vehicle technology on the streets of Tokyo

    12 March 2026
  • Venture

    Founded by a father-son duo, Nyne gives AI agents the human context they’ve been missing

    14 March 2026

    Gumloop gets $50M from Benchmark to turn every worker into an AI agent builder

    13 March 2026

    This SpaceX Veteran Says The Next Big Thing In Space Is Satellites Returning To Earth

    10 March 2026

    Founders Fund is approaching $6 billion for its latest growth fund, sources say

    10 March 2026

    Robinhood’s startup fund stumbles in its NYSE debut

    7 March 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»Google launches “Implied Temporary Storage” to access the latest AI models cheaper AI models
AI

Google launches “Implied Temporary Storage” to access the latest AI models cheaper AI models

techtost.comBy techtost.com11 May 202503 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Google Launches "implied Temporary Storage" To Access The Latest Ai
Share
Facebook Twitter LinkedIn Pinterest Email

Google is developing a feature in the Gemini API that the company claims to make the latest AI models cheaper for third -party developers.

Google calls “Implicit Caching” and says it can deliver a 75% savings to the “recurring frame” voted on models via API Gemini. It supports Google Google 2.5 Pro and 2.5 Flash models.

This is likely to be welcome news to developers, as the cost of using Frontier models continues to increase.

Simply send implicit temporary storage to API Gemini, automatically allowing a 75% cost savings with Gemini 2.5 models when your request hits a cache 🚢

We also reduced the min token needed to hit the hidden memories to 1K at 2.5 flash and 2k to 2.5 Pro!

– Logan Kilpatrick (@OficialLogank) 8 May 2025

Temporary storage, a widely adopted practice in the AI ​​industry, frequently reuse or pre-calculated data from models to reduce the requirements and costs of computer. For example, memories can store answers to questions that users often ask for a model, eliminating the need for the model to recreate the answers to the same request.

Google previously offered temporary storage of a model but only clear Timely temporary storage, which means that devs had to determine their high frequency prompts. While cost savings are supposed to be guaranteed, the explicit exhorting temporary storage usually included a lot of manual work.

Some developers were not happy with how the explicit implementation of Google temporary temporary storage worked for the Gemini 2.5 Pro, which stated that it could cause surprisingly large API accounts. Complaints arrived in a fever last week, Urging the Gemini group to apologize and is committed to making changes.

Unlike explicit temporary storage, implicit temporary storage is automatic. Enabled by default for Gemini 2.5 models, it transmits cost savings if an API Gemini request on a model hits a cache.

TechCrunch event

Berkeley, ca
|
June 5

Book now

“[W]If you send a request to one of the Gemini 2.5 models, if the request shares a common prefix as one of the previous requests, then it is eligible for a cache, “Google explained to a blog. “We will dynamically pass the cost savings back to you.”

The number of minimum prompts for implicit temporary storage is 1,024 for 2.5 flash and 2.048 for 2.5 Pro, According to the documentation of Google developerwhich is not a terribly large amount, which means that it should not be needed much to activate these automatic savings. The brands are the raw pieces of data models with a thousand tokens equivalent to about 750 words.

Since Google’s latest claims to save cost from temporary storage have ran afoul, there are some buyer-visual areas in this new feature. For one, Google recommends developers to maintain a recurring framework at the beginning of the requests to increase the chances of implicit cache. The framework that can change from request to request must be annexed at the end, the company says.

Once again, Google has offered no third party verification that the new implicit temporary storage system would deliver the promised automatic savings. So we have to see what the first adopters say.

access cheaper Gemini Google implied latest Launches models storage temporary
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleGoogle unravels AI tools to protect Chrome users from scams
Next Article Greek revival you don’t watch (but most likely to be)
bhanuprakash.cg
techtost.com
  • Website

Related Posts

‘It wasn’t built right the first time’ — Musk’s xAI starts again, again

14 March 2026

Before quantum computing arrives, this startup wants businesses that are already working on it

13 March 2026

Disney+ launches TikTok-style short-form video stream ‘Verts’

13 March 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Facebook makes it easy for creators to report copycats

14 March 2026

The biggest AI stories of the year (so far)

14 March 2026

Travis Kalanick is launching a new company called Atoms that focuses on robotics

14 March 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

India neobank Fi removes banking services on its platform

11 March 2026

X taps William Shatner to give invitations to his payment service, X Money

4 March 2026

Stripe wants to turn your AI costs into a profit center

3 March 2026
Startups

The biggest AI stories of the year (so far)

Chinese brain interface startup Gestala raises $21 million just two months after launching

Sales automation startup Rox AI hits $1.2 billion valuation, sources say

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.