Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Largest orbital computing cluster is open for business

Roblox introduces ‘Kids’ and ‘Select’ accounts for age-appropriate access to games and chats

TechCrunch Mobility: Who’s chasing all the self-driving talent?

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Largest orbital computing cluster is open for business

    13 April 2026

    Anthropic restricts Mythos traffic to protect the Internet — or does Anthropic?

    12 April 2026

    Sam Altman responds to ‘inflammatory’ New Yorker article after his home was attacked

    12 April 2026

    Stalking victim sues OpenAI, claims ChatGPT fueled her abuser’s delusions and ignored her warnings

    11 April 2026

    Anthropic has temporarily banned the creator of OpenClaw from accessing Claude

    11 April 2026
  • Apps

    Roblox introduces ‘Kids’ and ‘Select’ accounts for age-appropriate access to games and chats

    13 April 2026

    You can now edit your comments on Instagram

    13 April 2026

    Meta AI app climbs to No. 5 in App Store after release of Muse Spark

    12 April 2026

    StubHub to pay $10 million to settle FTC claims of ‘deceptive’ ticket pricing

    12 April 2026

    PSA: If you use the Meta AI app, your friends will find out and it will be embarrassing

    11 April 2026
  • Crypto

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025
  • Fintech

    Cash app launches ‘pay later’ feature for P2P transfers

    3 April 2026

    Doss raises $55 million for AI inventory management that connects to ERP

    24 March 2026

    Despite stiff competition, Kalshi, Polymarket CEOs back $35m VC fund projections

    23 March 2026

    Amid legal turmoil, Kalshi is temporarily banned in Nevada

    20 March 2026

    Nominations for the Startup Battlefield 200 are still open

    19 March 2026
  • Hardware

    Amazon is ending support for older Kindle devices

    9 April 2026

    Intel signs Elon Musk’s Terafab chip project

    8 April 2026

    The Xiaomi 17 Ultra has some impressive extras that make taking photos really fun

    6 April 2026

    In Japan, the robot doesn’t come for your job. fills the one no one wants

    6 April 2026

    Peter Thiel’s big bet on solar-powered cow collars

    5 April 2026
  • Media & Entertainment

    X says he’s reducing payouts to clickbait accounts

    12 April 2026

    TechCrunch is headed to Tokyo — and it’s bringing the Startup Battlefield with it

    10 April 2026

    Spotify now allows everyone to turn off videos in its app

    9 April 2026

    As YouTube expands into TV, it sees more interactive video across all formats

    9 April 2026

    Tubi is the first streamer to launch a native app on ChatGPT

    8 April 2026
  • Security

    Convicted spyware maker Bryan Fleming avoids jail time on conviction

    12 April 2026

    The Trump administration plans to cut the cybersecurity agency’s budget by $700 million

    11 April 2026

    Russian government hackers broke into thousands of home routers to steal passwords

    11 April 2026

    France to abandon Windows for Linux to reduce dependence on US technology

    10 April 2026

    VeraCrypt encryption software developer says Windows users may experience startup problems after Microsoft shuts down its account

    10 April 2026
  • Startups

    Walmart-owned Flipkart, Amazon are squeezing India’s e-commerce startups

    12 April 2026

    This founder helped build SpaceX’s most powerful rocket engine. Now he’s building a “fighter for orbit.”

    12 April 2026

    Sierra’s Bret Taylor says the era of button-clicking is over

    11 April 2026

    After the data breach, the $10 billion startup Mercor is one month old

    11 April 2026

    What founders can learn from Anjuna’s layoffs and recovery

    10 April 2026
  • Transportation

    TechCrunch Mobility: Who’s chasing all the self-driving talent?

    13 April 2026

    Slate Auto: Everything you need to know about the Bezos-backed EV startup

    12 April 2026

    Battery recycling company Ascend Elements files for bankruptcy

    11 April 2026

    Volkswagen begins testing its self-driving minibuses in Los Angeles ahead of launch with Uber

    10 April 2026

    Volkswagen is dropping the all-electric ID.4 in the U.S

    10 April 2026
  • Venture

    Nvidia-backed SiFive hits $3.65 billion valuation for open AI chips

    11 April 2026

    How to make the Startup Battlefield Top 20 — and what each company gets regardless

    10 April 2026

    Collide Capital Raises $95M to Back Future-of-Work Fintech Startups

    9 April 2026

    VC Eclipse has a new $1.3 billion fund to back — and build — “natural AI” startups

    8 April 2026

    The AI ​​gold rush is pulling private wealth into riskier, older bets

    7 April 2026
  • Recommended Essentials
TechTost
You are at:Home»Venture»Startup Gimlet Labs solves the AI ​​inference problem in a surprisingly elegant way
Venture

Startup Gimlet Labs solves the AI ​​inference problem in a surprisingly elegant way

techtost.comBy techtost.com24 March 202604 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Startup Gimlet Labs Solves The Ai ​​inference Problem In A
Share
Facebook Twitter LinkedIn Pinterest Email

Stanford adjunct professor and retired founder Zain Asgar just raised an $80 million Series A round for a startup that solves the AI ​​inference bottleneck in a smart way. The round was led by Menlo Ventures.

The company, Gimlet Labshas created what it claims is the first and only “multi-silicon inference cloud,” which is software that allows an AI workload to run simultaneously on several types of hardware. It can split the work of an AI application across both traditional CPUs and AI-tuned GPUs, as well as high-memory systems.

“We’re basically dealing with any different hardware that’s available,” Asgar told TechCrunch.

A single agent can combine multiple steps, and each “requires different hardware: Inference is compute-bound, decoding is memory-bound, and tool calls are network-bound,” lead investor Tim Tully of Menlo writes in a blog post about the funding.

No one chip does it all yet, but as new hardware comes out and old GPUs are retooled, “the multi-silicon fleet is ready — it just lacks the software layer to make it work.” That’s what Tully believes Gimlet Labs offers.

If the current growth-more-computing trend continues, McKinsey estimates Data center spending will reach nearly $7 trillion by 2030. Asgar says applications only use existing hardware already deployed “somewhere between 15 and 30 percent” of the time.

“Another way to think about it: you’re wasting hundreds of billions of dollars because you’re just letting resources sit idle,” he said. “Our goal was basically to try to figure out how you can make AI workloads 10 times more efficient than ever before, today.”

Techcrunch event

San Francisco, California
|
13-15 October 2026

So he and his co-founders, Michelle Nguyen, Omid Azizi, and Natalie Serrino, began building orchestration software that cuts agents’ workloads so they can deploy to all kinds of hardware simultaneously.

Gimlet Labs claims to reliably speed up AI inference by 3x to 10x for the same cost and power. Gimlet says it can even slice the underlying model to run on different architectures, using the best chip for each part of the model.

The company has already partnered with chip makers NVIDIA, AMD, Intel, ARM, Cerebras and d-Matrix.

Gimlet’s product, delivered either as software or via an API in its own Gimlet Cloud, is not intended for the rank and file AI application developer. It is for the largest AI model labs and data centers.

The company went public in October with, he said, eight-figure revenue out of the gate (so at least $10 million). Asghar said his customer base has more than doubled in the past four months and now includes a major model maker and an ultra-large cloud computing company, although he declined to name them.

The co-founders previously worked together at Pixie, a startup that created an open source observability tool for Kubernetes. Pixie was acquired by New Relic in 2020, just two months after launching in a $9 million round led by Benchmark. (Pixie’s technology is now part of the open source organization that oversees Kubernetes.)

After Asgar met Tully by chance about a year ago and also received angel investment from Stanford professors, VCs started calling. After the launch, a term sheet landed on Asgar’s desk. When VCs heard Asgar was considering offers, “we got a pretty big flood of funding” and the round was quickly oversubscribed, he said.

In the previous round, the startup has now raised a total of $92 million, including from a range of angels including Sequoia’s Bill Coughran, Stanford professor Nick McKeown, former VMware CEO Raghu Raghuram and Intel CEO Lip-Bu Tan. The company currently employs 30 people.

Other investors include Factory, which led the seed, Eclipse Ventures, Prosperity7 and Triatomic.

elegant Gimlet inference Labs problem solves startup surprisingly
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleBernie Sanders’ AI ‘gotcha’ video fails, but the memes are great
Next Article Zipline raises another $200 million to fuel drone delivery expansion
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Slate Auto: Everything you need to know about the Bezos-backed EV startup

12 April 2026

Nvidia-backed SiFive hits $3.65 billion valuation for open AI chips

11 April 2026

After the data breach, the $10 billion startup Mercor is one month old

11 April 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Largest orbital computing cluster is open for business

13 April 2026

Roblox introduces ‘Kids’ and ‘Select’ accounts for age-appropriate access to games and chats

13 April 2026

TechCrunch Mobility: Who’s chasing all the self-driving talent?

13 April 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Cash app launches ‘pay later’ feature for P2P transfers

3 April 2026

Doss raises $55 million for AI inventory management that connects to ERP

24 March 2026

Despite stiff competition, Kalshi, Polymarket CEOs back $35m VC fund projections

23 March 2026
Startups

Walmart-owned Flipkart, Amazon are squeezing India’s e-commerce startups

This founder helped build SpaceX’s most powerful rocket engine. Now he’s building a “fighter for orbit.”

Sierra’s Bret Taylor says the era of button-clicking is over

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.