Startup Gimlet Labs solves the AI inference problem in a surprisingly elegant way

By techtost.com | 24 March 2026 | 4 Mins Read

Stanford adjunct professor Zain Asgar, a founder with a prior exit, just raised an $80 million Series A round for a startup that solves the AI inference bottleneck in a smart way. The round was led by Menlo Ventures.

The company, Gimlet Labs, has created what it claims is the first and only “multi-silicon inference cloud”: software that allows an AI workload to run simultaneously on several types of hardware. It can split the work of an AI application across traditional CPUs, AI-tuned GPUs and high-memory systems.

“We’re basically dealing with any different hardware that’s available,” Asgar told TechCrunch.

A single agent can combine multiple steps, and each “requires different hardware: Inference is compute-bound, decoding is memory-bound, and tool calls are network-bound,” lead investor Tim Tully of Menlo writes in a blog post about the funding.

No one chip does it all yet, but as new hardware comes out and old GPUs are retooled, “the multi-silicon fleet is ready — it just lacks the software layer to make it work.” That’s what Tully believes Gimlet Labs offers.
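To make the idea concrete, here is a minimal sketch of routing each step of an agent to a hardware pool based on its bottleneck. It is an illustration only, not Gimlet Labs' software: the `Bottleneck`, `Step` and `assign_pools` names are hypothetical, and the mapping from bottleneck to pool (GPU, high-memory system, CPU) is an assumption made for the example.

```python
from dataclasses import dataclass
from enum import Enum


class Bottleneck(Enum):
    COMPUTE = "compute-bound"
    MEMORY = "memory-bound"
    NETWORK = "network-bound"


# Hypothetical mapping from bottleneck to hardware pool. The pool names are
# illustrative assumptions and are not taken from Gimlet Labs' product.
PREFERRED_POOL = {
    Bottleneck.COMPUTE: "gpu",
    Bottleneck.MEMORY: "high-memory",
    Bottleneck.NETWORK: "cpu",
}


@dataclass(frozen=True)
class Step:
    name: str
    bottleneck: Bottleneck


def assign_pools(steps):
    """Return (step name, pool) pairs, one per step, in the original order."""
    return [(step.name, PREFERRED_POOL[step.bottleneck]) for step in steps]


# The three step types named in Tully's blog post.
agent_steps = [
    Step("inference", Bottleneck.COMPUTE),
    Step("decoding", Bottleneck.MEMORY),
    Step("tool call", Bottleneck.NETWORK),
]

for name, pool in assign_pools(agent_steps):
    print(f"{name} -> {pool}")
```

A real orchestrator would also have to weigh availability, cost and data movement between pools. The static lookup table here only shows why a single agent can end up spread across several kinds of chips.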

If the current trend of ever-growing compute demand continues, McKinsey estimates that data center spending will reach nearly $7 trillion by 2030. Asgar says applications use the hardware that is already deployed only “somewhere between 15 and 30 percent” of the time.

“Another way to think about it: you’re wasting hundreds of billions of dollars because you’re just letting resources sit idle,” he said. “Our goal was basically to try to figure out how you can make AI workloads 10 times more efficient than ever before, today.”
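The arithmetic behind that claim can be sketched in a few lines. The fleet cost below is a hypothetical round number chosen for readability, not a figure from Asgar or McKinsey, and `idle_spend` is an illustrative helper. Only the 15 and 30 percent utilization figures come from the article.

```python
def idle_spend(total_spend_usd: int, utilization_pct: int) -> int:
    """Dollars of hardware spend that sit idle at a given utilization."""
    if not 0 <= utilization_pct <= 100:
        raise ValueError("utilization_pct must be between 0 and 100")
    return total_spend_usd * (100 - utilization_pct) // 100


# Hypothetical fleet cost, chosen only to make the arithmetic easy to follow.
FLEET_SPEND_USD = 100_000_000_000

for pct in (15, 30):
    idle = idle_spend(FLEET_SPEND_USD, pct)
    print(f"{pct}% utilized -> ${idle / 1e9:.0f}B idle")
# prints:
# 15% utilized -> $85B idle
# 30% utilized -> $70B idle
```

At those utilization rates, between 70 and 85 cents of every dollar spent on hardware buys capacity that is idle at any given moment.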


So he and his co-founders, Michelle Nguyen, Omid Azizi, and Natalie Serrino, began building orchestration software that slices agents’ workloads into pieces that can be deployed across all kinds of hardware simultaneously.

Gimlet Labs claims to reliably speed up AI inference by 3x to 10x for the same cost and power. Gimlet says it can even slice the underlying model to run on different architectures, using the best chip for each part of the model.

The company has already partnered with chip makers NVIDIA, AMD, Intel, ARM, Cerebras and d-Matrix.

Gimlet’s product, delivered either as software or via an API in its own Gimlet Cloud, is not intended for the rank-and-file AI application developer. It is aimed at the largest AI model labs and data centers.

The company launched publicly in October with, Asgar said, eight-figure revenue out of the gate (so at least $10 million). He said its customer base has more than doubled in the past four months and now includes a major model maker and a very large cloud computing company, although he declined to name them.

The co-founders previously worked together at Pixie, a startup that created an open source observability tool for Kubernetes. New Relic acquired Pixie in 2020, just two months after it launched with a $9 million round led by Benchmark. (Pixie’s technology is now part of the open source organization that oversees Kubernetes.)

After Asgar met Tully by chance about a year ago and also received angel investment from Stanford professors, VCs started calling. After the launch, a term sheet landed on Asgar’s desk. When VCs heard Asgar was considering offers, “we got a pretty big flood of funding” and the round was quickly oversubscribed, he said.

Counting its previous round, the startup has now raised a total of $92 million. Its angel investors include Sequoia’s Bill Coughran, Stanford professor Nick McKeown, former VMware CEO Raghu Raghuram and Intel CEO Lip-Bu Tan. The company currently employs 30 people.

Other investors include Factory, which led the seed, Eclipse Ventures, Prosperity7 and Triatomic.
