Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

All we like is soulfulness

Two Americans convicted of helping North Korea steal $5 million in fake IT worker scheme

This energy startup’s bet on 100-year-old grid technology is paying off

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Runway’s CEO Says AI Could Help Hollywood Make 50 Movies Instead of One $100 Million Blockbuster

    16 April 2026

    OpenAI updates its Agents SDK to help enterprises build safer, more capable agents

    16 April 2026

    Reid Hoffman weighs in on the ‘tokenmaxxing’ debate.

    15 April 2026

    Anthropic’s co-founder confirms the company briefed the Trump administration on Mythos

    15 April 2026

    Microsoft is working on yet another OpenClaw-like agent

    14 April 2026
  • Apps

    Canva’s AI assistant can now call on various tools to make designs for you

    16 April 2026

    AI learning app Gizmo soars with 13 million users and $22 million in investment

    16 April 2026

    Adobe’s new Firefly AI assistant can use Creative Cloud apps to complete tasks

    15 April 2026

    How the Freecash rewards app made it to the top of the app stores

    15 April 2026

    X brings voice memos back to X Chat

    14 April 2026
  • Crypto

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025
  • Fintech

    Airwallex is set to take on Stripe and the rest of the payments industry — in the physical world

    16 April 2026

    Cash app launches ‘pay later’ feature for P2P transfers

    3 April 2026

    Doss raises $55 million for AI inventory management that connects to ERP

    24 March 2026

    Despite stiff competition, Kalshi, Polymarket CEOs back $35m VC fund projections

    23 March 2026

    Amid legal turmoil, Kalshi is temporarily banned in Nevada

    20 March 2026
  • Hardware

    Amazon Unveils Slimmer Fire TV Stick HD, Opens Ember Artline TVs for Pre-Order

    16 April 2026

    Motorola is suing social platforms and creators over posts raising concerns about speech in India

    16 April 2026

    AI data center startup Fluidstack is in talks for a $1 billion round at an $18 billion valuation months after raising $7.5 billion, report says

    15 April 2026

    Amazon is ending support for older Kindle devices

    9 April 2026

    Intel signs Elon Musk’s Terafab chip project

    8 April 2026
  • Media & Entertainment

    All we like is soulfulness

    16 April 2026

    Wait, could they still break up Live Nation?

    16 April 2026

    HBO Max is coming to India through an exclusive JioHotstar deal

    15 April 2026

    YouTube Live Streams will now withhold ads during peak engagement to protect the atmosphere

    14 April 2026

    X says he’s reducing payouts to clickbait accounts

    12 April 2026
  • Security

    Two Americans convicted of helping North Korea steal $5 million in fake IT worker scheme

    16 April 2026

    Sweden blames Russian hackers for attempted ‘catastrophic’ cyberattack on thermal plant

    15 April 2026

    Adobe fixes PDF zero-day security flaw that hackers have been exploiting for months

    15 April 2026

    Someone planted backdoors in dozens of WordPress plugins used on thousands of websites

    14 April 2026

    Anodot hack leaves over a dozen compromised companies facing extortion

    14 April 2026
  • Startups

    This energy startup’s bet on 100-year-old grid technology is paying off

    16 April 2026

    Hightouch reaches $100M ARR powered by AI-powered marketing tools

    16 April 2026

    StrictlyVC San Francisco is less than a month away

    15 April 2026

    Walmart-owned Flipkart, Amazon are squeezing India’s e-commerce startups

    12 April 2026

    This founder helped build SpaceX’s most powerful rocket engine. Now he’s building a “fighter for orbit.”

    12 April 2026
  • Transportation

    Monarch Tractor collapse ends with takeover by Caterpillar

    16 April 2026

    Ford EV and chief technology officer are leaving the auto industry

    16 April 2026

    Chipmakers AMD, Arm and Qualcomm are investing in this buzzing self-driving technology startup

    15 April 2026

    London is closing in on its first robotaxi service as Waymo begins trials

    15 April 2026

    Tesla adds ‘ribs’, other stats to track how often drivers use Full Self-Driving software

    14 April 2026
  • Venture

    Anthropic rejects VC funding that values ​​it at $800B+, for now

    16 April 2026

    Financial risk management platform Pillar raises $20 million in rounds led by a16z

    15 April 2026

    Vercel CEO Guillermo Rauch signals IPO readiness as AI agents drive revenue

    14 April 2026

    Nvidia-backed SiFive hits $3.65 billion valuation for open AI chips

    11 April 2026

    How to make the Startup Battlefield Top 20 — and what each company gets regardless

    10 April 2026
  • Recommended Essentials
TechTost
You are at:Home»Startups»Pruna Ai Open goes the AI ​​model optimization box
Startups

Pruna Ai Open goes the AI ​​model optimization box

techtost.comBy techtost.com23 March 202503 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Pruna Ai Open Goes The Ai ​​model Optimization Box
Share
Facebook Twitter LinkedIn Pinterest Email

Pruna aiA European boot that works in AI compression algorithms makes its optimization frame open source on Thursday.

Pruna AI creates a framework that applies several methods of efficiency, such as temporary storage, pruning, quantification and distillation in a given AI model.

“We are also standardizing the savings and loading of compressed models, applying these compression methods and also evaluating your compressed model after you compress it,” said Pruna Ai Co-Ponder and CTO John Rachwan in Techcrunch.

In particular, the Pruna AI framework can evaluate if there is a significant quality loss after compression of a model and the performance profits you get.

“If I had to use a metaphor, we are similar to the way they embrace standard transformers and diffusers – how to call them, how to save them, load them, etc. We do the same, but for efficiency methods,” he added.

Large AI laboratories have already used various compression methods. For example, Openai is based on distillation to create faster versions of flagship models.

This is likely how Openai developed the GPT-4 turbo, a faster version of GPT-4. Similarly, the Flux.1-Schnell The image creation model is a distilled version of the Flux.1 model from the Black Forest Labs.

Distillation is a technique used to extract knowledge from a large AI model with a “educator-lecturer” model. Developers send requests to a teacher model and record the results. The answers are sometimes compared to a data set to see how accurate they are. These results are then used to educate the student model, which is trained to approach the teacher’s behavior.

“For big companies, what they usually do is build these things at home and what you can find in the world open source is usually based on individual methods. For example, let’s say a quantification method for LLMS or a temporary storage method for diffusion models,” Rachwan said. “But you can’t find a tool that gathers all of this, makes it all easy to use and combine together. And this is the great value that Pruna brings right now.”

From left to right: Rayan Nait Mazi, Bertrand Charpentier, John Rachwan, Stephan GünnemannImage credits:Pruna ai

While Pruna AI supports all kinds of models, from large language models to diffusion models, text -based speech models and computer vision models, the company focuses more specifically on imagery and video models right now.

Some of existing Pruna AI users include Scenario and Flare. In addition to the open source version, Pruna AI has a business offer with advanced optimization capabilities, including an optimization factor.

“The most exciting trait we have released soon will be a compression factor,” Rachwan said. “Basically. You give it your model. You say,” I want more speed, but don’t throw my precision by more than 2%. “And then, the agent will only do his magic.

PRUNA AI charges the time for Pro. “It’s similar to how you would think of a GPU when you rent a GPU on AWS or any cloud service,” Rachwan said.

And if your model is a critical part of your AI infrastructure, you will end up saving a lot of money in conclusions with the optimized model. For example, Pruna AI has made a lama model eight times smaller without excessive loss using the compression frame. Pruna AI hopes her clients will think of the compression framework as an investment she pays for herself.

Pruna AI increased a $ 6.5 million seed funding a few months ago. Investors in boot include EQT Ventures, Daphni, Motier Ventures and Kima Ventures.

box Eqt ventures model open optimization Pruna Pruna ai
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleGM works with Nvidia to bring AI to robot, factories and self-guiding cars
Next Article Cape opens $ 99 $/month Beta of its first mobile design, Proton Inks, increases $ 30 million
bhanuprakash.cg
techtost.com
  • Website

Related Posts

This energy startup’s bet on 100-year-old grid technology is paying off

16 April 2026

Hightouch reaches $100M ARR powered by AI-powered marketing tools

16 April 2026

StrictlyVC San Francisco is less than a month away

15 April 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

All we like is soulfulness

16 April 2026

Two Americans convicted of helping North Korea steal $5 million in fake IT worker scheme

16 April 2026

This energy startup’s bet on 100-year-old grid technology is paying off

16 April 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Airwallex is set to take on Stripe and the rest of the payments industry — in the physical world

16 April 2026

Cash app launches ‘pay later’ feature for P2P transfers

3 April 2026

Doss raises $55 million for AI inventory management that connects to ERP

24 March 2026
Startups

This energy startup’s bet on 100-year-old grid technology is paying off

Hightouch reaches $100M ARR powered by AI-powered marketing tools

StrictlyVC San Francisco is less than a month away

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.