Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Musk slams OpenAI in deposition, says ‘no one killed themselves because of Grok’

South Korea is opening the door to allow Google Maps to be fully operational

India cuts off access to popular developer platform Supabase with block order

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    Musk slams OpenAI in deposition, says ‘no one killed themselves because of Grok’

    28 February 2026

    Pentagon moves to designate Anthropic as a supply chain risk

    28 February 2026

    Anthropic CEO stands firm as Pentagon deadline looms

    27 February 2026

    Jack Dorsey just halved the size of Block’s employee base — and he says your company is next

    27 February 2026

    Salesforce CEO Marc Benioff: This isn’t our first SaaSpocalypse

    26 February 2026
  • Apps

    South Korea is opening the door to allow Google Maps to be fully operational

    28 February 2026

    Spotify releases audiobook maps

    28 February 2026

    Bumble adds AI photo feedback and profile guidance tools

    27 February 2026

    Threads is testing a shortcut to quickly start DM conversations

    27 February 2026

    Instagram now alerts parents if their teen is looking for suicide or self-harm content

    26 February 2026
  • Crypto

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025

    MoviePass opens Mogul fantasy league game to the public

    29 October 2025
  • Fintech

    3 days left: Save up to $680 on your ticket to Disrupt 2026

    25 February 2026

    More startups surpass $10M ARR in 3 months than ever before

    24 February 2026

    Stripe, PayPal Ventures Bet on India’s Xflow to Fix Cross-Border B2B Payments

    24 February 2026

    InScope raises $14.5M to solve financial reporting pain

    20 February 2026

    OpenAI deepens India push with Pine Labs fintech partnership

    19 February 2026
  • Hardware

    Last 24 hours to get Disrupt 2026 tickets at the lowest prices of the year

    27 February 2026

    Everything announced at Samsung’s Galaxy Unpacked event, including S26 smartphones, privacy screen and more

    26 February 2026

    Samsung introduces new display technology that adds a privacy screen to apps and notifications

    25 February 2026

    Oura launches a proprietary AI model focused on women’s health

    25 February 2026

    Spotify and Liquid Death are releasing a limited-edition speaker shaped like a … container?

    24 February 2026
  • Media & Entertainment

    Apple and Netflix team up to stream Formula 1 Canadian Grand Prix

    27 February 2026

    Netflix pulls out of bid for Warner Bros. Discovery, giving studios, HBO and CNN to Ellison-owned Paramount

    27 February 2026

    Book the best deals for Disrupt 2026 | TechCrunch

    26 February 2026

    Americans now listen to podcasts more often than talk radio, study shows

    25 February 2026

    Music producer ProducerAI joins Google Labs

    25 February 2026
  • Security

    India cuts off access to popular developer platform Supabase with block order

    28 February 2026

    CISA replaces deputy director after a difficult year on the job

    27 February 2026

    Cisco Says Hackers Are Exploiting Critical Flaw To Break Into Large Customer Networks By 2023

    26 February 2026

    US cybersecurity agency CISA reportedly in dire straits amid Trump cuts and layoffs

    26 February 2026

    Treasury sanctions Russian zero-day broker accused of buying holdings stolen from US defense contractor

    25 February 2026
  • Startups

    Jest, a marketplace for messaging games, is challenging the app store status quo

    28 February 2026

    Superhuman bets on redesigned smart ring to win back US market after Oura controversy

    27 February 2026

    Trace raises $3 million to solve AI agent adoption in the enterprise

    27 February 2026

    How to avoid bad hires in early stage startups

    26 February 2026

    Apply to take the stage at Founder Summit 2026

    26 February 2026
  • Transportation

    Self-driving truck startup Einride raises $113M PIPE ahead of public debut

    27 February 2026

    It’s time to pull the plug on plug-in hybrids

    26 February 2026

    Harbinger acquires self-driving company Phantom AI

    26 February 2026

    Waymo robotaxis are now operating in 10 US cities

    25 February 2026

    Self-driving tech startup Wayve raises $1.2 billion from Nvidia, Uber and three automakers

    25 February 2026
  • Venture

    After Zomato, Deepinder Goyal is back with a $54 million brain-monitoring bet

    28 February 2026

    Dive into Boston’s startup ecosystem at Founder Summit 2026 | TechCrunch

    27 February 2026

    A VC and some big-name developers are trying to solve the open source funding problem, permanently

    27 February 2026

    Y Combinator grad and AI insurance brokerage Harper raises $47 million

    26 February 2026

    Anthropic acquires AI startup Vercept after Meta indicts one of its founders

    26 February 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»Nvidia launches NIM to make deploying AI models smoother in production
AI

Nvidia launches NIM to make deploying AI models smoother in production

techtost.comBy techtost.com19 March 202403 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Nvidia Launches Nim To Make Deploying Ai Models Smoother In
Share
Facebook Twitter LinkedIn Pinterest Email

At the GTC conference, Nvidia today was announced Nvidia NIM, a new software platform designed to streamline the deployment of custom and pre-trained AI models in production environments. NIM takes the software work that Nvidia has done on model inference and optimization and makes it easily accessible by combining a particular model with an optimized inference engine and then packaging it into a container, making it accessible as a microservice.

Typically, it would take developers weeks — if not months — to ship similar containers, Nvidia claims — and that’s if the company even has any in-house AI talent. With NIM, Nvidia is clearly aiming to create an ecosystem of AI containers that use its hardware as the foundation layer with these curated microservices as the core software layer for companies looking to accelerate their AI roadmap.

NIM currently includes support for models from NVIDIA, A121, Adept, Cohere, Getty Images and Shutterstock as well as open models from Google, Hugging Face, Meta, Microsoft, Mistral AI and Stability AI. Nvidia is already working with Amazon, Google, and Microsoft to make these NIM microservices available on SageMaker, Kubernetes Engine, and Azure AI, respectively. They will also be integrated into frameworks such as Deepset, LangChain and LlamaIndex.

Image Credits: Nvidia

“We think Nvidia’s GPU is the best place to run inference on these models […]and we believe that NVIDIA NIM is the best software package, the best runtime, so developers can focus on enterprise applications — and just let Nvidia do the work to produce these models for them as much as possible. efficient, business-like way so they can just do the rest of their work,” said Manuvir Das, head of Nvidia’s enterprise computing division, during a press conference ahead of today’s announcements.”

As for the inference engine, Nvidia will use Triton Inference Server, TensorRT and TensorRT-LLM. Some of the Nvidia microservices available through NIM will include Riva for customizing speech and translation models, cuOpt for routing optimizations, and the Earth-2 model for weather and climate simulations.

The company plans to add additional features over time, including, for example, making the Nvidia RAG LLM operator available as a NIM, which promises to make building AI chatbots that can pull in custom data much easier.

This wouldn’t be a developer conference without a few customer and partner announcements. Current users of NIM include Box, Cloudera, Cohesity, Datastax, Dropbox
and NetApp.

“Established business platforms are a gold mine of data that can be turned into productive combinations of artificial intelligence,” said Jensen Huang, founder and CEO of NVIDIA. “Created with our partner ecosystem, these containerized AI microservices are the building blocks for businesses in every industry to become AI companies.”

All included deploying GTC jars Launches microservices models NIM nvidia production smoother
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleTruecaller adds a new AI feature to identify and block more spam calls
Next Article Ramp’s CEO says the fintech startup is just scratching the surface
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Musk slams OpenAI in deposition, says ‘no one killed themselves because of Grok’

28 February 2026

Pentagon moves to designate Anthropic as a supply chain risk

28 February 2026

Anthropic CEO stands firm as Pentagon deadline looms

27 February 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Musk slams OpenAI in deposition, says ‘no one killed themselves because of Grok’

28 February 2026

South Korea is opening the door to allow Google Maps to be fully operational

28 February 2026

India cuts off access to popular developer platform Supabase with block order

28 February 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

3 days left: Save up to $680 on your ticket to Disrupt 2026

25 February 2026

More startups surpass $10M ARR in 3 months than ever before

24 February 2026

Stripe, PayPal Ventures Bet on India’s Xflow to Fix Cross-Border B2B Payments

24 February 2026
Startups

Jest, a marketplace for messaging games, is challenging the app store status quo

Superhuman bets on redesigned smart ring to win back US market after Oura controversy

Trace raises $3 million to solve AI agent adoption in the enterprise

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.