Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

‘It wasn’t built right the first time’ — Musk’s xAI starts again, again

Digg is laying off staff and shutting down the app as well as the company’s tools

Spotify will let you edit your taste profile to control your recommendations

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    ‘It wasn’t built right the first time’ — Musk’s xAI starts again, again

    14 March 2026

    Before quantum computing arrives, this startup wants businesses that are already working on it

    13 March 2026

    How to watch Jensen Huang’s Nvidia GTC 2026 keynote

    13 March 2026

    Ford’s new AI assistant will help fleet owners know if seat belts are being used

    12 March 2026

    AI ‘Actress’ Tilly Norwood Releases Worst Song I’ve Ever Heard

    12 March 2026
  • Apps

    Digg is laying off staff and shutting down the app as well as the company’s tools

    14 March 2026

    Truecaller now lets you hang up on scammers — on behalf of your family

    13 March 2026

    Channel Surfer lets you watch YouTube like it’s old-school cable TV

    13 March 2026

    Google Maps is getting an AI ‘Ask Maps’ feature and upgraded ‘immersive’ navigation

    12 March 2026

    Google Play adds new paid and PC games, game tests, community posts and more

    12 March 2026
  • Crypto

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025

    New report examines how David Sachs may benefit from Trump administration role

    1 December 2025

    Why Benchmark Made a Rare Crypto Bet on Trading App Fomo, with $17M Series A

    6 November 2025

    Solana co-founder Anatoly Yakovenko is a big fan of agentic coding

    30 October 2025

    MoviePass opens Mogul fantasy league game to the public

    29 October 2025
  • Fintech

    India neobank Fi removes banking services on its platform

    11 March 2026

    X taps William Shatner to give invitations to his payment service, X Money

    4 March 2026

    Stripe wants to turn your AI costs into a profit center

    3 March 2026

    3 days left: Save up to $680 on your ticket to Disrupt 2026

    25 February 2026

    More startups surpass $10M ARR in 3 months than ever before

    24 February 2026
  • Hardware

    Ex-Apple Engineer Raises $5M for Note-Taking Locket That Only Records Your Voice

    12 March 2026

    Canopii seems to succeed where the old indoor farms failed

    11 March 2026

    Hyperscale Power is the latest startup to challenge 140-year-old transformer technology

    10 March 2026

    Whoop is launching a new blood test focused on women’s health

    10 March 2026

    Honor says its ‘Robot phone’ with moving camera can dance to music

    8 March 2026
  • Media & Entertainment

    Spotify will let you edit your taste profile to control your recommendations

    13 March 2026

    Disney+ launches TikTok-style short-form video stream ‘Verts’

    13 March 2026

    Substack launches an embedded recording studio

    12 March 2026

    TikTok now allows Apple Music subscribers to play entire songs without leaving the app

    12 March 2026

    WordPress debuts a private workspace that runs in your browser via a new service, my.WordPress.net

    11 March 2026
  • Security

    Law enforcement shuts down botnet consisting of tens of thousands of hacked routers

    12 March 2026

    The pro-Iranian hacktivist group says it is behind the attack on medical technology giant Stryker

    12 March 2026

    Salt Typhoon hacks the world’s phone and internet giants — here’s where they’ve been hit

    11 March 2026

    DOGE employee stole Social Security data and thumbed it, report says

    11 March 2026

    US military contractor likely built iPhone hacking tools used by Russian spies in Ukraine

    10 March 2026
  • Startups

    Chinese brain interface startup Gestala raises $21 million just two months after launching

    13 March 2026

    Sales automation startup Rox AI hits $1.2 billion valuation, sources say

    13 March 2026

    When startups become a family business

    12 March 2026

    Ride-hailing inDrive acquires Pakistan’s Krave Mart to boost grocery delivery

    12 March 2026

    Google completes $32 billion acquisition of cloud cybersecurity startup Wiz

    11 March 2026
  • Transportation

    Kinetic robotics joins Uber’s Vegas app two years after major reset

    13 March 2026

    Why Rivian is holding onto the $45,000 R2 base model until ‘late 2027’

    13 March 2026

    Group14 opens factory to produce flash charge battery materials for EVs

    12 March 2026

    Nuro is testing its autonomous vehicle technology on the streets of Tokyo

    12 March 2026

    Zoox plans to put its robotaxis on the Uber app in Vegas this year

    11 March 2026
  • Venture

    Gumloop gets $50M from Benchmark to turn every worker into an AI agent builder

    13 March 2026

    This SpaceX Veteran Says The Next Big Thing In Space Is Satellites Returning To Earth

    10 March 2026

    Founders Fund is approaching $6 billion for its latest growth fund, sources say

    10 March 2026

    Robinhood’s startup fund stumbles in its NYSE debut

    7 March 2026

    City Detect, which uses artificial intelligence to help cities stay safe and clean, raises $13M Series A

    7 March 2026
  • Recommended Essentials
TechTost
You are at:Home»Media & Entertainment»Openai upgrades AI models and voice
Media & Entertainment

Openai upgrades AI models and voice

techtost.comBy techtost.com23 March 202504 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Openai Upgrades Ai Models And Voice
Share
Facebook Twitter LinkedIn Pinterest Email

Openai brings new AI models and voice models to API that the company claims to improve its previous releases.

For Openai, models fit the wider “Agentic” Vision: Building Automated Systems that can achieve independent tasks on behalf of users. The definition of “Agent” may be disputed, but the head of the product Openai Olivier Godment described an interpretation as a chatbot that can talk to the customers of a business.

“We will see more and more agents appear in the coming months,” Godment told TechCrunch during an information. “And so the general issue helps customers and developers to exploit agents who are useful, available and accurate.”

Openai claims that the new speech text model, “GPT-4o-mini-ts”, not only offers a more distinctive and realistic speech, but is also more “painful” than previous speech discussion models. Developers can command the GPT-4o-mini-ts on how to say things in the natural language-for example, “speak like a crazy scientist” or “use a tranquil voice, as an awareness teacher”.

Here is a “true crime”, outdated voice:

And here is a sample of female “professional” voice:

Jeff Harris, a member of the product staff at Openai, told Techcrunch that the goal is to let developers adapt both “experience” and “frame”.

“In different contexts, you don’t just want a flat, monotonous voice,” Harris said. “If you are in a customer support experience and you want the voice to be apologetic because it has made a mistake, you can really have the voice to have this feeling in it … Our great belief, here, is that developers and users really want to control not only what is being said, but how things are talking about.”

Concerning the new OpenAi text speech models, the “GPT-4o-Transcribe” and “GPT-4O-Mini-Transcribe”, effectively replace the Whisper Long-in-Sooth transcription model. It was trained in “different, high quality audio data sets”, new models can better record the bow and varied reason, the Openai claims, and even in chaotic environments.

They are also less likely to deform, Harris added. The whispers tend to make words – and even whole passages – in conversations, introducing everything, from racial comments to fantastic medical treatments in transcripts.

“[T]The models are very much improved against this front, “Harris said. [in this context] means that models are just listening to the words [and] They do not complete details that they did not hear. ”

However, your kilometers may vary depending on the language transcribed.

According to Openai’s internal reference points, the GPT-4O-transcribe, the more accurate than the two transcription models, has a “word error percentage” approaching 30% (from 120%) for Infing and Dravidian languages ​​such as Tamil, Telugu, Malayalam and Kannada. This means that three of the 10 words from the model will differ from a human transcript in these languages.

The results from the comparative evaluation of the OpenAi transcription.Image credits:Open

In a break from tradition, Openai does not plan to make new transcription models openly available. The company Historically released new versions of Whisper For commercial use with MIT license.

Harris said the trastrice GPT-4o-transcribe and GPT-4O-mini-transcribe are “much larger than whispers” and are therefore not good candidates for open release.

“[T]It’s not the kind of model you can run locally on your laptop, like whisper, “he continued.”[W]You want to make sure that if we release things at an open source, we do it carefully and have a model that is really improved for this particular need. And we believe that end user devices are one of the most interesting cases for open source models. ”

Updated March 20, 2025, 11:54 am Pt to clarify the language Around the word error rate and updated the reference results chart with a more recent version.

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleCape opens $ 99 $/month Beta of its first mobile design, Proton Inks, increases $ 30 million
Next Article Tiktok to start pushing amber notifications to users for your flows
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Spotify will let you edit your taste profile to control your recommendations

13 March 2026

Disney+ launches TikTok-style short-form video stream ‘Verts’

13 March 2026

Substack launches an embedded recording studio

12 March 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

‘It wasn’t built right the first time’ — Musk’s xAI starts again, again

14 March 2026

Digg is laying off staff and shutting down the app as well as the company’s tools

14 March 2026

Spotify will let you edit your taste profile to control your recommendations

13 March 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

India neobank Fi removes banking services on its platform

11 March 2026

X taps William Shatner to give invitations to his payment service, X Money

4 March 2026

Stripe wants to turn your AI costs into a profit center

3 March 2026
Startups

Chinese brain interface startup Gestala raises $21 million just two months after launching

Sales automation startup Rox AI hits $1.2 billion valuation, sources say

When startups become a family business

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.