Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Universal Music Group and TikTok renew agreement to combat unauthorized AI music

Disrupt 2026 Early Bird ticket prices end May 29

Google is pitching an ecosystem of AI agents to consumers who might not buy it

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    The Pope’s encyclical on artificial intelligence is not really about artificial intelligence

    25 May 2026

    Everyone is navigating real-time AI security — even Google

    25 May 2026

    I’ve tried Amazon’s Bee wearable and I’m a bit intrigued

    24 May 2026

    Elon Musk has given up on solar power (on Earth)

    24 May 2026

    Ferrari uses IBM AI to create F1 superfans

    23 May 2026
  • Apps

    Universal Music Group and TikTok renew agreement to combat unauthorized AI music

    26 May 2026

    Google is pitching an ecosystem of AI agents to consumers who might not buy it

    26 May 2026

    Founded by Tony Robbins and Calm alums, The Path hopes to offer safer treatment with artificial intelligence

    25 May 2026

    Spotify will reserve tickets for an artist’s top fans in an effort to fill the engagement

    25 May 2026

    Audio production app Huxe, founded by former NotebookLM developers, is shutting down

    24 May 2026
  • Crypto

    5 days left: Save up to $410 on Disrupt 2026 passes

    25 May 2026

    As crypto cools, a16z crypto raises $2.2 billion in capital

    6 May 2026

    Coinbase to lay off 14% of staff as part of broader restructuring

    5 May 2026

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025
  • Fintech

    Disrupt 2026 Early Bird ticket prices end May 29

    26 May 2026

    Startup Battlefield 200 applications close before May 27 | TechCrunch

    26 May 2026

    General Catalyst just led a $63 million bet in India’s travel payments market

    21 May 2026

    Startup Battlefield 200 applications close on May 27

    21 May 2026

    Venmo’s biggest makeover in years comes at a very interesting time

    11 May 2026
  • Hardware

    The Dreamie alarm clock made me stop using my phone in bed

    26 May 2026

    6 kitchen gadgets that make adult life easier

    25 May 2026

    Xreal, Google’s smart glasses partner, believes it has finally conquered this extremely difficult industry

    25 May 2026

    We tested Google’s AI glasses and they’re almost there

    23 May 2026

    Finnish phone maker HMD ropes Indian AI chatbot into new smartphone to reach local market

    22 May 2026
  • Media & Entertainment

    Spotify launches an audiobook creation tool powered by ElevenLabs

    22 May 2026

    New York City Mayor Zohran Mamdani Takes To Twitch To Chat With New Yorkers

    21 May 2026

    Clouted wants to take the guesswork out of making short videos go viral

    21 May 2026

    ‘Ask YouTube’ Brings AI Chat Search to Video, Adds Gemini Omni to Shorts

    20 May 2026

    Google’s Gemini Omni turns images, audio and text into video — and that’s just the beginning

    19 May 2026
  • Security

    Scammers abuse an internal Microsoft account to send spam links

    22 May 2026

    Law enforcement shuts down VPN service used by two dozen ransomware gangs

    21 May 2026

    GitHub says hackers stole data from thousands of internal repositories

    21 May 2026

    Customers say Trump Mobile is leaking their personal information

    20 May 2026

    US cyber agency CISA has exposed bundles of passwords and cloud keys to the open web

    19 May 2026
  • Startups

    What ClickUp’s mass layoff tells us about the future of work

    25 May 2026

    SolarSquare in talks to raise up to $60M as India’s rooftop solar market draws big VC interest

    24 May 2026

    This startup raised $43 million to create a hive mind for ships

    22 May 2026

    Maka Kids redefines kids’ screen time with a streaming app optimized for wellness, not engagement

    22 May 2026

    This new startup is taking on a fragrance industry that hasn’t changed in nearly half a century

    21 May 2026
  • Transportation

    Global EV market becomes K-shaped as US falls behind

    25 May 2026

    Tesla’s Full Self-Driving software is creeping into Europe

    25 May 2026

    TechCrunch Mobility: Robotaxi Reality Check

    24 May 2026

    Wayve’s self-driving technology is heading to US cars made by Stellantis

    24 May 2026

    How Elon Musk will increase his power through the SpaceX IPO

    23 May 2026
  • Venture

    The pitch trick that helped an eSports startup raise $20 million when VCs only wanted AI

    25 May 2026

    Peec, one of Berlin’s up-and-coming startups, more than doubled annual revenue in months to $10 million, sources say

    23 May 2026

    Convective Capital Raises $85M Fund to Build Disaster Resilience

    22 May 2026

    Sam Altman does a ‘mic drop’ pitch to every Y Combinator startup

    21 May 2026

    Startup Battlefield 200 applications close on May 27

    20 May 2026
  • Recommended Essentials
TechTost
You are at:Home»Media & Entertainment»DeepMind’s new AI creates soundtracks and dialogues for videos
Media & Entertainment

DeepMind’s new AI creates soundtracks and dialogues for videos

techtost.comBy techtost.com17 June 202403 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Deepmind's New Ai Creates Soundtracks And Dialogues For Videos
Share
Facebook Twitter LinkedIn Pinterest Email

DeepMind, Google’s AI research lab, says it’s developing AI technology to create soundtracks for videos.

In a Position on its official blog, DeepMind says it sees the technology, V2A (short for “video-to-audio”), as an essential piece of the multimedia puzzle created by artificial intelligence. While many bodies, including DeepMind, have developed artificial intelligence models that generate videos, these models cannot generate sound effects to synchronize with the videos they generate.

“Video production models are advancing at an incredible pace, but many current systems can only produce silent output,” writes DeepMind. “V2A technology [could] become a promising approach to the life of the films being made.’

DeepMind’s V2A technology follows the description of a soundtrack (eg “jellyfish pulsating underwater, sea life, ocean”) combined with a video to generate music, sound effects and even dialogue that match the characters and the tone of the video, watermarked by DeepMind’s deep fake -fighting SynthID technology. The AI ​​model powering V2A, a diffusion model, was trained on a combination of audio and dialogue transcripts as well as video clips, DeepMind says.

“By training on video, audio, and additional annotations, our technology learns to associate specific audio events with various visual scenes while responding to information provided in annotations or transcripts,” according to DeepMind.

Mom is the word on whether any of the training data is copyrighted — and whether the creators of the data were notified of DeepMind’s work. We’ve reached out to DeepMind for clarification and will update this post if we hear back.

AI-powered audio production tools aren’t groundbreaking. Startup Stability AI released one just last week and ElevenLabs released one in May. Neither are models for creating video audio effects. A Microsoft work can create speech and song videos from still image and platforms like Spades and GenreX have trained models to shoot video and make a better guess as to what music or effects are appropriate in a given scene.

But DeepMind claims its V2A technology is unique in that it can understand the raw pixels from a video and automatically sync sounds produced with the video, optionally without description.

V2A isn’t perfect, and DeepMind recognizes that. Because the underlying model has not been trained on many videos with artifacts or distortions, it does not generate particularly high-quality audio for them. And in general, the sound created is not super convincing; My colleague Natasha Lomas described it as “a hodgepodge of stereotypical sounds,” and I can’t say I disagree.

For these reasons, and to prevent misuse, DeepMind says it won’t be releasing the technology to the public anytime soon, if ever.

“To make sure our V2A technology can have a positive impact on the creative community, we gather diverse perspectives and ideas from leading creators and filmmakers and use this valuable feedback to inform our ongoing research and development,” writes DeepMind. “Before we consider opening access to it to the wider public, our V2A technology will undergo rigorous security assessments and testing.”

DeepMind pitches its V2A technology as a particularly useful tool for archivists and people working with historical footage. But genetic AI along these lines also threatens to upend the film and television industry. It’s going to take some seriously strong labor protections to ensure that the tools of media production don’t eliminate jobs — or, as the case may be, entire professions.

All included creates DeepMind DeepMinds dialogues Generative AI Research soundtracks v2a video videos
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleStop playing games with online security, Signal chairman warns EU lawmakers
Next Article Finbourne raises $70 million for technology that turns financial data dust into AI gold
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Xreal, Google’s smart glasses partner, believes it has finally conquered this extremely difficult industry

25 May 2026

I’ve tried Amazon’s Bee wearable and I’m a bit intrigued

24 May 2026

Ferrari uses IBM AI to create F1 superfans

23 May 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Universal Music Group and TikTok renew agreement to combat unauthorized AI music

26 May 2026

Disrupt 2026 Early Bird ticket prices end May 29

26 May 2026

Google is pitching an ecosystem of AI agents to consumers who might not buy it

26 May 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Disrupt 2026 Early Bird ticket prices end May 29

26 May 2026

Startup Battlefield 200 applications close before May 27 | TechCrunch

26 May 2026

General Catalyst just led a $63 million bet in India’s travel payments market

21 May 2026
Startups

What ClickUp’s mass layoff tells us about the future of work

SolarSquare in talks to raise up to $60M as India’s rooftop solar market draws big VC interest

This startup raised $43 million to create a hive mind for ships

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.