Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Spotify now lets you view narrated magazine articles as well

Ghost hackers: the unsolved cybersecurity mystery

Ferrari’s first EV is not for you

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    The Pope’s encyclical on artificial intelligence is not really about artificial intelligence

    25 May 2026

    Everyone is navigating real-time AI security — even Google

    25 May 2026

    I’ve tried Amazon’s Bee wearable and I’m a bit intrigued

    24 May 2026

    Elon Musk has given up on solar power (on Earth)

    24 May 2026

    Ferrari uses IBM AI to create F1 superfans

    23 May 2026
  • Apps

    Universal Music Group and TikTok renew agreement to combat unauthorized AI music

    26 May 2026

    Google is pitching an ecosystem of AI agents to consumers who might not buy it

    26 May 2026

    Founded by Tony Robbins and Calm alums, The Path hopes to offer safer treatment with artificial intelligence

    25 May 2026

    Spotify will reserve tickets for an artist’s top fans in an effort to fill the engagement

    25 May 2026

    Audio production app Huxe, founded by former NotebookLM developers, is shutting down

    24 May 2026
  • Crypto

    5 days left: Save up to $410 on Disrupt 2026 passes

    25 May 2026

    As crypto cools, a16z crypto raises $2.2 billion in capital

    6 May 2026

    Coinbase to lay off 14% of staff as part of broader restructuring

    5 May 2026

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026

    Hackers stole over $2.7 billion in crypto in 2025, data shows

    23 December 2025
  • Fintech

    Disrupt 2026 Early Bird ticket prices end May 29

    26 May 2026

    Startup Battlefield 200 applications close before May 27 | TechCrunch

    26 May 2026

    General Catalyst just led a $63 million bet in India’s travel payments market

    21 May 2026

    Startup Battlefield 200 applications close on May 27

    21 May 2026

    Venmo’s biggest makeover in years comes at a very interesting time

    11 May 2026
  • Hardware

    The Dreamie alarm clock made me stop using my phone in bed

    26 May 2026

    6 kitchen gadgets that make adult life easier

    25 May 2026

    Xreal, Google’s smart glasses partner, believes it has finally conquered this extremely difficult industry

    25 May 2026

    We tested Google’s AI glasses and they’re almost there

    23 May 2026

    Finnish phone maker HMD ropes Indian AI chatbot into new smartphone to reach local market

    22 May 2026
  • Media & Entertainment

    Spotify now lets you view narrated magazine articles as well

    26 May 2026

    Spotify launches an audiobook creation tool powered by ElevenLabs

    22 May 2026

    New York City Mayor Zohran Mamdani Takes To Twitch To Chat With New Yorkers

    21 May 2026

    Clouted wants to take the guesswork out of making short videos go viral

    21 May 2026

    ‘Ask YouTube’ Brings AI Chat Search to Video, Adds Gemini Omni to Shorts

    20 May 2026
  • Security

    Ghost hackers: the unsolved cybersecurity mystery

    26 May 2026

    Scammers abuse an internal Microsoft account to send spam links

    22 May 2026

    Law enforcement shuts down VPN service used by two dozen ransomware gangs

    21 May 2026

    GitHub says hackers stole data from thousands of internal repositories

    21 May 2026

    Customers say Trump Mobile is leaking their personal information

    20 May 2026
  • Startups

    What ClickUp’s mass layoff tells us about the future of work

    25 May 2026

    SolarSquare in talks to raise up to $60M as India’s rooftop solar market draws big VC interest

    24 May 2026

    This startup raised $43 million to create a hive mind for ships

    22 May 2026

    Maka Kids redefines kids’ screen time with a streaming app optimized for wellness, not engagement

    22 May 2026

    This new startup is taking on a fragrance industry that hasn’t changed in nearly half a century

    21 May 2026
  • Transportation

    Ferrari’s first EV is not for you

    26 May 2026

    Global EV market becomes K-shaped as US falls behind

    25 May 2026

    Tesla’s Full Self-Driving software is creeping into Europe

    25 May 2026

    TechCrunch Mobility: Robotaxi Reality Check

    24 May 2026

    Wayve’s self-driving technology is heading to US cars made by Stellantis

    24 May 2026
  • Venture

    The pitch trick that helped an eSports startup raise $20 million when VCs only wanted AI

    25 May 2026

    Peec, one of Berlin’s up-and-coming startups, more than doubled annual revenue in months to $10 million, sources say

    23 May 2026

    Convective Capital Raises $85M Fund to Build Disaster Resilience

    22 May 2026

    Sam Altman does a ‘mic drop’ pitch to every Y Combinator startup

    21 May 2026

    Startup Battlefield 200 applications close on May 27

    20 May 2026
  • Recommended Essentials
TechTost
You are at:Home»AI»Google describes new methods for training robots with video and large language models
AI

Google describes new methods for training robots with video and large language models

techtost.comBy techtost.com5 January 202403 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Google Describes New Methods For Training Robots With Video And
Share
Facebook Twitter LinkedIn Pinterest Email

2024 is going to be a huge year for the intersection of genetic AI/large fundamental models and robotics. There is a lot of excitement around the potential for various applications, ranging from learning to product design. Google’s DeepMind Robotics researchers are one of several groups exploring the possibilities of space. In a blog post Today, the team highlights ongoing research designed to give robotics a better understanding of exactly what we humans want from it.

Traditionally, robots have focused on doing a single task repeatedly during their lifetime. Disposable robots tend to be very good at this, but even they run into difficulties when changes or errors are inadvertently introduced into the process.

The newly announced AutoRT it is designed to leverage large foundation models, at many different extremes. In a typical example given by the DeepMind team, the system starts by leveraging a Visual Language Model (VLM) for better situational awareness. AutoRT is able to manage a fleet of robots working in parallel equipped with cameras to acquire a layout of their environment and the object within it.

A large language model, meanwhile, suggests tasks that can be accomplished by the hardware, including its final operator. LLMs are considered by many to be the key to unlocking robotics that efficiently understand more natural language commands, reducing the need for hard coding skills.

The system has already been tested quite a bit over the past seven months or so. AutoRT is capable of orchestrating up to 20 robots simultaneously and a total of 52 different devices. In total, DeepMind has collected about 77,000 tests, including more than 6,000 tasks.

Also new from the team is RT-Trajectory, which leverages video input for robotic learning. Many groups are exploring the use of YouTube videos as a method of training robots at scale, but RT-Trajectory adds an interesting layer by overlaying a 2D sketch of the arm in action over the video.

The team notes, “these trajectories, in the form of RGB images, provide low-level, practical visual cues to the model as it learns the robot’s control policies.”

DeepMind says the training had twice the success rate of its RT-2 training, at 63% compared to 29%, while testing 41 tasks.

“RT-Trajectory utilizes the rich robotic motion information that exists in all robot datasets, but is currently underutilized,” the team notes. “RT-Trajectory not only represents another step on the road to building robots capable of moving with efficient precision in new situations, but also unlocking knowledge from existing datasets.”

DeepMind describes Generative AI Google google deepmind robotics language large llm methods models robots training video
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleApple Fitness+ introduces audio meditation, a workout program for golfers and more
Next Article EV startup Fisker is struggling to meet internal sales targets, according to documents
bhanuprakash.cg
techtost.com
  • Website

Related Posts

Google is pitching an ecosystem of AI agents to consumers who might not buy it

26 May 2026

The Pope’s encyclical on artificial intelligence is not really about artificial intelligence

25 May 2026

Everyone is navigating real-time AI security — even Google

25 May 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Spotify now lets you view narrated magazine articles as well

26 May 2026

Ghost hackers: the unsolved cybersecurity mystery

26 May 2026

Ferrari’s first EV is not for you

26 May 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Disrupt 2026 Early Bird ticket prices end May 29

26 May 2026

Startup Battlefield 200 applications close before May 27 | TechCrunch

26 May 2026

General Catalyst just led a $63 million bet in India’s travel payments market

21 May 2026
Startups

What ClickUp’s mass layoff tells us about the future of work

SolarSquare in talks to raise up to $60M as India’s rooftop solar market draws big VC interest

This startup raised $43 million to create a hive mind for ships

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.