Close Menu
TechTost
  • AI
  • Apps
  • Crypto
  • Fintech
  • Hardware
  • Media & Entertainment
  • Security
  • Startups
  • Transportation
  • Venture
  • Recommended Essentials
What's Hot

Collecting bot training data is dirty, unsavory work. Some AI labs already pay XDOF to do it.

Uber will bring its premium robotaxi service to Houston in 2027

PayPal Ventures is shutting down as the company continues to restructure

Facebook X (Twitter) Instagram
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer
Facebook X (Twitter) Instagram
TechTost
Subscribe Now
  • AI

    SpaceX values ​​balloons at $2.6T, narrowly passes Amazon

    17 June 2026

    SpaceX Goes Public: Everything You Need to Know Post-IPO

    16 June 2026

    Sundar Pichai faces backlash, pulls out of Stanford graduation ceremony for Google’s Israel, ICE ties

    16 June 2026

    Cybersecurity vets protest ‘dangerous’ US government ban on Anthropic’s most powerful models

    15 June 2026

    OpenAI is facing investigation by state attorneys general

    15 June 2026
  • Apps

    Pinterest Launches Experimental AI Shopping App Called ‘Ask Pinterest’

    17 June 2026

    Android 17 rolls out with new multitasking tools as Google expands Gemini features

    17 June 2026

    India orders temporary ban on Telegram over exam cheating

    16 June 2026

    Meta’s new ‘AI Mode’ on Facebook draws from public information on its platforms

    16 June 2026

    UK unveils sweeping social media ban on under-16s

    15 June 2026
  • Crypto

    Startup Battlefield 200 applications close today

    27 May 2026

    5 days left: Save up to $410 on Disrupt 2026 passes

    25 May 2026

    As crypto cools, a16z crypto raises $2.2 billion in capital

    6 May 2026

    Coinbase to lay off 14% of staff as part of broader restructuring

    5 May 2026

    British cryptographer Adam Back denies NYT report that he is Bitcoin creator Satoshi Nakamoto

    9 April 2026
  • Fintech

    Robinhood’s note on 10% layoffs shows that blaming AI doesn’t cut it

    17 June 2026

    Anthropic’s latest spat with the Trump administration may actually help it, sales figures suggest

    17 June 2026

    Ramp raises $750M at $44B valuation as investors thirst for fintechs with AI history

    5 June 2026

    Last 24 hours to save up to $410 on your Disrupt 2026 ticket

    29 May 2026

    2 days left: Lock in up to $410 in ticket savings for Disrupt 2026

    28 May 2026
  • Hardware

    Snap is finally debuting its long-awaited AR glasses, the specs, and, ugh, they’re not cheap

    17 June 2026

    Qualcomm wants to be the chip in everything that replaces your smartphone, and it just announced two products to that end

    17 June 2026

    This slim speaker under the pillow helped me sleep without headphones

    14 June 2026

    Jeff Bezos’ Prometheus Raises $12 Billion to Build an ‘Artificial General Engineer’ for the Natural World

    12 June 2026

    WWDC 2026: What to expect, from Siri’s long-awaited revamp to Apple Intelligence and iOS 27

    9 June 2026
  • Media & Entertainment

    60 percent of US consumers say ‘artificial intelligence’ in brand messaging is a turnoff, survey finds

    16 June 2026

    Fox to acquire Roku in $22 billion deal

    15 June 2026

    Deezer’s new tool can recognize AI music from Spotify, Apple Music and more

    11 June 2026

    Netflix expands revamped mobile app across Asia and doubles down on games for kids

    10 June 2026

    Plex adds new social features ahead of major price hike for its lifetime pass

    6 June 2026
  • Security

    Apple is planning to change the Hide My Email privacy feature that could make it less effective

    17 June 2026

    The US government’s ban on Anthropic models was never about an AI jailbreak

    16 June 2026

    As AI agents become employees, NewCore comes up with $66 million to give them identities

    15 June 2026

    The FBI built its own replica small town to simulate real-world cyberattacks

    13 June 2026

    US surveillance law to expire for first time after lawmakers rejected Trump’s controversial pick to lead spy agency

    13 June 2026
  • Startups

    Collecting bot training data is dirty, unsavory work. Some AI labs already pay XDOF to do it.

    17 June 2026

    This startup’s super metals could soon be found in military drones, luxury watches and chef’s knives

    17 June 2026

    He’s probably raising $9 million to create a more reliable kind of AI

    16 June 2026

    Sarvam becomes India’s newest AI unicorn with $234M funding round led by HCLTech

    15 June 2026

    As AI companies scramble to go public, who else is along for the ride?

    14 June 2026
  • Transportation

    Uber will bring its premium robotaxi service to Houston in 2027

    17 June 2026

    Mobileye’s robotaxi launch in the US will put it on both sides of the AV business

    17 June 2026

    SpaceX Goes Public: Everything You Need to Know Post-IPO

    16 June 2026

    GM is joining the race to make batteries for AI data centers and the grid

    15 June 2026

    TechCrunch Mobility: SpaceX rockets pass Tesla

    14 June 2026
  • Venture

    PayPal Ventures is shutting down as the company continues to restructure

    17 June 2026

    Orbio raises $21 million to automate hiring and onboarding of frontline workers

    15 June 2026

    Why business AI will be the focus of VivaTech 2026

    10 June 2026

    How Justin Ernest invested nearly $500 million in hot startups without a traditional VC fund

    10 June 2026

    Mercor’s Brendan Foody calls out Sequoia, accusing it of “double pricing” valuation tricks.

    9 June 2026
  • Recommended Essentials
TechTost
You are at:Home»Startups»Collecting bot training data is dirty, unsavory work. Some AI labs already pay XDOF to do it.
Startups

Collecting bot training data is dirty, unsavory work. Some AI labs already pay XDOF to do it.

techtost.comBy techtost.com17 June 202605 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Collecting Bot Training Data Is Dirty, Unsavory Work. Some Ai
Share
Facebook Twitter LinkedIn Pinterest Email

Two weeks ago, OpenAI he said will restart its shuttered robotics program in 2021 — the latest sign that the biggest AI labs are racing to teach machines to function in the natural world. But building competent robots requires something the AI ​​industry doesn’t yet have, which is training data that matches that used for language models.

This gap creates a new kind of infrastructure. Unlike LLMs trained on a vast sea of ​​publicly available text, bots need data that captures physical interaction, and that kind of data barely exists. YouTube videos and footage taken by gig workers are low-fidelity and difficult to reconcile with the physical world.

XDOF (pronounced “ecks-doff”), emerging from stealth today, is betting that the next big bottleneck in AI isn’t models or brands, but the data feedback loop needed to teach robots how to interact with the natural world.

The startup aims to build the data pipelines, collection tools, and annotation systems that frontier labs and robotics companies can’t easily build—and has raised $70 million from Thrive Capital, Spark Capital, a16z, Lux, and WndrCo to do so. Co-founder and CEO Philipp Wu says XDOF, which has about 60 employees, already works with 20 clients, including several frontier AI labs, but he can’t name them.

“All the top labs are trying to pursue robotics,” Wu said. “We’ve already seen some of the negative consequences of being a little bit behind in the language model race … you don’t want to be in these kinds of situations where you’re pursuing this technology too late and everyone’s in this boat where natural AI is the next frontier.”

Wu faced this problem himself as a doctoral student at UC Berkeley. His focus was on enabling robots to learn skills from large-scale datasets. There was just one problem.

“We didn’t have large-scale data to work with,” he told TechCrunch. “There was this chicken-and-egg problem—first we needed to collect real data before we could even ask how to train a base model for robotics.”

Wu and future XDOF co-founder and CTO Fred Shentu worked on a project called GELLO, a low-cost teleoperation system that allows a human operator to control a robotic arm to generate training data. “It ended up being a very influential paper in robotics because a lot of people had similar needs and bottlenecks, and a lot of people started using this type of device to collect data,” Wu said.

Spotting the opportunity, Wu, Shentu and third co-founder and Chief Operating Officer Nemo Jin launched XDOF in October 2024 to provide a data ecosystem for companies pursuing robotics models. Mindful that providing data alone can be a dead-end business, the company also focuses on data cleaning, tool processing, and annotation — creating a self-reinforcing feedback loop for robot trainers.

As a starting point, the company is working with UC Berkeley’s AI Research Lab to release what it believes is the largest collection of high-quality robot training data ever assembled, called ABC. It includes 130,000 trajectories of robot handling data, 300 hours of simulation and 100 hours of evaluations. This kind of scaled pre-training data has never been available in academia.

“We’ve seen in language, imaging, and other areas that when models and data are released, the community achieves things that you wouldn’t necessarily expect,” David McAllister, a Ph.D. at Berkeley who helped organize the release, told TechCrunch.

The team has already used the data to train robots on benchmarking tasks like folding T-shirts and flattening boxes or loading AirPods into their cases.

Unlimited degrees of freedom

The company plans to work on three levels of a data pyramid. The most valuable layer is the teleoperation data collected for the actual robot being developed. Then come remote-controlled robots that collect more general data, such as with GELLO. and finally “self-centered” data collected from people performing everyday tasks, for which XDOF plans to build its own wearable sensors.

“Your choice of camera will affect the quality of your data—which will affect the performance of the hand-tracking algorithm,” Wu said. “If you don’t design the hardware well from the beginning, the data you collect can have very specific problems that you didn’t anticipate.”

The company plans to hire and train armies of remote operators and self-centered data operators around the world — a labor-intensive model that raises an obvious question: Why don’t the big labs do this data generation themselves?

“You need a warehouse of hundreds of thousands of square feet with hundreds of robots,” Wu said. “You have to maintain these robots, calibrate their physical parameters, and properly train the operators.”

This is a build that requires focus, capital and operational scale that most AI labs would prefer to outsource – which is exactly what XDOF is betting on.

The name XDOF is a play on the robotics term “degrees of freedom,” which describes the number of independent movements a robot can perform. Your arm, from shoulder to wrist, has seven degrees of freedom. Humanoid robotics company Figure AI has 30. The X in the company’s name captures its ambition: “Arbitrary degrees of freedom, unlimited degrees of freedom,” says Wu.

When you purchase through links in our articles, we may earn a small commission. This does not affect our editorial independence.

a16z bot collecting data Dirty Labs Pay robot Thrive Capital training unsavory work XDOF
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleUber will bring its premium robotaxi service to Houston in 2027
bhanuprakash.cg
techtost.com
  • Website

Related Posts

This startup’s super metals could soon be found in military drones, luxury watches and chef’s knives

17 June 2026

He’s probably raising $9 million to create a more reliable kind of AI

16 June 2026

Sarvam becomes India’s newest AI unicorn with $234M funding round led by HCLTech

15 June 2026
Add A Comment

Leave A Reply Cancel Reply

Don't Miss

Collecting bot training data is dirty, unsavory work. Some AI labs already pay XDOF to do it.

17 June 2026

Uber will bring its premium robotaxi service to Houston in 2027

17 June 2026

PayPal Ventures is shutting down as the company continues to restructure

17 June 2026
Stay In Touch
  • Facebook
  • YouTube
  • TikTok
  • WhatsApp
  • Twitter
  • Instagram
Fintech

Robinhood’s note on 10% layoffs shows that blaming AI doesn’t cut it

17 June 2026

Anthropic’s latest spat with the Trump administration may actually help it, sales figures suggest

17 June 2026

Ramp raises $750M at $44B valuation as investors thirst for fintechs with AI history

5 June 2026
Startups

Collecting bot training data is dirty, unsavory work. Some AI labs already pay XDOF to do it.

This startup’s super metals could soon be found in military drones, luxury watches and chef’s knives

He’s probably raising $9 million to create a more reliable kind of AI

© 2026 TechTost. All Rights Reserved
  • About Us
  • Contact Us
  • Privacy Policy
  • Terms and Conditions
  • Disclaimer

Type above and press Enter to search. Press Esc to cancel.