Home
Loading

aVenture is in Alpha: During this preview period, you should expect the research data to be limited and may not yet meet our exacting standards. We've made the decision to provide early access to our data to showcase the product as we build, but you should not yet rely upon it alone for your investment decisions.

aVenture is in Alpha: During this preview period, you should expect the research data to be limited and may not yet meet our exacting standards. We've made the decision to provide early access to our data to showcase the product as we build, but you should not yet rely upon it alone for your investment decisions.

Get in touch

  • Contact

  • Request a demo

  • Request data updates

  • Add a company

Research

  • Companies

  • Investors

  • People

aVenture

  • Sitemap

  • Feature requests

Member

Backed by

© aVenture Investment Company, 2026. All rights reserved.

San Francisco, CA, USA

Privacy Policy

aVenture Investment Company ("aVenture") is an independent research platform providing detailed analysis and data on startups, venture capital investments, and key industry individuals. It is not a registered investment adviser, broker-dealer, or investment advisor and does not provide investment advice or recommendations. The data provided by aVenture does not constitute recommendations or advice, whether by methodology, analysis, AI-generated content, or a statement written by a staff member of aVenture.

aVenture is not affiliated with any of the people, companies, organizations, government agencies, regulatory bodies, or investment funds we provide coverage for on this site unless explicitly stated otherwise. Users assume full responsibility for decisions made based on information obtained from this platform. Links to external websites do not imply endorsement or affiliation with aVenture. Any links that provide the ability to invest in a primary or secondary transaction in a company are for convenience only and do not constitute solicitations or offers to buy or sell an investment. Investors should exercise heightened precaution and due diligence when investing in private companies, especially those not independently audited.

While we strive to provide valuable insights with objectivity and professional diligence, we cannot guarantee the accuracy of the information provided on our platform. Before making any investment decisions, you should verify the accuracy of all pertinent details for your decision. To the fullest extent permitted by law, aVenture shall not be liable for any direct, indirect, incidental, consequential, or financial damages arising from use of this site, whether by consumers of its contents directly or by persons or organizations covered by our research, even if we are advised of the possibility. Our best-efforts processes and correction request forms do not create a warranty or duty of care.

Profiles on this platform may include content generated in part by large language models (LLMs, artificial intelligence) that aggregate publicly available sources (e.g., SEC EDGAR, public filings, press releases). Source attribution is provided where known; always verify statements and claims here against original sources before relying on any data. Content on our site may contain inaccuracies, omissions, or what are commonly called 'hallucinations' if generated in part or in full by AI / LLMs. The risk can also exist even when content is written by a human, as internal and third-party sources may also have inaccuracies for the same or different reasons. While we randomly audit a proportion of content, this is not exhaustive.

We recommend that an independent auditor be hired to verify the accuracy of the information before relying on it for any sensitive decisions. By accessing this platform, you agree not to rely solely on any information generated by AI, aggregated, or sourced or written otherwise on this site, for investment, financial, or other decisions. aVenture assumes no responsibility for inaccuracies, omissions, or hallucinations. You must independently verify all data from primary sources. Use of this platform constitutes your waiver of claims for reliance-based damages, including negligent misrepresentation. To report an error, request a correction, or dispute information about a company or individual, contact us via our request data updates form.

Loading
Loading
Home
News
ABC-130K: The largest open source teleoperation dataset

From XDOF Blog

By

June 17, 2026

ABC-130K: The largest open source teleoperation dataset

The momentum behind robotics is unprecedented. New advancements up and down the stack have driven incredible energy and interest into the space. They have provided the world with a tangible look at the path towards general-purpose robots. It is much needed: as language models have created fully agent-driven experiences, we are left painfully aware of the gap between what is possible in the digital versus the physical. The researchers' scramble for data is one we know well from our days at UC Berkeley and beyond. Too often, researchers are forced to piece together whatever bits of data they can find from existing datasets like DROID or Open X-Embodiment. All this effort is just for data collection, let alone processing, language annotations and quality control. We think the robotics world deserves more, and better. So, we teamed up with the authors behind ABC: Scalable Behavior Cloning with Open Data, Training, and Evaluation to do something about it. More data, higher quality, broader access As part of the broader paper, we are excited to release the following datasets for research purposes. ABC-130K contains over 130,000 episodes across 195 bimanual manipulation tasks. ABC-Sim contains teleoperation data in simulation for 10 tasks and is coming soon. ABC-Eval contains real evaluation trials of various checkpoints and hyperparameters for 3 tasks and is coming soon. All data are on bimanual station setups using YAM arms from I2RT. All datasets are provided under the Apache 2.0 license. ABC-130K is the largest open source teleoperation dataset to-date. It combines data scale while maintaining high quality, all while using the accessible YAM setup that has gained popularity in the last couple of years. Example task categories include pick and place, folding, insertion and ejection, tool use, assembly and disassembly, tying and untying, and more. We also made XDOF's evaluations service available to the authors. Establishing a foundation for behavior cloning Thank you to UC Berkeley lead authors on this work and collaborators for making this all possible. The data is just one part of the foundation laid out by this work. Our hope is to provide the infra for robot learning that we wish we had.

View original article on xdof.ai

Most Recent

Former Infosys chief has a new startup that wants to challenge the IT services world

Former Infosys chief has a new startup that wants to challenge the IT services world

Backed by Mayfield and Aramco Ventures, Vishal Sikka’s new venture brings together veterans from SAP, Infosys, and VianAI.

Jun 24, 2026

AI was supposed to kill engineering jobs, but new data suggests they’re the most resilient

AI was supposed to kill engineering jobs, but new data suggests they’re the most resilient

While AI dominates the layoff narrative, engineers are actually making up a larger share of new hires, according to SignalFire data.

Jun 24, 2026

Here’s why Slate changed the battery in its cheap EV truck

Here’s why Slate changed the battery in its cheap EV truck

While there was probably a moment when Slate’s leadership had to green-light the switch from one battery type to another, the momentum toward that decision had been building for years.

Jun 24, 2026

Valor Equity Partners looks to raise a $2.5B Fund VII, per Bloomberg

Valor Equity Partners looks to raise a $2.5B Fund VII, per Bloomberg

New details have emerged about Valor's latest fund, which last year announced it was raising an unspecified amount of capital.

Jun 24, 2026

Similar Posts

Collecting robot training data is dirty, unglamorous work. Some AI labs are already paying XDOF to do it.

Collecting robot training data is dirty, unglamorous work. Some AI labs are already paying XDOF to do it.

TechCrunch reports that XDOF has emerged from stealth with a $70 million funding round and is building data pipelines, collection tools, and annotation systems for robotics foundation models. The company was founded in October 2024 and counts frontier AI labs among its customers.

Jun 16, 2026

Alloy is bringing data management to the robotics industry

Alloy is bringing data management to the robotics industry

Australia-based Alloy thinks it can help robotics firms with their data problem: The startup is building data infrastructure to help companies process and organize all the data their robots collect.

Sep 23, 2025

Teleo wants to help the robotics industry reach its ‘ChatGPT moment’

Teleo wants to help the robotics industry reach its ‘ChatGPT moment’

Teleo describes itself as a construction robotics startup, but its mission is bigger than automating heavy equipment like excavators and tractors. Today, Teleo’s retrofitted machinery allows its customers to operate their existing fleets semi-autonomously. In the future, the startup sees the data it collects as a key enabler for the robotics industry to reach its “ChatGPT moment.” That isn’t an aspiration to reach the same level of hype surrounding ChatGPT. Instead, Teleo CEO Vinay Shet sees an

Nov 21, 2024

Nomadic raises $8.4 million to wrangle the data pouring off autonomous vehicles

Nomadic raises $8.4 million to wrangle the data pouring off autonomous vehicles

The company turns footage from robots into structured, searchable datasets with a deep learning model.

Mar 31, 2026

Most Recent

Former Infosys chief has a new startup that wants to challenge the IT services world

Former Infosys chief has a new startup that wants to challenge the IT services world

Backed by Mayfield and Aramco Ventures, Vishal Sikka’s new venture brings together veterans from SAP, Infosys, and VianAI.

Jun 24, 2026

AI was supposed to kill engineering jobs, but new data suggests they’re the most resilient

AI was supposed to kill engineering jobs, but new data suggests they’re the most resilient

While AI dominates the layoff narrative, engineers are actually making up a larger share of new hires, according to SignalFire data.

Jun 24, 2026

Here’s why Slate changed the battery in its cheap EV truck

Here’s why Slate changed the battery in its cheap EV truck

While there was probably a moment when Slate’s leadership had to green-light the switch from one battery type to another, the momentum toward that decision had been building for years.

Jun 24, 2026

Valor Equity Partners looks to raise a $2.5B Fund VII, per Bloomberg

Valor Equity Partners looks to raise a $2.5B Fund VII, per Bloomberg

New details have emerged about Valor's latest fund, which last year announced it was raising an unspecified amount of capital.

Jun 24, 2026

Similar Posts

Collecting robot training data is dirty, unglamorous work. Some AI labs are already paying XDOF to do it.

Collecting robot training data is dirty, unglamorous work. Some AI labs are already paying XDOF to do it.

TechCrunch reports that XDOF has emerged from stealth with a $70 million funding round and is building data pipelines, collection tools, and annotation systems for robotics foundation models. The company was founded in October 2024 and counts frontier AI labs among its customers.

Jun 16, 2026

Alloy is bringing data management to the robotics industry

Alloy is bringing data management to the robotics industry

Australia-based Alloy thinks it can help robotics firms with their data problem: The startup is building data infrastructure to help companies process and organize all the data their robots collect.

Sep 23, 2025

Teleo wants to help the robotics industry reach its ‘ChatGPT moment’

Teleo wants to help the robotics industry reach its ‘ChatGPT moment’

Teleo describes itself as a construction robotics startup, but its mission is bigger than automating heavy equipment like excavators and tractors. Today, Teleo’s retrofitted machinery allows its customers to operate their existing fleets semi-autonomously. In the future, the startup sees the data it collects as a key enabler for the robotics industry to reach its “ChatGPT moment.” That isn’t an aspiration to reach the same level of hype surrounding ChatGPT. Instead, Teleo CEO Vinay Shet sees an

Nov 21, 2024

Nomadic raises $8.4 million to wrangle the data pouring off autonomous vehicles

Nomadic raises $8.4 million to wrangle the data pouring off autonomous vehicles

The company turns footage from robots into structured, searchable datasets with a deep learning model.

Mar 31, 2026