ClevrSkills Dataset
ClevrSkills Dataset
chip image

ClevrSkills includes ~300,000 robot episodes/trajectories including videos (from multiple views), corresponding actions, and other annotations including text, bounding boxes, cameras poses etc., which were generated from over 33 tasks in ClevrSkills environment suite. It includes a carefully designed curriculum of tasks which can be used for training robotics models to perform tasks ranging from simple pick and place to more complicated manipulation, such as sorting, stacking etc.

Tasks

 

This dataset contains 33 unique robotics tasks ranging from pick, place, throw to sorting, stacking, toppling, and balancing scales.

Sample tasks

  • Pick
  • Place
  • Throw
  • Touch
  • Topple
  • Push
  • Rotate
  • Match Pose
  • Trace

Dataset structure

  • This dataset consists of 3 zip files, each containing data for 1 level of tasks. The zip files are split into ~10GB splits. You might require merging them before unzipping using "zip -s 0 simple_tasks.zip --out simple_tasks_merge.zip"
  • Download size is ~111GB
  • The archive contains directories named traj_{seed} where seed denotes the seed used to generate the episode in ClevrSkills environment
  • Each directory corresponds to one episode and contains,
    • Videos (from multiple cameras – these are used as inputs for models)
    • Actions (Nx7 matrix in .npy format where N denotes the length of the episode)
    • Action labels (.npy file including text label for each action step)
    • Camera parameters (.npy files for camera parameters of each camera)
    • Ep_info.json (meta data of the episode)
    • Info.json (task specification / prompts and textures used in the episode)
    • Keysteps (images of keysteps of the task)
    • Prompt_assets.npy (images used in the prompts)
    • Rewards.npy (reward for each step in the episode)
    • Succes.npy (a label denoting if the task was successful or not at each timestep).

Dataset Creation
 

  • The data was generated using our ClevrSkills environment suite.
    The environment suite is built on top of ManiSkill2 simulator. We make use of a model of UFACTORY xArm6 robot as the main robot. We also use object models from the YCB dataset and textures from Vima to make the objects more diverse.
  • ClevrSkills include 33 different environments and corresponding oracle policies that can solve the task.
  • The tasks are randomly initialized based on a random seed. We use different seeds to initialize a new task and then use the oracle policy to solve the task. We record the videos of the actions taken by the oracle policy in the environment along with other meta data including text annotations, rewards, etc.

 

Dataset Citation
 

Citation: 

@article{haresh2024clevrskills,
title={ClevrSkills: Compositional Language And Visual Understanding in Robotics},
author={Haresh, Sanjay and Dijkman, Daniel Bhattacharyya, Apratim and Memisevic, Roland},
journal={Advances in Neural Information Processing Systems},
year={2024}
}

 

Dataset license

 

This dataset is intended for research purposes only and to support and contribute to the graph research community. The quality of the configuration space design and the collected execution times may be suboptimal and should not be considered as reference performances of the target device but rather as representative of the problem at hand for research purposes.


  

Data License Agreement - Research Use

Qualcomm AI Research

 

At Qualcomm AI Research, we are advancing AI to make its core capabilities – perception, reasoning, and action – ubiquitous across devices. Our mission is to make breakthroughs in fundamental AI research and scale them across industries. By bringing together some of the best minds in the field, we’re pushing the boundaries of what’s possible and shaping the future of AI.

 

Find out more about Qualcomm AI Research.

For any questions or technical support, please contact us at [email protected]

 

Qualcomm AI Research is an initiative of Qualcomm Technologies, Inc.

 

Connect with our communities

Stay ahead of the curve

Receive the latest updates, exclusive offers, and valuable insights delivered through the Qualcomm newsletter straight to your inbox.

Stay ahead of the curve

Receive the latest updates, exclusive offers, and valuable insights delivered through the Qualcomm newsletter straight to your inbox.

Qualcomm relentlessly innovates to deliver intelligent computing everywhere, helping the world tackle some of its most important challenges. Our leading-edge AI, high performance, low-power computing, and unrivaled connectivity deliver proven solutions that transform major industries. At Qualcomm, we are engineering human progress.

Stay connected

Get the latest Qualcomm and industry information delivered to your inbox.

Subscribe
Manage your subscription

© Qualcomm Technologies, Inc. and/or its affiliated companies.

Snapdragon and Qualcomm branded products are products of Qualcomm Technologies, Inc. and/or its subsidiaries. Qualcomm patented technologies are licensed by Qualcomm Incorporated.

Note: Certain services and materials may require you to accept additional terms and conditions before accessing or using those items.

References to "Qualcomm" may mean Qualcomm Incorporated, or subsidiaries or business units within the Qualcomm corporate structure, as applicable.

Qualcomm Incorporated includes our licensing business, QTL, and the vast majority of our patent portfolio. Qualcomm Technologies, Inc., a subsidiary of Qualcomm Incorporated, operates, along with its subsidiaries, substantially all of our engineering, research and development functions, and substantially all of our products and services businesses, including our QCT semiconductor business.

Materials that are as of a specific date, including but not limited to press releases, presentations, blog posts and webcasts, may have been superseded by subsequent events or disclosures.

Nothing in these materials is an offer to sell or license any of the services or materials referenced herein.