Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning

By Marcel Hussing, Jorge Mendez-Mendez, Anisha Singrodia, Cassandra Kent, and Eric Eaton

Reinforcement Learning Journal, vol. 2, 2024, pp. 979–994.

Presented at the Reinforcement Learning Conference (RLC), Amherst Massachusetts, August 9–12, 2024.


Download:

Abstract:

Offline reinforcement learning (RL) is a promising direction that allows RL agents to pre-train on large datasets, avoiding the recurrence of expensive data collection. To advance the field, it is crucial to generate large-scale datasets. Compositional RL is particularly appealing for generating such large datasets, since 1) it permits creating many tasks from few components, 2) the task structure may enable trained agents to solve new tasks by combining relevant learned components, and 3) the compositional dimensions provide a notion of task relatedness. This paper provides four offline RL datasets for simulated robotic manipulation created using the $256$ tasks from CompoSuite (Mendez et al., 2022). Each dataset is collected from an agent with a different degree of performance, and consists of $256$ million transitions. We provide training and evaluation settings for assessing an agent's ability to learn compositional task policies. Our benchmarking experiments show that current offline RL methods can learn the training tasks to some extent and that compositional methods outperform non-compositional methods. Yet, current methods are unable to extract the compositional structure to generalize to unseen tasks highlighting a need for future research in offline compositional RL.


Citation Information:

Marcel Hussing, Jorge Mendez-Mendez, Anisha Singrodia, Cassandra Kent, and Eric Eaton. "Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning." Reinforcement Learning Journal, vol. 2, 2024, pp. 979–994.

BibTeX:

@article{hussing2024robotic,
    title={Robotic Manipulation Datasets for Offline Compositional Reinforcement Learning},
    author={Hussing, Marcel and Mendez-Mendez, Jorge and Singrodia, Anisha and Kent, Cassandra and Eaton, Eric},
    journal={Reinforcement Learning Journal},
    volume={2},
    pages={979--994},
    year={2024}
}