Imitating Task and Motion Planning with Visuomotor Transformers

Dalal, Murtaza; Mandlekar, Ajay; Garrett, Caelan; Handa, Ankur; Salakhutdinov, Ruslan; Fox, Dieter

Computer Science > Robotics

arXiv:2305.16309 (cs)

[Submitted on 25 May 2023 (v1), last revised 17 Oct 2023 (this version, v3)]

Title:Imitating Task and Motion Planning with Visuomotor Transformers

Authors:Murtaza Dalal, Ajay Mandlekar, Caelan Garrett, Ankur Handa, Ruslan Salakhutdinov, Dieter Fox

View PDF

Abstract:Imitation learning is a powerful tool for training robot manipulation policies, allowing them to learn from expert demonstrations without manual programming or trial-and-error. However, common methods of data collection, such as human supervision, scale poorly, as they are time-consuming and labor-intensive. In contrast, Task and Motion Planning (TAMP) can autonomously generate large-scale datasets of diverse demonstrations. In this work, we show that the combination of large-scale datasets generated by TAMP supervisors and flexible Transformer models to fit them is a powerful paradigm for robot manipulation. To that end, we present a novel imitation learning system called OPTIMUS that trains large-scale visuomotor Transformer policies by imitating a TAMP agent. OPTIMUS introduces a pipeline for generating TAMP data that is specifically curated for imitation learning and can be used to train performant transformer-based policies. In this paper, we present a thorough study of the design decisions required to imitate TAMP and demonstrate that OPTIMUS can solve a wide variety of challenging vision-based manipulation tasks with over 70 different objects, ranging from long-horizon pick-and-place tasks, to shelf and articulated object manipulation, achieving 70 to 80% success rates. Video results and code at this https URL

Comments:	Conference on Robot Learning (CoRL) 2023. 8 pages, 5 figures, 2 tables; 11 pages appendix (10 additional figures)
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (cs.LG)
Cite as:	arXiv:2305.16309 [cs.RO]
	(or arXiv:2305.16309v3 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2305.16309

Submission history

From: Murtaza Dalal [view email]
[v1] Thu, 25 May 2023 17:58:14 UTC (32,195 KB)
[v2] Fri, 29 Sep 2023 22:27:49 UTC (23,348 KB)
[v3] Tue, 17 Oct 2023 16:34:46 UTC (23,348 KB)

Computer Science > Robotics

Title:Imitating Task and Motion Planning with Visuomotor Transformers

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:Imitating Task and Motion Planning with Visuomotor Transformers

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators