2505.09430
Mini Diffuser: Fast Multi-task Diffusion Policy Training Using Two-level Mini-batches
14 May 2025
Yutong Hu
Pinhao Song
Kehan Wen
Renaud Detry
VLM
Papers cited by
"Mini Diffuser: Fast Multi-task Diffusion Policy Training Using Two-level Mini-batches"
30 / 30 papers shown
Title
Autoregressive Action Sequence Learning for Robotic Manipulation
Xinyu Zhang
Yuhan Liu
Haonan Chang
Liam Schramm
Abdeslam Boularias
88
16
0
04 Oct 2024
Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy
Ricardo Garcia
Shizhe Chen
Cordelia Schmid
LM&Ro
79
13
0
02 Oct 2024
OpenVLA: An Open-Source Vision-Language-Action Model
Moo Jin Kim
Karl Pertsch
Siddharth Karamcheti
Ted Xiao
Ashwin Balakrishna
...
Russ Tedrake
Dorsa Sadigh
Sergey Levine
Percy Liang
Chelsea Finn
LM&Ro
VLM
223
474
0
13 Jun 2024
RVT-2: Learning Precise Manipulation from Few Demonstrations
Ankit Goyal
Valts Blukis
Jie Xu
Yijie Guo
Yu-Wei Chao
Dieter Fox
67
53
0
12 Jun 2024
Octo: An Open-Source Generalist Robot Policy
Octo Model Team
Dibya Ghosh
Homer Walke
Karl Pertsch
Kevin Black
...
Quan Vuong
Ted Xiao
Dorsa Sadigh
Chelsea Finn
Sergey Levine
182
422
0
20 May 2024
Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation
Xiao Ma
Sumit Patidar
Iain Haughton
Stephen James
86
54
0
06 Mar 2024
3D Diffuser Actor: Policy Diffusion with 3D Scene Representations
Tsung-Wei Ke
N. Gkanatsios
Katerina Fragkiadaki
VGen
75
123
0
16 Feb 2024
Boosting Continuous Control with Consistency Policy
Yuhui Chen
Haoran Li
Dongbin Zhao
OffRL
68
26
0
10 Oct 2023
PolarNet: 3D Point Clouds for Language-Guided Robotic Manipulation
Shizhe Chen
Ricardo Garcia Pinel
Cordelia Schmid
Ivan Laptev
LM&Ro
3DPC
72
37
0
27 Sep 2023
RT-2: Vision-Language-Action Models Transfer Web Knowledge to Robotic Control
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Xi Chen
...
Ted Xiao
Peng Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
LRM
137
1,232
0
28 Jul 2023
Act3D: 3D Feature Field Transformers for Multi-Task Robotic Manipulation
Théophile Gervet
Zhou Xian
N. Gkanatsios
Katerina Fragkiadaki
78
74
0
30 Jun 2023
RVT: Robotic View Transformer for 3D Object Manipulation
Ankit Goyal
Jie Xu
Yijie Guo
Valts Blukis
Yu-Wei Chao
Dieter Fox
LM&Ro
100
136
0
26 Jun 2023
Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware
Tony Zhao
Vikash Kumar
Sergey Levine
Chelsea Finn
82
613
0
23 Apr 2023
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
329
7,278
0
05 Apr 2023
Goal-Conditioned Imitation Learning using Score-based Diffusion Policies
Moritz Reuss
M. Li
Xiaogang Jia
Rudolf Lioutikov
DiffM
125
172
0
05 Apr 2023
Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
Cheng Chi
Zhenjia Xu
S. Feng
Eric A. Cousineau
Yilun Du
Benjamin Burchfiel
Russ Tedrake
Shuran Song
347
1,175
0
07 Mar 2023
Consistency Models
Yang Song
Prafulla Dhariwal
Mark Chen
Ilya Sutskever
VLM
DiffM
97
948
0
02 Mar 2023
RT-1: Robotics Transformer for Real-World Control at Scale
Anthony Brohan
Noah Brown
Justice Carbajal
Yevgen Chebotar
Joseph Dabis
...
Ted Xiao
Peng Xu
Sichun Xu
Tianhe Yu
Brianna Zitkovich
LM&Ro
100
1,111
0
13 Dec 2022
Is Conditional Generative Modeling all you need for Decision-Making?
Anurag Ajay
Yilun Du
Abhi Gupta
J. Tenenbaum
Tommi Jaakkola
Pulkit Agrawal
DiffM
118
394
0
28 Nov 2022
Perceiver-Actor: A Multi-Task Transformer for Robotic Manipulation
Mohit Shridhar
Lucas Manuelli
Dieter Fox
LM&Ro
251
489
0
12 Sep 2022
Planning with Diffusion for Flexible Behavior Synthesis
Michael Janner
Yilun Du
J. Tenenbaum
Sergey Levine
DiffM
300
684
0
20 May 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
410
15,486
0
20 Dec 2021
Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
Stephen James
Kentaro Wada
Tristan Laidlow
Andrew J. Davison
60
130
0
23 Jun 2021
RoFormer: Enhanced Transformer with Rotary Position Embedding
Jianlin Su
Yu Lu
Shengfeng Pan
Ahmed Murtadha
Bo Wen
Yunfeng Liu
268
2,443
0
20 Apr 2021
Denoising Diffusion Implicit Models
Jiaming Song
Chenlin Meng
Stefano Ermon
VLM
DiffM
260
7,356
0
06 Oct 2020
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
613
18,036
0
19 Jun 2020
IRIS: Implicit Reinforcement without Interaction at Scale for Learning Control from Offline Robot Manipulation Data
Ajay Mandlekar
Fabio Ramos
Byron Boots
Silvio Savarese
Li Fei-Fei
Animesh Garg
Dieter Fox
OffRL
88
119
0
13 Nov 2019
RLBench: The Robot Learning Benchmark & Learning Environment
Stephen James
Z. Ma
David Rovick Arrojo
Andrew J. Davison
SSL
VLM
OffRL
104
551
0
26 Sep 2019
FiLM: Visual Reasoning with a General Conditioning Layer
Ethan Perez
Florian Strub
H. D. Vries
Vincent Dumoulin
Aaron Courville
FAtt
AIMat
OffRL
AI4CE
349
2,208
0
22 Sep 2017
U-Net: Convolutional Networks for Biomedical Image Segmentation
Olaf Ronneberger
Philipp Fischer
Thomas Brox
SSeg
3DV
1.8K
77,133
0
18 May 2015