2211.10869
UniMASK: Unified Inference in Sequential Decision Problems
20 November 2022
Micah Carroll
Orr Paradise
Jessy Lin
Raluca Georgescu
Mingfei Sun
David Bignell
Stephanie Milani
Katja Hofmann
Matthew J. Hausknecht
Anca Dragan
Sam Devlin
OffRL
Github (55★)
Papers citing "UniMASK: Unified Inference in Sequential Decision Problems"
37 / 37 papers shown
Title
Predictive Coding for Decision Transformer
Tung M. Luu
Donghoon Lee
Chang D. Yoo
OffRL
98
2
0
04 Oct 2024
Inference via Interpolation: Contrastive Representations Provably Enable Planning and Inference
Benjamin Eysenbach
Vivek Myers
Ruslan Salakhutdinov
Sergey Levine
AI4TS
112
12
0
06 Mar 2024
Masked Autoencoding for Scalable and Generalizable Decision Making
Fangchen Liu
Hao Liu
Aditya Grover
Pieter Abbeel
OffRL
83
48
0
23 Nov 2022
Foundation Posteriors for Approximate Probabilistic Inference
Mike Wu
Noah D. Goodman
UQCV
65
6
0
19 May 2022
A Generalist Agent
Scott E. Reed
Konrad Zolna
Emilio Parisotto
Sergio Gomez Colmenarejo
Alexander Novikov
...
Yutian Chen
R. Hadsell
Oriol Vinyals
Mahyar Bordbar
Nando de Freitas
LM&Ro
LLMAG
AI4CE
211
824
0
12 May 2022
InCoder: A Generative Model for Code Infilling and Synthesis
Daniel Fried
Armen Aghajanyan
Jessy Lin
Sida I. Wang
Eric Wallace
Freda Shi
Ruiqi Zhong
Wen-tau Yih
Luke Zettlemoyer
M. Lewis
SyDa
71
652
0
12 Apr 2022
TransDreamer: Reinforcement Learning with Transformer World Models
Changgu Chen
Yi-Fu Wu
Jaesik Yoon
Sungjin Ahn
OffRL
86
96
0
19 Feb 2022
MaskGIT: Masked Generative Image Transformer
Huiwen Chang
Han Zhang
Lu Jiang
Ce Liu
William T. Freeman
ViT
153
695
0
08 Feb 2022
CM3: A Causal Masked Multimodal Model of the Internet
Armen Aghajanyan
Po-Yao (Bernie) Huang
Candace Ross
Vladimir Karpukhin
Hu Xu
...
Dmytro Okhonko
Mandar Joshi
Gargi Ghosh
M. Lewis
Luke Zettlemoyer
86
158
0
19 Jan 2022
RvS: What is Essential for Offline RL via Supervised Learning?
Scott Emmons
Benjamin Eysenbach
Ilya Kostrikov
Sergey Levine
OffRL
73
183
0
20 Dec 2021
Assistive Tele-op: Leveraging Transformers to Collect Robotic Task Demonstrations
Henry M. Clever
Ankur Handa
H. Mazhar
Kevin Parker
Omer Shapira
Qian Wan
Yashraj S. Narang
Iretiayo Akinola
Maya Cakmak
Dieter Fox
63
18
0
09 Dec 2021
Generalized Decision Transformer for Offline Hindsight Information Matching
Hiroki Furuta
Y. Matsuo
S. Gu
OffRL
67
103
0
19 Nov 2021
Masked Autoencoders Are Scalable Vision Learners
Kaiming He
Xinlei Chen
Saining Xie
Yanghao Li
Piotr Dollár
Ross B. Girshick
ViT
TPM
467
7,814
0
11 Nov 2021
Teaching Autoregressive Language Models Complex Tasks By Demonstration
Gabriel Recchia
65
22
0
05 Sep 2021
Offline Reinforcement Learning as One Big Sequence Modeling Problem
Michael Janner
Qiyang Li
Sergey Levine
OffRL
158
685
0
03 Jun 2021
Decision Transformer: Reinforcement Learning via Sequence Modeling
Lili Chen
Kevin Lu
Aravind Rajeswaran
Kimin Lee
Aditya Grover
Michael Laskin
Pieter Abbeel
A. Srinivas
Igor Mordatch
OffRL
136
1,658
0
02 Jun 2021
Hyperparameter Selection for Imitation Learning
Léonard Hussenot
Marcin Andrychowicz
Damien Vincent
Robert Dadashi
Anton Raichuk
...
Sabela Ramos
Manu Orsini
Olivier Bachem
Matthieu Geist
Olivier Pietquin
87
18
0
25 May 2021
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
670
41,369
0
22 Oct 2020
PMI-Masking: Principled masking of correlated spans
Yoav Levine
Barak Lenz
Opher Lieber
Omri Abend
Kevin Leyton-Brown
Moshe Tennenholtz
Y. Shoham
56
73
0
05 Oct 2020
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
853
42,332
0
28 May 2020
Planning to Explore via Self-Supervised World Models
Ramanan Sekar
Oleh Rybkin
Kostas Daniilidis
Pieter Abbeel
Danijar Hafner
Deepak Pathak
SSL
77
412
0
12 May 2020
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems
Sergey Levine
Aviral Kumar
George Tucker
Justin Fu
OffRL
GP
564
2,040
0
04 May 2020
D4RL: Datasets for Deep Data-Driven Reinforcement Learning
Justin Fu
Aviral Kumar
Ofir Nachum
George Tucker
Sergey Levine
GP
OffRL
229
1,381
0
15 Apr 2020
Stabilizing Transformers for Reinforcement Learning
Emilio Parisotto
H. F. Song
Jack W. Rae
Razvan Pascanu
Çağlar Gülçehre
...
Aidan Clark
Seb Noury
M. Botvinick
N. Heess
R. Hadsell
OffRL
91
366
0
13 Oct 2019
Learning Calibratable Policies using Programmatic Style-Consistency
Eric Zhan
Albert Tseng
Yisong Yue
Adith Swaminathan
Matthew J. Hausknecht
54
18
0
02 Oct 2019
XLNet: Generalized Autoregressive Pretraining for Language Understanding
Zhilin Yang
Zihang Dai
Yiming Yang
J. Carbonell
Ruslan Salakhutdinov
Quoc V. Le
AI4CE
234
8,444
0
19 Jun 2019
Goal-conditioned Imitation Learning
Yiming Ding
Carlos Florensa
Mariano Phielipp
Pieter Abbeel
64
227
0
13 Jun 2019
A Generalized Framework of Sequence Generation with Application to Undirected Sequence Models
Elman Mansimov
Alex Jinpeng Wang
Sean Welleck
Kyunghyun Cho
AIMat
47
46
0
29 May 2019
PRECOG: PREdiction Conditioned On Goals in Visual Multi-Agent Settings
Nicholas Rhinehart
R. McAllister
Kris Kitani
Sergey Levine
66
374
0
03 May 2019
Preferences Implicit in the State of the World
Rohin Shah
Dmitrii Krasheninnikov
Jordan Alexander
Pieter Abbeel
Anca Dragan
65
55
0
12 Feb 2019
BERT has a Mouth, and It Must Speak: BERT as a Markov Random Field Language Model
Alex Jinpeng Wang
Kyunghyun Cho
VLM
88
358
0
11 Feb 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,175
0
11 Oct 2018
World Models
David R Ha
Jürgen Schmidhuber
SyDa
143
1,098
0
27 Mar 2018
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
730
132,363
0
12 Jun 2017
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
113
2,449
0
15 May 2017
Transfer from Simulation to Real World through Learning Deep Inverse Dynamics Model
Paul Christiano
Zain Shah
Igor Mordatch
Jonas Schneider
T. Blackwell
Joshua Tobin
Pieter Abbeel
Wojciech Zaremba
PINN
75
250
0
11 Oct 2016
Neural Autoregressive Distribution Estimation
Benigno Uria
Marc-Alexandre Côté
Karol Gregor
Iain Murray
Hugo Larochelle
81
314
0
07 May 2016