ResearchTrend.AI
Discovering Objects that Can Move

18 March 2022
Zhipeng Bao
P. Tokmakov
Allan Jabri
Yu-Xiong Wang
Adrien Gaidon
M. Hebert
    OCL
arXiv: 2203.10159

Papers citing "Discovering Objects that Can Move"

49 / 49 papers shown
Slot-BERT: Self-supervised Object Discovery in Surgical Video
Guiqiu Liao
M. Jogan
Marcel Hussing
Kenta Nakahashi
Kazuhiro Yasufuku
Amin Madani
Eric Eaton
Daniel A. Hashimoto
438
0
0
21 Jan 2025
Unsupervised Foreground Extraction via Deep Region Competition
Peiyu Yu
Sirui Xie
Xiaojian Ma
Yixin Zhu
Ying Nian Wu
Song-Chun Zhu
OCL
62
42
0
29 Oct 2021
Unsupervised Object Learning via Common Fate
Matthias Tangemann
Steffen Schneider
Julius von Kügelgen
Francesco Locatello
Peter V. Gehler
Thomas Brox
Matthias Kümmerer
Matthias Bethge
Bernhard Schölkopf
OCL
75
25
0
13 Oct 2021
SMURF: Self-Teaching Multi-Frame Unsupervised RAFT with Full-Image Warping
Austin Stone
Daniel Maurer
Alper Ayvaci
A. Angelova
Rico Jonschkowski
98
84
0
14 May 2021
VideoLT: Large-scale Long-tailed Video Recognition
Xing Zhang
Zuxuan Wu
Zejia Weng
Huazhu Fu
Jingjing Chen
Yu-Gang Jiang
Larry S. Davis
80
42
0
06 May 2021
Self-supervised Video Object Segmentation by Motion Grouping
Charig Yang
Hala Lamdouar
Erika Lu
Andrew Zisserman
Weidi Xie
VOS OCL
80
162
0
15 Apr 2021
Image-Level or Object-Level? A Tale of Two Resampling Strategies for Long-Tailed Detection
Nadine Chang
Zhiding Yu
Yu-Xiong Wang
Anima Anandkumar
Sanja Fidler
J. Álvarez
89
39
0
12 Apr 2021
Distributional Robustness Loss for Long-tail Learning
Dvir Samuel
Gal Chechik
OOD
82
100
0
07 Apr 2021
Decomposing 3D Scenes into Objects via Unsupervised Volume Segmentation
Karl Stelzner
Kristian Kersting
Adam R. Kosiorek
126
108
0
02 Apr 2021
Exploring Simple Siamese Representation Learning
Xinlei Chen
Kaiming He
SSL
258
4,072
0
20 Nov 2020
Unsupervised Discovery of 3D Physical Objects from Video
Yilun Du
Kevin A. Smith
Tomer Ullman
J. Tenenbaum
Jiajun Wu
OCL
169
38
0
24 Jul 2020
Unsupervised object-centric video generation and decomposition in 3D
Paul Henderson
Christoph H. Lampert
OCL
99
36
0
07 Jul 2020
Object-Centric Learning with Slot Attention
Francesco Locatello
Dirk Weissenborn
Thomas Unterthiner
Aravindh Mahendran
G. Heigold
Jakob Uszkoreit
Alexey Dosovitskiy
Thomas Kipf
OCL
225
856
0
26 Jun 2020
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT 3DV PINN
432
13,094
0
26 May 2020
RAFT: Recurrent All-Pairs Field Transforms for Optical Flow
Zachary Teed
Jia Deng
MDE
244
2,644
0
26 Mar 2020
SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition
Zhixuan Lin
Yi-Fu Wu
Skand Peri
Weihao Sun
Gautam Singh
Fei Deng
Jindong Jiang
Sungjin Ahn
BDL OCL 3DPC
168
250
0
08 Jan 2020
Dota 2 with Large Scale Deep Reinforcement Learning
OpenAI
Christopher Berner
Greg Brockman
Brooke Chan
...
Szymon Sidor
Ilya Sutskever
Jie Tang
Filip Wolski
Susan Zhang
GNN VLM CLL AI4CE LRM
169
1,835
0
13 Dec 2019
Entity Abstraction in Visual Model-Based Reinforcement Learning
Rishi Veerapaneni
John D. Co-Reyes
Michael Chang
Michael Janner
Chelsea Finn
Jiajun Wu
J. Tenenbaum
Sergey Levine
OCL OffRL
87
189
0
28 Oct 2019
Decoupling Representation and Classifier for Long-Tailed Recognition
Bingyi Kang
Saining Xie
Marcus Rohrbach
Zhicheng Yan
Albert Gordo
Jiashi Feng
Yannis Kalantidis
OODD
180
1,221
0
21 Oct 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Rohit Girdhar
Deva Ramanan
72
178
0
10 Oct 2019
SCALOR: Generative World Models with Scalable Object Representations
Jindong Jiang
Sepehr Janghorbani
Gerard de Melo
Sungjin Ahn
OCL DRL
90
133
0
06 Oct 2019
GENESIS: Generative Scene Inference and Sampling with Object-Centric Latent Representations
Martin Engelcke
Adam R. Kosiorek
Oiwi Parker Jones
Ingmar Posner
OCL
124
307
0
30 Jul 2019
6-DOF GraspNet: Variational Grasp Generation for Object Manipulation
Arsalan Mousavian
Clemens Eppner
Dieter Fox
3DPC
88
565
0
25 May 2019
Multi-Object Representation Learning with Iterative Variational Inference
Klaus Greff
Raphael Lopez Kaufman
Rishabh Kabra
Nicholas Watters
Christopher P. Burgess
Daniel Zoran
Loic Matthey
M. Botvinick
Alexander Lerchner
OCL SSL
106
509
0
01 Mar 2019
Towards Segmenting Anything That Moves
Achal Dave
P. Tokmakov
Deva Ramanan
72
87
0
11 Feb 2019
MONet: Unsupervised Scene Decomposition and Representation
Christopher P. Burgess
Loic Matthey
Nicholas Watters
Rishabh Kabra
I. Higgins
M. Botvinick
Alexander Lerchner
OCL
88
529
0
22 Jan 2019
A Structured Model For Action Detection
Yubo Zhang
P. Tokmakov
M. Hebert
Cordelia Schmid
82
101
0
09 Dec 2018
Object Discovery in Videos as Foreground Motion Clustering
Christopher Xie
Yu Xiang
Zaïd Harchaoui
Dieter Fox
VOS
83
70
0
06 Dec 2018
Learning Human-Object Interactions by Graph Parsing Neural Networks
Siyuan Qi
Wenguan Wang
Baoxiong Jia
Jianbing Shen
Song-Chun Zhu
GNN
82
537
0
23 Aug 2018
Learning to Segment Moving Objects
P. Tokmakov
Cordelia Schmid
Karteek Alahari
VOS
73
97
0
01 Dec 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
730
132,363
0
12 Jun 2017
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
128
3,685
0
08 Jun 2017
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
360
27,244
0
20 Mar 2017
Weakly Supervised Semantic Segmentation using Web-Crawled Videos
Seunghoon Hong
Donghun Yeo
Suha Kwak
Honglak Lee
Bohyung Han
107
158
0
02 Jan 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
Justin Johnson
B. Hariharan
Laurens van der Maaten
Li Fei-Fei
C. L. Zitnick
Ross B. Girshick
CoGe
313
2,387
0
20 Dec 2016
Learning Features by Watching Objects Move
Deepak Pathak
Ross B. Girshick
Piotr Dollár
Trevor Darrell
Bharath Hariharan
SSL VOS OCL
77
526
0
19 Dec 2016
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
Eddy Ilg
N. Mayer
Tonmoy Saikia
Margret Keuper
Alexey Dosovitskiy
Thomas Brox
3DPC
255
3,081
0
06 Dec 2016
Tagger: Deep Unsupervised Perceptual Grouping
Klaus Greff
Antti Rasmus
Mathias Berglund
T. Hao
Jürgen Schmidhuber
Harri Valpola
OCL
79
161
0
21 Jun 2016
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models
S. M. Ali Eslami
N. Heess
T. Weber
Yuval Tassa
David Szepesvari
Koray Kavukcuoglu
Geoffrey E. Hinton
3DV BDL OCL
129
551
0
28 Mar 2016
Weakly-Supervised Semantic Segmentation using Motion Cues
P. Tokmakov
Karteek Alahari
Cordelia Schmid
75
56
0
23 Mar 2016
Identity Mappings in Deep Residual Networks
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
354
10,192
0
16 Mar 2016
A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation
N. Mayer
Eddy Ilg
Philip Häusser
Philipp Fischer
Daniel Cremers
Alexey Dosovitskiy
Thomas Brox
3DPC
67
2,648
0
07 Dec 2015
Delving Deeper into Convolutional Networks for Learning Video Representations
Nicolas Ballas
L. Yao
C. Pal
Aaron Courville
MDE
90
701
0
19 Nov 2015
End-to-end people detection in crowded scenes
Russell Stewart
Mykhaylo Andriluka
73
544
0
16 Jun 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat ObjD
525
62,360
0
04 Jun 2015
Learning to See by Moving
Pulkit Agrawal
João Carreira
Jitendra Malik
SSL
80
555
0
07 May 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.0K
150,312
0
22 Dec 2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
1.1K
23,370
0
03 Jun 2014
Auto-Encoding Variational Bayes
Diederik P. Kingma
Max Welling
BDL
455
16,923
0
20 Dec 2013