2203.14712
Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities
28 March 2022
Fadime Sener
Dibyadip Chatterjee
Daniel Shelepov
Kun He
Dipika Singhania
Robert Y. Wang
Angela Yao
VGen
Papers citing "Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities"
32 / 132 papers shown
HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count
N. Wiederhold
Ava Megyeri
DiMaggio Paris
Sean Banerjee
N. Banerjee
18
9
0
01 Oct 2023
A Survey on Deep Learning Techniques for Action Anticipation
Zeyun Zhong
Manuel Martin
Michael Voit
Juergen Gall
Jürgen Beyerer
24
7
0
29 Sep 2023
HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
Linghao Yang
Taein Kwon
Mahdi Rad
Bowen Pan
Ishani Chakraborty
...
Ashley Feniello
Rui Tian
Felipe Vieira Frujeri
Neel Joshi
Marc Pollefeys
EgoV
31
49
0
29 Sep 2023
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios
Francesco Ragusa
Rosario Leonardi
Michele Mazzamuto
Claudia Bonanno
Rosario Scavo
Antonino Furnari
G. Farinella
30
7
0
26 Sep 2023
Opening the Vocabulary of Egocentric Actions
Dibyadip Chatterjee
Fadime Sener
Shugao Ma
Angela Yao
VLM
41
16
0
22 Aug 2023
How Much Temporal Long-Term Context is Needed for Action Segmentation?
Emad Bahrami Rad
Gianpiero Francesca
Juergen Gall
ViT
24
25
0
22 Aug 2023
An Outlook into the Future of Egocentric Vision
Chiara Plizzari
Gabriele Goletto
Antonino Furnari
Siddhant Bansal
Francesco Ragusa
G. Farinella
Dima Damen
Tatiana Tommasi
EgoV
40
38
0
14 Aug 2023
Every Mistake Counts in Assembly
Guodong Ding
Fadime Sener
Shugao Ma
Angela Yao
32
12
0
31 Jul 2023
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?
Qi Zhao
Shijie Wang
Ce Zhang
Changcheng Fu
Minh Quan Do
Nakul Agarwal
Kwonjoon Lee
Chen Sun
LM&Ro
51
49
0
31 Jul 2023
POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical Activities
Rui Wang
S. Ktistakis
Siwei Zhang
Mirko Meboldt
Q. Lohmeyer
30
11
0
19 Jul 2023
Fusing Hand and Body Skeletons for Human Action Recognition in Assembly
Dustin Aganian
Mona Köhler
Benedict Stephan
M. Eisenbach
H. Groß
16
2
0
18 Jul 2023
Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition
Yuhang Wen
Zixuan Tang
Yunsheng Pang
Beichen Ding
Mengyuan Liu
26
21
0
14 Jul 2023
HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge Understanding
Hao Zheng
R. Lee
Yuqian Lu
VGen
17
16
0
09 Jul 2023
Self-supervised Optimization of Hand Pose Estimation using Anatomical Features and Iterative Learning
Christian Jauch
Timo Leitritz
Marco Huber
3DH
35
0
0
06 Jul 2023
Action Anticipation with Goal Consistency
Olga Zatsarynna
Juergen Gall
25
10
0
26 Jun 2023
Exploiting Multimodal Synthetic Data for Egocentric Human-Object Interaction Detection in an Industrial Scenario
Rosario Leonardi
Francesco Ragusa
Antonino Furnari
G. Farinella
12
12
0
21 Jun 2023
How Object Information Improves Skeleton-based Human Action Recognition in Assembly Tasks
Dustin Aganian
Mona Köhler
Sebastian Baake
M. Eisenbach
H. Groß
41
7
0
09 Jun 2023
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment
Zihui Xue
Kristen Grauman
EgoV
38
31
0
08 Jun 2023
Learning to Ground Instructional Articles in Videos through Narrations
E. Mavroudi
Triantafyllos Afouras
Lorenzo Torresani
DiffM
35
22
0
06 Jun 2023
AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation
Takehiko Ohkawa
Kun He
Fadime Sener
Tomás Hodan
Luan Tran
Cem Keskin
27
38
0
24 Apr 2023
ATTACH Dataset: Annotated Two-Handed Assembly Actions for Human Action Understanding
Dustin Aganian
Benedict Stephan
M. Eisenbach
Corinna Stretz
H. Groß
19
11
0
17 Apr 2023
Learning and Verification of Task Structure in Instructional Videos
Medhini Narasimhan
Licheng Yu
Sean Bell
Ning Zhang
Trevor Darrell
68
19
0
23 Mar 2023
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos
Sixun Dong
Huazhang Hu
Dongze Lian
Weixin Luo
Yichen Qian
Shenghua Gao
ViT
AI4TS
23
11
0
22 Mar 2023
HaMuCo: Hand Pose Estimation via Multiview Collaborative Self-Supervised Learning
Xiaozheng Zheng
Chao Wen
Zhou Xue
Pengfei Ren
Jingyu Wang
3DH
29
9
0
02 Feb 2023
C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation
Dipika Singhania
R. Rahaman
Angela Yao
14
28
0
20 Dec 2022
OpenPack: A Large-scale Dataset for Recognizing Packaging Works in IoT-enabled Logistic Environments
Naoya Yoshimura
Jaime Morales
T. Maekawa
Takahiro Hara
22
19
0
10 Dec 2022
Human in the loop approaches in multi-modal conversational task guidance system development
R. Manuvinakurike
Sovan Biswas
G. Raffa
R. Beckwith
A. Rhodes
Meng Shi
Gesem Gudino Mejia
Saurav Sahay
L. Nachman
38
2
0
03 Nov 2022
Temporal Action Segmentation: An Analysis of Modern Techniques
Guodong Ding
Fadime Sener
Angela Yao
47
74
0
19 Oct 2022
MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain
Francesco Ragusa
Antonino Furnari
G. Farinella
EgoV
40
23
0
19 Sep 2022
Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey
Takehiko Ohkawa
Ryosuke Furuta
Yoichi Sato
3DH
27
20
0
05 Jun 2022
ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation
Zicong Fan
Omid Taheri
Dimitrios Tzionas
Muhammed Kocabas
Manuel Kaufmann
Michael J. Black
Otmar Hilliges
41
147
0
28 Apr 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
244
1,024
0
13 Oct 2021