Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities

28 March 2022

Angela Yao

Papers citing "Assembly101: A Large-Scale Multi-View Video Dataset for Understanding Procedural Activities"

32 / 132 papers shown

HOH: Markerless Multimodal Human-Object-Human Handover Dataset with Large Object Count N. Wiederhold Ava Megyeri DiMaggio Paris Sean Banerjee N. Banerjee 18 9 0 01 Oct 2023
A Survey on Deep Learning Techniques for Action Anticipation Zeyun Zhong Manuel Martin Michael Voit Juergen Gall Jürgen Beyerer 24 7 0 29 Sep 2023
HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World Linghao Yang Taein Kwon Mahdi Rad Bowen Pan Ishani Chakraborty ... Ashley Feniello Rui Tian Felipe Vieira Frujeri Neel Joshi Marc Pollefeys EgoV 31 49 0 29 Sep 2023
ENIGMA-51: Towards a Fine-Grained Understanding of Human-Object Interactions in Industrial Scenarios Francesco Ragusa Rosario Leonardi Michele Mazzamuto Claudia Bonanno Rosario Scavo Antonino Furnari G. Farinella 30 7 0 26 Sep 2023
Opening the Vocabulary of Egocentric Actions Dibyadip Chatterjee Fadime Sener Shugao Ma Angela Yao VLM 41 16 0 22 Aug 2023
How Much Temporal Long-Term Context is Needed for Action Segmentation? Emad Bahrami Rad Gianpiero Francesca Juergen Gall ViT 24 25 0 22 Aug 2023
An Outlook into the Future of Egocentric Vision Chiara Plizzari Gabriele Goletto Antonino Furnari Siddhant Bansal Francesco Ragusa G. Farinella Dima Damen Tatiana Tommasi EgoV 40 38 0 14 Aug 2023
Every Mistake Counts in Assembly Guodong Ding Fadime Sener Shugao Ma Angela Yao 32 12 0 31 Jul 2023
AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos? Qi Zhao Shijie Wang Ce Zhang Changcheng Fu Minh Quan Do Nakul Agarwal Kwonjoon Lee Chen Sun LM&Ro 51 49 0 31 Jul 2023
POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical Activities Rui Wang S. Ktistakis Siwei Zhang Mirko Meboldt Q. Lohmeyer 30 11 0 19 Jul 2023
Fusing Hand and Body Skeletons for Human Action Recognition in Assembly Dustin Aganian Mona Köhler Benedict Stephan M. Eisenbach H. Groß 16 2 0 18 Jul 2023
Interactive Spatiotemporal Token Attention Network for Skeleton-based General Interactive Action Recognition Yuhang Wen Zixuan Tang Yunsheng Pang Beichen Ding Mengyuan Liu 26 21 0 14 Jul 2023
HA-ViD: A Human Assembly Video Dataset for Comprehensive Assembly Knowledge Understanding Hao Zheng R. Lee Yuqian Lu VGen 17 16 0 09 Jul 2023
Self-supervised Optimization of Hand Pose Estimation using Anatomical Features and Iterative Learning Christian Jauch Timo Leitritz Marco Huber 3DH 35 0 0 06 Jul 2023
Action Anticipation with Goal Consistency Olga Zatsarynna Juergen Gall 25 10 0 26 Jun 2023
Exploiting Multimodal Synthetic Data for Egocentric Human-Object Interaction Detection in an Industrial Scenario Rosario Leonardi Francesco Ragusa Antonino Furnari G. Farinella 12 12 0 21 Jun 2023
How Object Information Improves Skeleton-based Human Action Recognition in Assembly Tasks Dustin Aganian Mona Köhler Sebastian Baake M. Eisenbach H. Groß 41 7 0 09 Jun 2023
Learning Fine-grained View-Invariant Representations from Unpaired Ego-Exo Videos via Temporal Alignment Zihui Xue Kristen Grauman EgoV 38 31 0 08 Jun 2023
Learning to Ground Instructional Articles in Videos through Narrations E. Mavroudi Triantafyllos Afouras Lorenzo Torresani DiffM 35 22 0 06 Jun 2023
AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation Takehiko Ohkawa Kun He Fadime Sener Tomás Hodan Luan Tran Cem Keskin 27 38 0 24 Apr 2023
ATTACH Dataset: Annotated Two-Handed Assembly Actions for Human Action Understanding Dustin Aganian Benedict Stephan M. Eisenbach Corinna Stretz H. Groß 19 11 0 17 Apr 2023
Learning and Verification of Task Structure in Instructional Videos Medhini Narasimhan Licheng Yu Sean Bell Ning Zhang Trevor Darrell 68 19 0 23 Mar 2023
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos Sixun Dong Huazhang Hu Dongze Lian Weixin Luo Yichen Qian Shenghua Gao ViT AI4TS 23 11 0 22 Mar 2023
HaMuCo: Hand Pose Estimation via Multiview Collaborative Self-Supervised Learning Xiaozheng Zheng Chao Wen Zhou Xue Pengfei Ren Jingyu Wang 3DH 29 9 0 02 Feb 2023
C2F-TCN: A Framework for Semi and Fully Supervised Temporal Action Segmentation Dipika Singhania R. Rahaman Angela Yao 14 28 0 20 Dec 2022
OpenPack: A Large-scale Dataset for Recognizing Packaging Works in IoT-enabled Logistic Environments Naoya Yoshimura Jaime Morales T. Maekawa Takahiro Hara 22 19 0 10 Dec 2022
Human in the loop approaches in multi-modal conversational task guidance system development R. Manuvinakurike Sovan Biswas G. Raffa R. Beckwith A. Rhodes Meng Shi Gesem Gudino Mejia Saurav Sahay L. Nachman 38 2 0 03 Nov 2022
Temporal Action Segmentation: An Analysis of Modern Techniques Guodong Ding Fadime Sener Angela Yao 47 74 0 19 Oct 2022
MECCANO: A Multimodal Egocentric Dataset for Humans Behavior Understanding in the Industrial-like Domain Francesco Ragusa Antonino Furnari G. Farinella EgoV 40 23 0 19 Sep 2022
Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey Takehiko Ohkawa Ryosuke Furuta Yoichi Sato 3DH 27 20 0 05 Jun 2022
ARCTIC: A Dataset for Dexterous Bimanual Hand-Object Manipulation Zicong Fan Omid Taheri Dimitrios Tzionas Muhammed Kocabas Manuel Kaufmann Michael J. Black Otmar Hilliges 41 147 0 28 Apr 2022
Ego4D: Around the World in 3,000 Hours of Egocentric Video Kristen Grauman Andrew Westbury Eugene Byrne Zachary Chavis Antonino Furnari ... Mike Zheng Shou Antonio Torralba Lorenzo Torresani Mingfei Yan Jitendra Malik EgoV 244 1,024 0 13 Oct 2021