ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.01172
  4. Cited By
Procedure Planning in Instructional Videos

Procedure Planning in Instructional Videos

2 July 2019
C. Chang
De-An Huang
Danfei Xu
Ehsan Adeli
Li Fei-Fei
Juan Carlos Niebles
ArXivPDFHTML

Papers citing "Procedure Planning in Instructional Videos"

40 / 40 papers shown
Title
Predicting Implicit Arguments in Procedural Video Instructions
Predicting Implicit Arguments in Procedural Video Instructions
Anil Batra
Laura Sevilla-Lara
Marcus Rohrbach
Frank Keller
23
0
0
27 May 2025
Leveraging Surgical Activity Grammar for Primary Intention Prediction in Laparoscopy Procedures
Leveraging Surgical Activity Grammar for Primary Intention Prediction in Laparoscopy Procedures
Jie Zhang
Song Zhou
Yiwei Wang
Chidan Wan
Huan Zhao
Xiong Cai
Han Ding
53
0
0
29 Sep 2024
ExpertAF: Expert Actionable Feedback from Video
ExpertAF: Expert Actionable Feedback from Video
Kumar Ashutosh
Tushar Nagarajan
Georgios Pavlakos
Kris Kitani
Kristen Grauman
VGen
77
2
0
01 Aug 2024
Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains
Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains
Rohan Myer Krishnan
Zitian Tang
Zhiqiu Yu
Chen Sun
84
1
0
30 Nov 2023
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
Hanlin Wang
Yilu Wu
Sheng Guo
Limin Wang
VGen
DiffM
106
30
0
26 Mar 2023
Uncertainty-Aware Anticipation of Activities
Uncertainty-Aware Anticipation of Activities
Yazan Abu Farha
Juergen Gall
53
48
0
26 Aug 2019
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million
  Narrated Video Clips
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
Antoine Miech
Dimitri Zhukov
Jean-Baptiste Alayrac
Makarand Tapaswi
Ivan Laptev
Josef Sivic
VGen
87
1,186
0
07 Jun 2019
What Would You Expect? Anticipating Egocentric Actions with
  Rolling-Unrolling LSTMs and Modality Attention
What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention
Antonino Furnari
G. Farinella
EgoV
97
173
0
22 May 2019
A Variational Auto-Encoder Model for Stochastic Point Processes
A Variational Auto-Encoder Model for Stochastic Point Processes
Nazanin Mehrasa
Akash Abdu Jyothi
Thibaut Durand
Jiawei He
Leonid Sigal
Greg Mori
DRL
34
56
0
05 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLM
SSL
39
1,238
0
03 Apr 2019
Cross-task weakly supervised learning from instructional videos
Cross-task weakly supervised learning from instructional videos
Dimitri Zhukov
Jean-Baptiste Alayrac
R. G. Cinbis
David Fouhey
Ivan Laptev
Josef Sivic
SSL
105
245
0
19 Mar 2019
COIN: A Large-scale Dataset for Comprehensive Instructional Video
  Analysis
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Yansong Tang
Dajun Ding
Yongming Rao
Yu Zheng
Danyang Zhang
Lili Zhao
Jiwen Lu
Jie Zhou
98
308
0
07 Mar 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly
  Supervised Action Alignment and Segmentation
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
C. Chang
De-An Huang
Yanan Sui
Li Fei-Fei
Juan Carlos Niebles
88
156
0
09 Jan 2019
Zero-Shot Anticipation for Instructional Activities
Zero-Shot Anticipation for Instructional Activities
Fadime Sener
Angela Yao
LM&Ro
101
68
0
06 Dec 2018
Learning Latent Dynamics for Planning from Pixels
Learning Latent Dynamics for Planning from Pixels
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
63
1,416
0
12 Nov 2018
Time-Agnostic Prediction: Predicting Predictable Video Frames
Time-Agnostic Prediction: Predicting Predictable Video Frames
Dinesh Jayaraman
F. Ebert
Alexei A. Efros
Sergey Levine
48
93
0
23 Aug 2018
Learning Plannable Representations with Causal InfoGAN
Learning Plannable Representations with Causal InfoGAN
Thanard Kurutach
Aviv Tamar
Ge Yang
Stuart J. Russell
Pieter Abbeel
GAN
DRL
49
180
0
24 Jul 2018
Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video
  Demonstration
Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration
De-An Huang
Suraj Nair
Danfei Xu
Yuke Zhu
Animesh Garg
Li Fei-Fei
Silvio Savarese
Juan Carlos Niebles
41
140
0
10 Jul 2018
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning
Alexander Richard
Hilde Kuehne
Ahsan Iqbal
Juergen Gall
60
137
0
17 May 2018
Stochastic Adversarial Video Prediction
Stochastic Adversarial Video Prediction
Alex X. Lee
Richard Y. Zhang
F. Ebert
Pieter Abbeel
Chelsea Finn
Sergey Levine
DRL
VGen
GAN
46
450
0
04 Apr 2018
When will you do what? - Anticipating Temporal Occurrences of Activities
When will you do what? - Anticipating Temporal Occurrences of Activities
Yazan Abu Farha
Alexander Richard
Juergen Gall
52
190
0
03 Apr 2018
Universal Planning Networks
Universal Planning Networks
A. Srinivas
Allan Jabri
Pieter Abbeel
Sergey Levine
Chelsea Finn
SSL
58
145
0
02 Apr 2018
Who Let The Dogs Out? Modeling Dog Behavior From Visual Data
Who Let The Dogs Out? Modeling Dog Behavior From Visual Data
Kiana Ehsani
Hessam Bagherinezhad
Joseph Redmon
Roozbeh Mottaghi
Ali Farhadi
VGen
44
59
0
28 Mar 2018
Visual Forecasting by Imitating Dynamics in Natural Sequences
Visual Forecasting by Imitating Dynamics in Natural Sequences
Kuo-Hao Zeng
Bokui (William) Shen
De-An Huang
Min Sun
Juan Carlos Niebles
AI4TS
47
61
0
19 Aug 2017
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
96
2,416
0
15 May 2017
Time-Contrastive Networks: Self-Supervised Learning from Video
Time-Contrastive Networks: Self-Supervised Learning from Video
P. Sermanet
Corey Lynch
Yevgen Chebotar
Jasmine Hsu
Eric Jang
S. Schaal
Sergey Levine
SSL
85
822
0
23 Apr 2017
Towards Automatic Learning of Procedures from Web Instructional Videos
Towards Automatic Learning of Procedures from Web Instructional Videos
Luowei Zhou
Chenliang Xu
Jason J. Corso
EgoV
57
819
0
28 Mar 2017
Open Vocabulary Scene Parsing
Open Vocabulary Scene Parsing
Hang Zhao
Xavier Puig
Bolei Zhou
Sanja Fidler
Antonio Torralba
VLM
3DV
46
119
0
26 Mar 2017
Joint Discovery of Object States and Manipulation Actions
Joint Discovery of Object States and Manipulation Actions
Jean-Baptiste Alayrac
Josef Sivic
Ivan Laptev
Simon Lacoste-Julien
48
79
0
09 Feb 2017
First-Person Activity Forecasting with Online Inverse Reinforcement
  Learning
First-Person Activity Forecasting with Online Inverse Reinforcement Learning
Nicholas Rhinehart
Kris Kitani
EgoV
29
140
0
22 Dec 2016
Deep Visual Foresight for Planning Robot Motion
Deep Visual Foresight for Planning Robot Motion
Chelsea Finn
Sergey Levine
93
779
0
03 Oct 2016
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
De-An Huang
Li Fei-Fei
Juan Carlos Niebles
57
237
0
28 Jul 2016
Learning to Poke by Poking: Experiential Learning of Intuitive Physics
Learning to Poke by Poking: Experiential Learning of Intuitive Physics
Pulkit Agrawal
Ashvin Nair
Pieter Abbeel
Jitendra Malik
Sergey Levine
SSL
44
562
0
23 Jun 2016
Semi-supervised Vocabulary-informed Learning
Semi-supervised Vocabulary-informed Learning
Yanwei Fu
Leonid Sigal
VLM
36
132
0
24 Apr 2016
Deep Spatial Autoencoders for Visuomotor Learning
Deep Spatial Autoencoders for Visuomotor Learning
Chelsea Finn
X. Tan
Yan Duan
Trevor Darrell
Sergey Levine
Pieter Abbeel
SSL
37
551
0
21 Sep 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
74
852
0
31 Jul 2015
Unsupervised Semantic Parsing of Video Collections
Unsupervised Semantic Parsing of Video Collections
Ozan Sener
Amir Zamir
Silvio Savarese
Ashutosh Saxena
42
98
0
28 Jun 2015
Embed to Control: A Locally Linear Latent Dynamics Model for Control
  from Raw Images
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
Manuel Watter
Jost Tobias Springenberg
Joschka Boedecker
Martin Riedmiller
BDL
44
839
0
24 Jun 2015
What's Cookin'? Interpreting Cooking Videos using Text, Speech and
  Vision
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision
J. Malmaud
Jonathan Huang
V. Rathod
Nick Johnston
Andrew Rabinovich
Kevin Patrick Murphy
53
152
0
05 Mar 2015
Video (language) modeling: a baseline for generative models of natural
  videos
Video (language) modeling: a baseline for generative models of natural videos
MarcÁurelio Ranzato
Arthur Szlam
Joan Bruna
Michaël Mathieu
R. Collobert
S. Chopra
VGen
72
471
0
20 Dec 2014
1