ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1907.01172
  4. Cited By
Procedure Planning in Instructional Videos

Procedure Planning in Instructional Videos

2 July 2019
C. Chang
De-An Huang
Danfei Xu
Ehsan Adeli
Li Fei-Fei
Juan Carlos Niebles
ArXivPDFHTML

Papers citing "Procedure Planning in Instructional Videos"

40 / 40 papers shown
Title
Predicting Implicit Arguments in Procedural Video Instructions
Predicting Implicit Arguments in Procedural Video Instructions
Anil Batra
Laura Sevilla-Lara
Marcus Rohrbach
Frank Keller
11
0
0
27 May 2025
Leveraging Surgical Activity Grammar for Primary Intention Prediction in Laparoscopy Procedures
Leveraging Surgical Activity Grammar for Primary Intention Prediction in Laparoscopy Procedures
Jie Zhang
Song Zhou
Yiwei Wang
Chidan Wan
Huan Zhao
Xiong Cai
Han Ding
46
0
0
29 Sep 2024
ExpertAF: Expert Actionable Feedback from Video
ExpertAF: Expert Actionable Feedback from Video
Kumar Ashutosh
Tushar Nagarajan
Georgios Pavlakos
Kris Kitani
Kristen Grauman
VGen
71
2
0
01 Aug 2024
Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains
Spacewalk-18: A Benchmark for Multimodal and Long-form Procedural Video Understanding in Novel Domains
Rohan Myer Krishnan
Zitian Tang
Zhiqiu Yu
Chen Sun
78
1
0
30 Nov 2023
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
PDPP: Projected Diffusion for Procedure Planning in Instructional Videos
Hanlin Wang
Yilu Wu
Sheng Guo
Limin Wang
VGen
DiffM
89
30
0
26 Mar 2023
Uncertainty-Aware Anticipation of Activities
Uncertainty-Aware Anticipation of Activities
Yazan Abu Farha
Juergen Gall
44
48
0
26 Aug 2019
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million
  Narrated Video Clips
HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips
Antoine Miech
Dimitri Zhukov
Jean-Baptiste Alayrac
Makarand Tapaswi
Ivan Laptev
Josef Sivic
VGen
81
1,186
0
07 Jun 2019
What Would You Expect? Anticipating Egocentric Actions with
  Rolling-Unrolling LSTMs and Modality Attention
What Would You Expect? Anticipating Egocentric Actions with Rolling-Unrolling LSTMs and Modality Attention
Antonino Furnari
G. Farinella
EgoV
91
173
0
22 May 2019
A Variational Auto-Encoder Model for Stochastic Point Processes
A Variational Auto-Encoder Model for Stochastic Point Processes
Nazanin Mehrasa
Akash Abdu Jyothi
Thibaut Durand
Jiawei He
Leonid Sigal
Greg Mori
DRL
27
56
0
05 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLM
SSL
28
1,238
0
03 Apr 2019
Cross-task weakly supervised learning from instructional videos
Cross-task weakly supervised learning from instructional videos
Dimitri Zhukov
Jean-Baptiste Alayrac
R. G. Cinbis
David Fouhey
Ivan Laptev
Josef Sivic
SSL
101
245
0
19 Mar 2019
COIN: A Large-scale Dataset for Comprehensive Instructional Video
  Analysis
COIN: A Large-scale Dataset for Comprehensive Instructional Video Analysis
Yansong Tang
Dajun Ding
Yongming Rao
Yu Zheng
Danyang Zhang
Lili Zhao
Jiwen Lu
Jie Zhou
96
308
0
07 Mar 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly
  Supervised Action Alignment and Segmentation
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
C. Chang
De-An Huang
Yanan Sui
Li Fei-Fei
Juan Carlos Niebles
88
156
0
09 Jan 2019
Zero-Shot Anticipation for Instructional Activities
Zero-Shot Anticipation for Instructional Activities
Fadime Sener
Angela Yao
LM&Ro
101
68
0
06 Dec 2018
Learning Latent Dynamics for Planning from Pixels
Learning Latent Dynamics for Planning from Pixels
Danijar Hafner
Timothy Lillicrap
Ian S. Fischer
Ruben Villegas
David R Ha
Honglak Lee
James Davidson
BDL
55
1,416
0
12 Nov 2018
Time-Agnostic Prediction: Predicting Predictable Video Frames
Time-Agnostic Prediction: Predicting Predictable Video Frames
Dinesh Jayaraman
F. Ebert
Alexei A. Efros
Sergey Levine
46
93
0
23 Aug 2018
Learning Plannable Representations with Causal InfoGAN
Learning Plannable Representations with Causal InfoGAN
Thanard Kurutach
Aviv Tamar
Ge Yang
Stuart J. Russell
Pieter Abbeel
GAN
DRL
40
180
0
24 Jul 2018
Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video
  Demonstration
Neural Task Graphs: Generalizing to Unseen Tasks from a Single Video Demonstration
De-An Huang
Suraj Nair
Danfei Xu
Yuke Zhu
Animesh Garg
Li Fei-Fei
Silvio Savarese
Juan Carlos Niebles
32
140
0
10 Jul 2018
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning
Alexander Richard
Hilde Kuehne
Ahsan Iqbal
Juergen Gall
53
137
0
17 May 2018
Stochastic Adversarial Video Prediction
Stochastic Adversarial Video Prediction
Alex X. Lee
Richard Y. Zhang
F. Ebert
Pieter Abbeel
Chelsea Finn
Sergey Levine
DRL
VGen
GAN
41
450
0
04 Apr 2018
When will you do what? - Anticipating Temporal Occurrences of Activities
When will you do what? - Anticipating Temporal Occurrences of Activities
Yazan Abu Farha
Alexander Richard
Juergen Gall
45
190
0
03 Apr 2018
Universal Planning Networks
Universal Planning Networks
A. Srinivas
Allan Jabri
Pieter Abbeel
Sergey Levine
Chelsea Finn
SSL
53
145
0
02 Apr 2018
Who Let The Dogs Out? Modeling Dog Behavior From Visual Data
Who Let The Dogs Out? Modeling Dog Behavior From Visual Data
Kiana Ehsani
Hessam Bagherinezhad
Joseph Redmon
Roozbeh Mottaghi
Ali Farhadi
VGen
34
59
0
28 Mar 2018
Visual Forecasting by Imitating Dynamics in Natural Sequences
Visual Forecasting by Imitating Dynamics in Natural Sequences
Kuo-Hao Zeng
Bokui (William) Shen
De-An Huang
Min Sun
Juan Carlos Niebles
AI4TS
42
61
0
19 Aug 2017
Curiosity-driven Exploration by Self-supervised Prediction
Curiosity-driven Exploration by Self-supervised Prediction
Deepak Pathak
Pulkit Agrawal
Alexei A. Efros
Trevor Darrell
LRM
SSL
89
2,416
0
15 May 2017
Time-Contrastive Networks: Self-Supervised Learning from Video
Time-Contrastive Networks: Self-Supervised Learning from Video
P. Sermanet
Corey Lynch
Yevgen Chebotar
Jasmine Hsu
Eric Jang
S. Schaal
Sergey Levine
SSL
72
820
0
23 Apr 2017
Towards Automatic Learning of Procedures from Web Instructional Videos
Towards Automatic Learning of Procedures from Web Instructional Videos
Luowei Zhou
Chenliang Xu
Jason J. Corso
EgoV
49
812
0
28 Mar 2017
Open Vocabulary Scene Parsing
Open Vocabulary Scene Parsing
Hang Zhao
Xavier Puig
Bolei Zhou
Sanja Fidler
Antonio Torralba
VLM
3DV
37
119
0
26 Mar 2017
Joint Discovery of Object States and Manipulation Actions
Joint Discovery of Object States and Manipulation Actions
Jean-Baptiste Alayrac
Josef Sivic
Ivan Laptev
Simon Lacoste-Julien
43
79
0
09 Feb 2017
First-Person Activity Forecasting with Online Inverse Reinforcement
  Learning
First-Person Activity Forecasting with Online Inverse Reinforcement Learning
Nicholas Rhinehart
Kris Kitani
EgoV
24
140
0
22 Dec 2016
Deep Visual Foresight for Planning Robot Motion
Deep Visual Foresight for Planning Robot Motion
Chelsea Finn
Sergey Levine
91
779
0
03 Oct 2016
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
De-An Huang
Li Fei-Fei
Juan Carlos Niebles
50
237
0
28 Jul 2016
Learning to Poke by Poking: Experiential Learning of Intuitive Physics
Learning to Poke by Poking: Experiential Learning of Intuitive Physics
Pulkit Agrawal
Ashvin Nair
Pieter Abbeel
Jitendra Malik
Sergey Levine
SSL
36
562
0
23 Jun 2016
Semi-supervised Vocabulary-informed Learning
Semi-supervised Vocabulary-informed Learning
Yanwei Fu
Leonid Sigal
VLM
27
132
0
24 Apr 2016
Deep Spatial Autoencoders for Visuomotor Learning
Deep Spatial Autoencoders for Visuomotor Learning
Chelsea Finn
X. Tan
Yan Duan
Trevor Darrell
Sergey Levine
Pieter Abbeel
SSL
25
551
0
21 Sep 2015
Action-Conditional Video Prediction using Deep Networks in Atari Games
Action-Conditional Video Prediction using Deep Networks in Atari Games
Junhyuk Oh
Xiaoxiao Guo
Honglak Lee
Richard L. Lewis
Satinder Singh
64
852
0
31 Jul 2015
Unsupervised Semantic Parsing of Video Collections
Unsupervised Semantic Parsing of Video Collections
Ozan Sener
Amir Zamir
Silvio Savarese
Ashutosh Saxena
32
98
0
28 Jun 2015
Embed to Control: A Locally Linear Latent Dynamics Model for Control
  from Raw Images
Embed to Control: A Locally Linear Latent Dynamics Model for Control from Raw Images
Manuel Watter
Jost Tobias Springenberg
Joschka Boedecker
Martin Riedmiller
BDL
35
839
0
24 Jun 2015
What's Cookin'? Interpreting Cooking Videos using Text, Speech and
  Vision
What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision
J. Malmaud
Jonathan Huang
V. Rathod
Nick Johnston
Andrew Rabinovich
Kevin Patrick Murphy
46
152
0
05 Mar 2015
Video (language) modeling: a baseline for generative models of natural
  videos
Video (language) modeling: a baseline for generative models of natural videos
MarcÁurelio Ranzato
Arthur Szlam
Joan Bruna
Michaël Mathieu
R. Collobert
S. Chopra
VGen
62
471
0
20 Dec 2014
1