Actions ~ Transformations

2 December 2015
Xiaolong Wang
Ali Farhadi
Abhinav Gupta
arXiv: 1512.00795

Papers citing "Actions ~ Transformations"

50 / 51 papers shown
Title
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
Rui Qian
Shuangrui Ding
Dahua Lin
OCL
54
1
0
09 Jul 2024
STAR: A Benchmark for Situated Reasoning in Real-World Videos
Bo Wu
Shoubin Yu
Zhenfang Chen
Joshua B Tenenbaum
Chuang Gan
46
178
0
15 May 2024
Learning to Visually Connect Actions and their Effects
Eric Peh
Paritosh Parmar
Basura Fernando
24
2
0
19 Jan 2024
Visual Reasoning: from State to Transformation
Xin Hong
Yanyan Lan
Liang Pang
J. Guo
Xueqi Cheng
LRM
27
4
0
02 May 2023
Action Dynamics Task Graphs for Learning Plannable Representations of Procedural Tasks
Weichao Mao
Ruta Desai
Michael L. Iuzzolino
Nitin Kamra
36
5
0
11 Jan 2023
Video-based Human Action Recognition using Deep Learning: A Review
Hieu H. Pham
L. Khoudour
Alain Crouzil
Pablo Zegers
S. Velastín
35
34
0
07 Aug 2022
Understanding 3D Object Articulation in Internet Videos
Shengyi Qian
Linyi Jin
C. Rockwell
Siyi Chen
David Fouhey
27
29
0
30 Mar 2022
Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
Tomáš Souček
Jean-Baptiste Alayrac
Antoine Miech
Ivan Laptev
Josef Sivic
21
32
0
22 Mar 2022
Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
G. Moon
E. Cyr
25
5
0
07 Mar 2022
Precondition and Effect Reasoning for Action Recognition
Hongsang Yoo
Haopeng Li
Qiuhong Ke
Liangchen Liu
Rui Zhang
CML
49
4
0
19 Dec 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
278
1,026
0
13 Oct 2021
PIGLeT: Language Grounding Through Neuro-Symbolic Interaction in a 3D World
Rowan Zellers
Ari Holtzman
Matthew E. Peters
Roozbeh Mottaghi
Aniruddha Kembhavi
Ali Farhadi
Yejin Choi
27
68
0
01 Jun 2021
Temporal Distinct Representation Learning for Action Recognition
Junwu Weng
Donghao Luo
Yabiao Wang
Ying Tai
Chengjie Wang
Jilin Li
Feiyue Huang
Xudong Jiang
Junsong Yuan
17
26
0
15 Jul 2020
Aligning Videos in Space and Time
Senthil Purushwalkam
Tian-Chun Ye
Saurabh Gupta
Abhinav Gupta
30
23
0
09 Jul 2020
Generating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models
César Roberto de Souza
Adrien Gaidon
Yohann Cabon
Naila Murray
A. Peña
39
14
0
12 Oct 2019
Explainable Video Action Reasoning via Prior Knowledge and State Transitions
Tao Zhuo
Zhiyong Cheng
Peng Zhang
Yongkang Wong
Mohan Kankanhalli
FAtt
33
60
0
28 Aug 2019
What Makes Training Multi-Modal Classification Networks Hard?
Weiyao Wang
Du Tran
Matt Feiszli
34
443
0
29 May 2019
Exploring Temporal Information for Improved Video Understanding
Yi Zhu
23
0
0
25 May 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
32
539
0
06 Apr 2019
Spatiotemporal Pyramid Network for Video Action Recognition
Yunbo Wang
Mingsheng Long
Jianmin Wang
Philip S. Yu
32
227
0
04 Mar 2019
Grasp2Vec: Learning Object Representations from Self-Supervised Grasping
Eric Jang
Coline Devin
Vincent Vanhoucke
Sergey Levine
SSL
29
112
0
16 Nov 2018
NeXtVLAD: An Efficient Neural Network to Aggregate Frame-level Features for Large-scale Video Classification
Rongcheng Lin
Jing Xiao
Jianping Fan
VLM
20
103
0
12 Nov 2018
Cross and Learn: Cross-Modal Self-Supervision
Nawid Sayed
Biagio Brattoli
Bjorn Ommer
SSL
33
78
0
09 Nov 2018
MT-VAE: Learning Motion Transformations to Generate Multimodal Human Dynamics
Xinchen Yan
Akash Rastogi
Ruben Villegas
Kalyan Sunkavalli
Eli Shechtman
Sunil Hadap
Ersin Yumer
Honglak Lee
30
150
0
14 Aug 2018
Videos as Space-Time Region Graphs
Xiaolong Wang
Abhinav Gupta
36
752
0
05 Jun 2018
Video Anomaly Detection and Localization via Gaussian Mixture Fully Convolutional Variational Autoencoder
Yaxiang Fan
G. Wen
Deren Li
S. Qiu
M. Levine
DRL
19
211
0
29 May 2018
Switchable Temporal Propagation Network
Sifei Liu
Guangyu Zhong
Shalini De Mello
Liang Feng
Varun Jampani
Ming-Hsuan Yang
Jan Kautz
14
34
0
23 Apr 2018
Attributes as Operators: Factorizing Unseen Attribute-Object Compositions
Tushar Nagarajan
Kristen Grauman
OCL
CoGe
24
59
0
27 Mar 2018
Disjoint Multi-task Learning between Heterogeneous Human-centric Tasks
Dong-Jin Kim
Jinsoo Choi
Tae-Hyun Oh
Youngjin Yoon
In So Kweon
24
27
0
14 Feb 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
52
1,309
0
13 Dec 2017
From Lifestyle Vlogs to Everyday Interactions
David Fouhey
Weicheng Kuo
Alexei A. Efros
Jitendra Malik
22
124
0
06 Dec 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
126
2,990
0
30 Nov 2017
Action Recognition with Coarse-to-Fine Deep Feature Integration and Asynchronous Fusion
Weiyao Lin
Yang Mi
Jianxin Wu
K. Lu
H. Xiong
27
37
0
20 Nov 2017
RGB-D-based Human Motion Recognition with Deep Learning: A Survey
Pichao Wang
W. Li
P. Ogunbona
Jun Wan
Sergio Escalera
3DH
39
353
0
31 Oct 2017
ConvNet Architecture Search for Spatiotemporal Feature Learning
Du Tran
Jamie Ray
Zheng Shou
Shih-Fu Chang
Manohar Paluri
3DPC
40
382
0
16 Aug 2017
Unsupervised Representation Learning by Sorting Sequences
Hsin-Ying Lee
Jia-Bin Huang
Maneesh Kumar Singh
Ming-Hsuan Yang
SSL
DRL
32
533
0
03 Aug 2017
The Kinetics Human Action Video Dataset
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
...
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
74
3,757
0
19 May 2017
Am I Done? Predicting Action Progress in Videos
Federico Becattini
Tiberio Uricchio
Lorenzo Seidenari
Lamberto Ballan
A. Bimbo
30
33
0
04 May 2017
ActionVLAD: Learning spatio-temporal aggregation for action classification
Rohit Girdhar
Deva Ramanan
Abhinav Gupta
Josef Sivic
Bryan C. Russell
AI4TS
19
450
0
10 Apr 2017
TS-LSTM and Temporal-Inception: Exploiting Spatiotemporal Dynamics for Activity Recognition
Chih-Yao Ma
Min-Hung Chen
Z. Kira
G. Al-Regib
AI4TS
32
241
0
30 Mar 2017
Trespassing the Boundaries: Labeling Temporal Bounds for Object Interactions in Egocentric Video
Davide Moltisanti
Michael Wray
W. Mayol-Cuevas
Dima Damen
EgoV
25
31
0
27 Mar 2017
Encouraging LSTMs to Anticipate Actions Very Early
Mohammad Sadegh Ali Akbarian
F. Saleh
Mathieu Salzmann
Basura Fernando
L. Petersson
Lars Andersson
34
169
0
21 Mar 2017
Joint Discovery of Object States and Manipulation Actions
Jean-Baptiste Alayrac
Josef Sivic
Ivan Laptev
Simon Lacoste-Julien
22
79
0
09 Feb 2017
Asynchronous Temporal Fields for Action Recognition
Gunnar A. Sigurdsson
S. Divvala
Ali Farhadi
Abhinav Gupta
BDL
24
170
0
19 Dec 2016
ActionFlowNet: Learning Motion Representation for Action Recognition
Joe Yue-Hei Ng
Jonghyun Choi
J. Neumann
L. Davis
36
117
0
09 Dec 2016
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos
Amlan Kar
Nishant Rai
Karan Sikka
Gaurav Sharma
32
152
0
24 Nov 2016
Spatiotemporal Residual Networks for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Richard P. Wildes
63
716
0
07 Nov 2016
Pose from Action: Unsupervised Learning of Pose Features based on Motion
Senthil Purushwalkam
Abhinav Gupta
SSL
29
23
0
18 Sep 2016
Sympathy for the Details: Dense Trajectories and Hybrid Classification Architectures for Action Recognition
César Roberto de Souza
Adrien Gaidon
E. Vig
A. Peña
27
44
0
25 Aug 2016
Depth2Action: Exploring Embedded Depth for Large-Scale Action Recognition
Yi Zhu
Shawn D. Newsam
19
41
0
15 Aug 2016