Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1604.01753
Cited By
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
6 April 2016
Gunnar A. Sigurdsson
Gül Varol
Xueliang Wang
Ali Farhadi
Ivan Laptev
Abhinav Gupta
VGen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding"
50 / 287 papers shown
Title
Grounded Situation Recognition
Sarah M Pratt
Mark Yatskar
Luca Weihs
Ali Farhadi
Aniruddha Kembhavi
17
111
0
26 Mar 2020
PIC: Permutation Invariant Convolution for Recognizing Long-range Activities
Noureldien Hussein
E. Gavves
A. Smeulders
VLM
26
13
0
18 Mar 2020
ZSTAD: Zero-Shot Temporal Activity Detection
Lingling Zhang
Xiaojun Chang
Jun Liu
Minnan Luo
Sen Wang
Zongyuan Ge
Alexander G. Hauptmann
24
28
0
12 Mar 2020
Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks
M. Rashid
Hedvig Kjellström
Yong Jae Lee
WSOL
GNN
19
46
0
04 Feb 2020
Interpreting video features: a comparison of 3D convolutional networks and convolutional LSTM networks
Joonatan Mänttäri
Sofia Broomé
John Folkesson
Hedvig Kjellström
FAtt
24
27
0
02 Feb 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
119
276
0
24 Jan 2020
Audiovisual SlowFast Networks for Video Recognition
Fanyi Xiao
Yong Jae Lee
Kristen Grauman
Jitendra Malik
Christoph Feichtenhofer
197
207
0
23 Jan 2020
Weakly Supervised Temporal Action Localization Using Deep Metric Learning
Ashraful Islam
Richard J. Radke
27
46
0
21 Jan 2020
Tree-Structured Policy based Progressive Reinforcement Learning for Temporally Language Grounding in Video
Jie Wu
Guanbin Li
Si Liu
Liang Lin
OffRL
23
104
0
18 Jan 2020
Self-supervising Action Recognition by Statistical Moment and Subspace Descriptors
Lei Wang
Piotr Koniusz
24
50
0
14 Jan 2020
Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog
Shachi H. Kumar
Eda Okur
Saurav Sahay
Jonathan Huang
L. Nachman
16
7
0
20 Dec 2019
Action Genome: Actions as Composition of Spatio-temporal Scene Graphs
Jingwei Ji
Ranjay Krishna
Li Fei-Fei
Juan Carlos Niebles
39
336
0
15 Dec 2019
A Multigrid Method for Efficiently Training Video Models
Chaoxia Wu
Ross B. Girshick
Kaiming He
Christoph Feichtenhofer
Philipp Krahenbuhl
21
94
0
02 Dec 2019
Weakly-Supervised Video Moment Retrieval via Semantic Completion Network
Zhijie Lin
Zhou Zhao
Zhu Zhang
Qi. Wang
Huasheng Liu
22
149
0
19 Nov 2019
The Eighth Dialog System Technology Challenge
Seokhwan Kim
Michel Galley
Chulaka Gunasekara
Sungjin Lee
Adam Atkinson
...
Tim K. Marks
Abhinav Rastogi
Xiaoxue Zang
Srinivas Sunkara
Raghav Gupta
VLM
11
65
0
14 Nov 2019
CATER: A diagnostic dataset for Compositional Actions and TEmporal Reasoning
Rohit Girdhar
Deva Ramanan
19
176
0
10 Oct 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Chenxu Luo
Alan Yuille
130
150
0
28 Sep 2019
LoGAN: Latent Graph Co-Attention Network for Weakly-Supervised Video Moment Retrieval
Reuben Tan
Huijuan Xu
Kate Saenko
Bryan A. Plummer
28
67
0
27 Sep 2019
Temporal Reasoning Graph for Activity Recognition
Jingran Zhang
Fumin Shen
Xing Xu
Heng Tao Shen
39
60
0
27 Aug 2019
Proposal-free Temporal Moment Localization of a Natural-Language Query in Video using Guided Attention
Cristian Rodriguez-Opazo
Edison Marrese-Taylor
F. Saleh
Hongdong Li
Stephen Gould
27
147
0
20 Aug 2019
Weakly-supervised Action Localization with Background Modeling
P. Nguyen
Deva Ramanan
Charless C. Fowlkes
SSL
WSOL
24
157
0
19 Aug 2019
Moviescope: Large-scale Analysis of Movies using Multiple Modalities
Paola Cascante-Bonilla
Kalpathy Sitaraman
Mengjia Luo
Vicente Ordonez
24
39
0
08 Aug 2019
Only Time Can Tell: Discovering Temporal Data for Temporal Modeling
Laura Sevilla-Lara
Shengxin Cindy Zha
Zhicheng Yan
Vedanuj Goswami
Matt Feiszli
Lorenzo Torresani
50
75
0
19 Jul 2019
Few-Shot Video Classification via Temporal Alignment
Kaidi Cao
Jingwei Ji
Zhangjie Cao
C. Chang
Juan Carlos Niebles
AI4TS
27
235
0
27 Jun 2019
Multimodal Abstractive Summarization for How2 Videos
Shruti Palaskar
Jindrich Libovický
Spandana Gella
Florian Metze
22
95
0
19 Jun 2019
Two-Stream Region Convolutional 3D Network for Temporal Activity Detection
Huijuan Xu
Abir Das
Kate Saenko
3DPC
16
46
0
05 Jun 2019
Representation Learning on Visual-Symbolic Graphs for Video Understanding
E. Mavroudi
Benjamín Béjar Haro
René Vidal
27
8
0
17 May 2019
Interactive Video Retrieval with Dialog
Sho Maeoki
Kohei Uehara
Tatsuya Harada
11
9
0
07 May 2019
Large Scale Holistic Video Understanding
Ali Diba
Mohsen Fayyaz
Vivek Sharma
Manohar Paluri
Jurgen Gall
Rainer Stiefelhagen
Luc Van Gool
29
35
0
25 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
A. Schwing
Tamir Hazan
27
69
0
11 Apr 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
32
539
0
06 Apr 2019
Weakly Supervised Video Moment Retrieval From Text Queries
Niluthpol Chowdhury Mithun
S. Paul
A. Roy-Chowdhury
30
193
0
05 Apr 2019
RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization
Alejandro Pardo
Humam Alwassel
Fabian Caba Heilbron
Ali K. Thabet
Guohao Li
34
52
0
30 Mar 2019
Video Relationship Reasoning using Gated Spatio-Temporal Energy Graph
Yao-Hung Hubert Tsai
S. Divvala
Louis-Philippe Morency
Ruslan Salakhutdinov
Ali Farhadi
27
103
0
25 Mar 2019
A Mobile Robot Generating Video Summaries of Seniors' Indoor Activities
Chih-Yuan Yang
Heeseung Yun
Srenavis Varadaraj
Jane Yung-jen Hsu
23
0
0
30 Jan 2019
DistInit: Learning Video Representations Without a Single Labeled Video
Rohit Girdhar
Du Tran
Lorenzo Torresani
Deva Ramanan
27
54
0
26 Jan 2019
Audio-Visual Scene-Aware Dialog
Huda AlAmri
Vincent Cartillier
Abhishek Das
Jue Wang
A. Cherian
...
Tim K. Marks
Chiori Hori
Peter Anderson
Stefan Lee
Devi Parikh
VGen
25
189
0
25 Jan 2019
Read, Watch, and Move: Reinforcement Learning for Temporally Grounding Natural Language Descriptions in Videos
Dongliang He
Xiang Zhao
Jizhou Huang
Fu Li
Xiao-Chang Liu
Shilei Wen
22
152
0
21 Jan 2019
Anticipation and next action forecasting in video: an end-to-end model with memory
F. Pirri
L. Mauro
Edoardo Alati
Valsamis Ntouskos
Mahdieh Izadpanahkakhk
Elham Omrani
AI4TS
30
13
0
11 Jan 2019
Dialog System Technology Challenge 7
Koichiro Yoshino
Chiori Hori
Julien Perez
L. F. D’Haro
L. Polymenakos
...
Xiang Gao
Huda AlAmri
Tim K. Marks
Devi Parikh
Dhruv Batra
24
37
0
11 Jan 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
C. Chang
De-An Huang
Yanan Sui
Li Fei-Fei
Juan Carlos Niebles
22
156
0
09 Jan 2019
From FiLM to Video: Multi-turn Question Answering with Multi-modal Context
T. Nguyen
Shikhar Sharma
Hannes Schulz
Layla El Asri
15
33
0
17 Dec 2018
TAN: Temporal Aggregation Network for Dense Multi-label Action Recognition
Xiyang Dai
Bharat Singh
Joe Yue-Hei Ng
L. Davis
ViT
32
25
0
14 Dec 2018
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
62
477
0
12 Dec 2018
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
28
702
0
06 Dec 2018
Zero-Shot Anticipation for Instructional Activities
Fadime Sener
Angela Yao
LM&Ro
25
68
0
06 Dec 2018
Timeception for Complex Action Recognition
Noureldien Hussein
E. Gavves
A. Smeulders
21
212
0
04 Dec 2018
Relational Long Short-Term Memory for Video Action Recognition
Zexi Chen
B. Ramachandra
Tianfu Wu
Ranga Raju Vatsavai
24
5
0
16 Nov 2018
Attentive Sequence to Sequence Translation for Localizing Clips of Interest by Natural Language Descriptions
Ke Ning
Linchao Zhu
Ming Cai
Yi Yang
Di Xie
Fei Wu
21
2
0
27 Aug 2018
Interaction-aware Spatio-temporal Pyramid Attention Networks for Action Classification
Yang Du
Chunfen Yuan
Bing Li
Lili Zhao
Yangxi Li
Weiming Hu
81
79
0
03 Aug 2018
Previous
1
2
3
4
5
6
Next