1411.4389
Cited By
Long-term Recurrent Convolutional Networks for Visual Recognition and Description
17 November 2014
Jeff Donahue
Lisa Anne Hendricks
Marcus Rohrbach
Subhashini Venugopalan
S. Guadarrama
Kate Saenko
Trevor Darrell
VLM
Papers citing
"Long-term Recurrent Convolutional Networks for Visual Recognition and Description"
50 / 642 papers shown
Title
Operational solar flare forecasting via video-based deep learning
Sabrina Guastavino
Francesco Marchetti
F. Benvenuto
C. Campi
Michele Piana
25
10
0
12 Sep 2022
Echocardiographic Image Quality Assessment Using Deep Neural Networks
Robert B. Labs
M. Zolgharni
John P. Loo
10
6
0
02 Sep 2022
Large-Scale Traffic Congestion Prediction based on Multimodal Fusion and Representation Mapping
Bo Zhou
Jiahui Liu
Songyi Cui
Yaping Zhao
23
5
0
23 Aug 2022
Two-person Graph Convolutional Network for Skeleton-based Human Interaction Recognition
Zheng Li
Yueran Li
Linlin Tang
Tong Zhang
Jingyong Su
GNN
3DH
35
17
0
12 Aug 2022
Aesthetic Attributes Assessment of Images with AMANv2 and DPC-CaptionsV2
Xinghui Zhou
Xin Jin
Jianwen Lv
Heng Huang
Ming Mao
Shuai Cui
CoGe
18
0
0
09 Aug 2022
Video-based Human Action Recognition using Deep Learning: A Review
Hieu H. Pham
L. Khoudour
Alain Crouzil
Pablo Zegers
S. Velastín
32
34
0
07 Aug 2022
Video Question Answering with Iterative Video-Text Co-Tokenization
A. Piergiovanni
K. Morton
Weicheng Kuo
Michael S. Ryoo
A. Angelova
25
18
0
01 Aug 2022
Bodily Behaviors in Social Interaction: Novel Annotations and State-of-the-Art Evaluation
Michal Balazia
Philippe Muller
Ákos Levente Tánczos
A. V. Liechtenstein
François Brémond
17
22
0
26 Jul 2022
Programmatic Concept Learning for Human Motion Description and Synthesis
Sumith Kulal
Jiayuan Mao
A. Aiken
Jiajun Wu
30
7
0
27 Jun 2022
Bypass Network for Semantics Driven Image Paragraph Captioning
Qinjie Zheng
Chaoyue Wang
Dadong Wang
19
1
0
21 Jun 2022
Image Captioning based on Feature Refinement and Reflective Decoding
G. Alabduljabbar
Hafida Benhidour
Said Kerrache
3DV
14
3
0
16 Jun 2022
3D Convolutional with Attention for Action Recognition
Labina Shrestha
Shikha Dubey
Farrukh Olimov
M. Rafique
M. Jeon
29
0
0
05 Jun 2022
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications
Fei Wu
Qingzhong Wang
Jian Bian
Haoyi Xiong
Ning Ding
Feixiang Lu
Junqing Cheng
Dejing Dou
AI4TS
24
52
0
02 Jun 2022
Heterogeneous Data-Centric Architectures for Modern Data-Intensive Applications: Case Studies in Machine Learning and Databases
Geraldo F. Oliveira
Amirali Boroumand
Saugata Ghose
Juan Gómez Luna
O. Mutlu
28
7
0
29 May 2022
Machine Learning for Microcontroller-Class Hardware: A Review
Swapnil Sayan Saha
S. Sandha
Mani B. Srivastava
27
118
0
29 May 2022
It Isn't Sh!tposting, It's My CAT Posting
Parthsarthi Rawat
Sayan Das
Jorge Aguirre
Akhil Daphara
ViT
22
0
0
18 May 2022
Deep fusion of gray level co-occurrence matrices for lung nodule classification
A. Saihood
Hossein Karshenas
A. Naghsh-Nilchi
20
14
0
10 May 2022
Diverse Image Captioning with Grounded Style
Franz Klein
Shweta Mahajan
S. Roth
22
7
0
03 May 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLM
VLM
46
3,349
0
29 Apr 2022
3D Convolutional Networks for Action Recognition: Application to Sport Gesture Recognition
Pierre-Etienne Martin
J. Benois-Pineau
Renaud Péteri
A. Zemmari
J. Morlier
24
5
0
13 Apr 2022
Hierarchical Self-supervised Representation Learning for Movie Understanding
Fanyi Xiao
Kaustav Kundu
Joseph Tighe
Davide Modolo
SSL
44
24
0
06 Apr 2022
Gate-Shift-Fuse for Video Action Recognition
Swathikiran Sudhakaran
Sergio Escalera
Oswald Lanz
22
22
0
16 Mar 2022
TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning
Shiwen Zhang
AI4TS
24
9
0
11 Mar 2022
Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling
Tengpeng Li
Hanli Wang
Bin He
Changan Chen
DiffM
21
9
0
10 Mar 2022
Live Laparoscopic Video Retrieval with Compressed Uncertainty
Tong Yu
Pietro Mascagni
J. Verde
J. Marescaux
Didier Mutter
N. Padoy
39
7
0
08 Mar 2022
Parallel Training of GRU Networks with a Multi-Grid Solver for Long Sequences
G. Moon
E. Cyr
25
5
0
07 Mar 2022
Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations
Aishik Konwer
Xuan Xu
Joseph Bae
Chaoyu Chen
Prateek Prasanna
MedIm
33
15
0
02 Mar 2022
Rethinking Pretraining as a Bridge from ANNs to SNNs
Yihan Lin
Yifan Hu
Shiji Ma
Guo-Qi Li
Dongjie Yu
34
12
0
02 Mar 2022
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars
Le Yang
Junwei Han
Dingwen Zhang
21
35
0
02 Mar 2022
Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition
Xiaoguang Zhu
Ye Zhu
Haoyu Wang
Honglin Wen
Yan Yan
Peilin Liu
19
25
0
23 Feb 2022
CaMEL: Mean Teacher Learning for Image Captioning
Manuele Barraco
Matteo Stefanini
Marcella Cornia
S. Cascianelli
Lorenzo Baraldi
Rita Cucchiara
ViT
VLM
35
27
0
21 Feb 2022
When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs
Oana Ignat
Santiago Castro
Yuhang Zhou
Jiajun Bao
Dandan Shan
Rada Mihalcea
18
3
0
16 Feb 2022
An Integrated Approach for Video Captioning and Applications
Soheyla Amirian
T. Taha
Khaled Rasheed
H. Arabnia
26
1
0
23 Jan 2022
OCSampler: Compressing Videos to One Clip with Single-step Sampling
Jintao Lin
Haodong Duan
Kai-xiang Chen
Dahua Lin
Limin Wang
34
24
0
12 Jan 2022
Adaptive Memory Networks with Self-supervised Learning for Unsupervised Anomaly Detection
Yu-xin Zhang
Jindong Wang
Yiqiang Chen
Hanchao Yu
Tao Qin
AI4TS
30
54
0
03 Jan 2022
3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Naïve
Lei Wang
Jun Liu
Piotr Koniusz
24
20
0
23 Dec 2021
Wholesale Electricity Price Forecasting using Integrated Long-term Recurrent Convolutional Network Model
Vasudharini Sridharan
Mingjian Tuo
Xingpeng Li
11
31
0
23 Dec 2021
A Survey of Natural Language Generation
Chenhe Dong
Hai-Tao Zheng
Haifan Gong
M. Chen
Junxin Li
Ying Shen
Min Yang
3DV
27
43
0
22 Dec 2021
Adversarial Memory Networks for Action Prediction
Zhiqiang Tao
Yue Bai
Handong Zhao
Sheng Li
Yuanyuan Kong
Y. Fu
GAN
13
2
0
18 Dec 2021
Distillation of Human-Object Interaction Contexts for Action Recognition
Muna Almushyti
Frederick W. Li
34
3
0
17 Dec 2021
Temporal Shuffling for Defending Deep Action Recognition Models against Adversarial Attacks
Jaehui Hwang
Huan Zhang
Jun-Ho Choi
Cho-Jui Hsieh
Jong-Seok Lee
AAML
19
5
0
15 Dec 2021
SVIP: Sequence VerIfication for Procedures in Videos
Yichen Qian
Weixin Luo
Dongze Lian
Xu Tang
P. Zhao
Shenghua Gao
ViT
29
17
0
13 Dec 2021
Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search
Yi Ding
Xinyu Gong
Junru Wu
Humphrey Shi
Zhicheng Yan
Zhangyang Wang
VGen
52
1
0
09 Dec 2021
Joint Learning of Localized Representations from Medical Images and Reports
Philipp Muller
Georgios Kaissis
Cong Zou
Daniel Munich
137
81
0
06 Dec 2021
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding
Dave Zhenyu Chen
Qirui Wu
Matthias Nießner
Angel X. Chang
21
29
0
02 Dec 2021
BEVT: BERT Pretraining of Video Transformers
Rui Wang
Dongdong Chen
Zuxuan Wu
Yinpeng Chen
Xiyang Dai
Mengchen Liu
Yu-Gang Jiang
Luowei Zhou
Lu Yuan
ViT
36
203
0
02 Dec 2021
Neural Attention for Image Captioning: Review of Outstanding Methods
Zanyar Zohourianshahzadi
Jugal Kalita
VLM
27
45
0
29 Nov 2021
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
34
192
0
29 Nov 2021
DVCFlow: Modeling Information Flow Towards Human-like Video Captioning
Xu Yan
Zhengcong Fei
Shuhui Wang
Qingming Huang
Qi Tian
VGen
40
4
0
19 Nov 2021
Sparse Adversarial Video Attacks with Spatial Transformations
Ronghui Mu
Wenjie Ruan
Leandro Soriano Marcolino
Q. Ni
AAML
25
18
0
10 Nov 2021