Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?


27 November 2017
Kensho Hara
Hirokatsu Kataoka
Y. Satoh
    3DPC

Papers citing "Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?"

50 / 287 papers shown
Title
Applying Deep-Learning-Based Computer Vision to Wireless Communications: Methodologies, Opportunities, and Challenges
Yu Tian
Gaofeng Pan
Mohamed-Slim Alouini
22
43
0
10 Jun 2020
Not made for each other- Audio-Visual Dissonance-based Deepfake Detection and Localization
Komal Chugh
Parul Gupta
Abhinav Dhall
Ramanathan Subramanian
29
164
0
29 May 2020
Retrieving and Highlighting Action with Spatiotemporal Reference
Seito Kasai
Yuchi Ishikawa
Masaki Hayashi
Y. Aoki
Kensho Hara
Hirokatsu Kataoka
11
0
0
19 May 2020
Context-aware and Scale-insensitive Temporal Repetition Counting
Huaidong Zhang
Xuemiao Xu
Guoqiang Han
Shengfeng He
21
47
0
18 May 2020
From Standard Summarization to New Tasks and Beyond: Summarization with Manifold Information
Shen Gao
Preslav Nakov
Z. Ren
Dongyan Zhao
Rui Yan
23
48
0
10 May 2020
Dual-Sampling Attention Network for Diagnosis of COVID-19 from Community Acquired Pneumonia
Xi Ouyang
Jiayu Huo
L. Xia
F. Shan
Jun Liu
...
Xiaohuan Cao
Yaozong Gao
Dijia Wu
Qian Wang
Dinggang Shen
24
312
0
06 May 2020
Multiresolution and Multimodal Speech Recognition with Transformers
Georgios Paraskevopoulos
Srinivas Parthasarathy
Aparna Khare
Shiva Sundaram
25
29
0
29 Apr 2020
Low-latency hand gesture recognition with a low resolution thermal imager
Maarten Vandersteegen
Wouter Reusen
Kristof Van Beeck
24
15
0
24 Apr 2020
Cross-ethnicity Face Anti-spoofing Recognition Challenge: A Review
Ajian Liu
Xuan Li
Jun Wan
Sergio Escalera
Hugo Jair Escalante
...
Qiaoning Yuan
Ruikun Yang
Benjia Zhou
G. Guo
Stan Z. Li
CVBM
35
76
0
23 Apr 2020
Adversarial Distortion for Learned Video Compression
Vijay Veerabadran
Reza Pourreza
A. Habibian
Taco S. Cohen
GAN
43
13
0
20 Apr 2020
Would Mega-scale Datasets Further Enhance Spatiotemporal 3D CNNs?
Hirokatsu Kataoka
Tenga Wakamiya
Kensho Hara
Y. Satoh
3DPC
31
87
0
10 Apr 2020
X3D: Expanding Architectures for Efficient Video Recognition
Christoph Feichtenhofer
73
1,001
0
09 Apr 2020
Explaining Motion Relevance for Activity Recognition in Video Deep Learning Models
Liam Hiley
Alun D. Preece
Y. Hicks
Supriyo Chakraborty
Prudhvi K. Gurram
Richard J. Tomsett
FAtt
25
14
0
31 Mar 2020
Learning Interactions and Relationships between Movie Characters
Anna Kukleva
Makarand Tapaswi
Ivan Laptev
41
51
0
29 Mar 2020
On Translation Invariance in CNNs: Convolutional Layers can Exploit Absolute Spatial Location
O. Kayhan
Jan van Gemert
211
233
0
16 Mar 2020
PANDA: A Gigapixel-level Human-centric Video Dataset
Xueyan Wang
Xiya Zhang
Yinheng Zhu
Yuchen Guo
Xiaoyun Yuan
...
Zerun Wang
Guiguang Ding
D. Brady
Qionghai Dai
Lu Fang
VGen
44
79
0
10 Mar 2020
Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning
Elad Amrani
Rami Ben-Ari
Daniel Rotman
A. Bronstein
17
121
0
06 Mar 2020
Rethinking Zero-shot Video Classification: End-to-end Training for Realistic Applications
Biagio Brattoli
Joseph Tighe
Fedor Zhdanov
Pietro Perona
Krzysztof Chalupka
VLM
137
127
0
03 Mar 2020
Joint 2D-3D Breast Cancer Classification
G. Liang
Xiaoqin Wang
Yu Zhang
Xin Xing
Hunter Blanton
Tawfiq Salem
Nathan Jacobs
23
39
0
27 Feb 2020
Object Relational Graph with Teacher-Recommended Learning for Video Captioning
Ziqi Zhang
Yaya Shi
Chunfen Yuan
Bing Li
Peijin Wang
Weiming Hu
Zhengjun Zha
VLM
37
271
0
26 Feb 2020
Deep learning predicts total knee replacement from magnetic resonance images
Aniket A. Tolpadi
Jinhee J. Lee
V. Pedoia
S. Majumdar
MedIm
AI4CE
13
95
0
24 Feb 2020
Fine-Grained Instance-Level Sketch-Based Video Retrieval
Peng Xu
Kun Liu
Tao Xiang
Timothy M. Hospedales
Zhanyu Ma
Jun Guo
Yi-Zhe Song
32
32
0
21 Feb 2020
An End-to-End Visual-Audio Attention Network for Emotion Recognition in User-Generated Videos
Sicheng Zhao
Yunsheng Ma
Yang Gu
Jufeng Yang
Tengfei Xing
Pengfei Xu
Runbo Hu
Hua Chai
Kurt Keutzer
16
98
0
12 Feb 2020
Weakly-Supervised Multi-Person Action Recognition in 360° Videos
Junnan Li
Jianquan Liu
Yongkang Wong
Shoji Nishimura
Mohan S. Kankanhalli
28
13
0
09 Feb 2020
Solving Raven's Progressive Matrices with Neural Networks
Tao Zhuo
Mohan S. Kankanhalli
27
26
0
05 Feb 2020
3D ResNet with Ranking Loss Function for Abnormal Activity Detection in Videos
Shikha Dubey
Abhijeet Boragule
M. Jeon
26
29
0
04 Feb 2020
STAViS: Spatio-Temporal AudioVisual Saliency Network
A. Tsiami
Petros Koutras
Petros Maragos
27
73
0
09 Jan 2020
DeepFakes and Beyond: A Survey of Face Manipulation and Fake Detection
Ruben Tolosana
R. Vera-Rodríguez
Julian Fierrez
Aythami Morales
J. Ortega-Garcia
3DPC
CVBM
51
775
0
01 Jan 2020
Synthetic Humans for Action Recognition from Unseen Viewpoints
Gül Varol
Ivan Laptev
Cordelia Schmid
Andrew Zisserman
33
96
0
09 Dec 2019
Optimal checkpointing for heterogeneous chains: how to train deep neural networks with limited memory
Julien Herrmann
Olivier Beaumont
Lionel Eyraud-Dubois
J. Herrmann
Alexis Joly
Alena Shilova
BDL
31
29
0
27 Nov 2019
Reinventing 2D Convolutions for 3D Images
Jiancheng Yang
Xiaoyang Huang
Yi He
Jingwei Xu
Canqian Yang
Guozheng Xu
Bingbing Ni
16
11
0
24 Nov 2019
You Only Watch Once: A Unified CNN Architecture for Real-Time Spatiotemporal Action Localization
Okan Kopuklu
Xiangyu Wei
Gerhard Rigoll
28
143
0
15 Nov 2019
Interpretable Self-Attention Temporal Reasoning for Driving Behavior Understanding
Yi-Chieh Liu
Yung-An Hsieh
Min-Hung Chen
Chao-Han Huck Yang
Jesper N. Tegnér
Y. Tsai
37
19
0
06 Nov 2019
A Spectral Nonlocal Block for Neural Networks
Lei Zhu
Qi She
Lidan Zhang
Ping Guo
18
2
0
04 Nov 2019
Transformer-based Cascaded Multimodal Speech Translation
Zixiu "Alex" Wu
Ozan Caglayan
Julia Ive
Josiah Wang
Lucia Specia
25
7
0
29 Oct 2019
Human Action Recognition in Drone Videos using a Few Aerial Training Examples
Waqas Sultani
M. Shah
27
46
0
22 Oct 2019
Multi-Resolution Weak Supervision for Sequential Data
Frederic Sala
P. Varma
Jason Alan Fries
Daniel Y. Fu
Shiori Sagawa
...
A. Ramamoorthy
K. Xiao
Kayvon Fatahalian
J. Priest
Christopher Ré
NoLa
14
28
0
21 Oct 2019
Graph-based Spatial-temporal Feature Learning for Neuromorphic Vision Sensing
Yin Bi
Aaron Chadha
Alhabib Abbas
Eirina Bourtsoulatze
Y. Andreopoulos
25
26
0
08 Oct 2019
Rekall: Specifying Video Events using Compositions of Spatiotemporal Labels
Daniel Y. Fu
Will Crichton
James Hong
Xinwei Yao
Haotian Zhang
A. Truong
A. Narayan
Maneesh Agrawala
Christopher Ré
Kayvon Fatahalian
19
48
0
07 Oct 2019
Grouped Spatial-Temporal Aggregation for Efficient Action Recognition
Chenxu Luo
Alan Yuille
130
150
0
28 Sep 2019
Class Feature Pyramids for Video Explanation
Alexandros Stergiou
G. Kapidis
Grigorios Kalliatakis
C. Chrysoulas
R. Poppe
R. Veltkamp
FAtt
33
18
0
18 Sep 2019
Comparative Analysis of CNN-based Spatiotemporal Reasoning in Videos
Okan Kopuklu
Fabian Herzog
Gerhard Rigoll
19
6
0
11 Sep 2019
Extreme Low Resolution Activity Recognition with Confident Spatial-Temporal Attention Transfer
Yucai Bai
Qinglong Zou
Xieyuanli Chen
Lingxi Li
Zhengming Ding
Long Chen
18
3
0
09 Sep 2019
Exploring Temporal Differences in 3D Convolutional Neural Networks
Gagan Kanojia
Sudhakar Kumawat
Shanmuganathan Raman
3DPC
AI4TS
21
3
0
07 Sep 2019
ForkNet: Multi-branch Volumetric Semantic Completion from a Single Depth Image
Yida Wang
D. Tan
Nassir Navab
Federico Tombari
3DV
3DPC
25
56
0
03 Sep 2019
TASED-Net: Temporally-Aggregating Spatial Encoder-Decoder Network for Video Saliency Detection
Kyle Min
Jason J. Corso
28
149
0
15 Aug 2019
Bypass Enhancement RGB Stream Model for Pedestrian Action Recognition of Autonomous Vehicles
Dong Cao
Lisha Xu
17
2
0
15 Aug 2019
Predicting Actions to Help Predict Translations
Zixiu "Alex" Wu
Julia Ive
Josiah Wang
Pranava Madhyastha
Lucia Specia
17
7
0
05 Aug 2019
An Efficient 3D CNN for Action/Object Segmentation in Video
Rui Hou
Chong Chen
Rahul Sukthankar
M. Shah
24
27
0
21 Jul 2019
Profiling based Out-of-core Hybrid Method for Large Neural Networks
Yuki Ito
Haruki Imai
Tung D. Le
Yasushi Negishi
K. Kawachiya
R. Matsumiya
Toshio Endo
24
9
0
11 Jul 2019