Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2007.04687
Cited By
v1
v2 (latest)
Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision
9 July 2020
Peng Wu
Jing Liu
Yujia Shi
Yujia Sun
Fang Shao
Zhaoyang Wu
Zhiwei Yang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision"
44 / 44 papers shown
Title
The Role of Video Generation in Enhancing Data-Limited Action Understanding
Wei Li
Dezhao Luo
Dongbao Yang
Zhenhang Li
Weiping Wang
Yu Zhou
DiffM
VGen
269
0
0
26 May 2025
Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection
SungHeon Jeong
Jihong Park
Mohsen Imani
158
0
0
05 May 2025
Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization
Hongwei Ji
Wulian Yun
Mengshi Qi
Huadong Ma
LRM
426
0
0
18 Apr 2025
Hoi2Anomaly: An Explainable Anomaly Detection Approach Guided by Human-Object Interaction
Yuhan Wang
Cheng Liu
Daou Zhang
Weichao Wu
92
0
0
13 Mar 2025
Detection, Retrieval, and Explanation Unified: A Violence Detection System Based on Knowledge Graphs and GAT
Wen-Dong Jiang
Chih-Yung Chang
Diptendu Sinha Roy
128
0
0
07 Jan 2025
Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection
Ayush Ghadiya
P. Kar
Vishal M. Chudasama
Pankaj Wasnik
101
3
0
31 Dec 2024
Do Language Models Understand Time?
Xi Ding
Lei Wang
272
1
0
18 Dec 2024
Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion
Dexuan Ding
Lei Wang
Liyun Zhu
Tom Gedeon
Piotr Koniusz
111
9
0
02 Oct 2024
MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation
Sanggeon Yun
Ryozo Masukawa
Minhyoung Na
Mohsen Imani
99
8
0
27 Jun 2024
Distilling Aggregated Knowledge for Weakly-Supervised Video Anomaly Detection
Jash Dalvi
Ali Dabouei
Gunjan Dhanuka
Min Xu
57
0
0
05 Jun 2024
Networking Systems for Video Anomaly Detection: A Tutorial and Survey
Jing Liu
Yang Liu
Jieyu Lin
Jielin Li
Peng Sun
Bo Hu
Liang Song
Azzedine Boukerche
Victor C.M. Leung
Victor C.M. Leung
179
12
0
16 May 2024
A Closer Look at AUROC and AUPRC under Class Imbalance
Matthew B. A. McDermott
Lasse Hyldig Hansen
Haoran Zhang
Giovanni Angelotti
Jack Gallifant
121
32
0
11 Jan 2024
Graph Anomaly Detection in Time Series: A Survey
Thi Kieu Khanh Ho
Ali Karami
Narges Armanfard
AI4TS
110
7
0
31 Jan 2023
Graph Convolutional Networks for Temporal Action Localization
Runhao Zeng
Wenbing Huang
Mingkui Tan
Yu Rong
P. Zhao
Junzhou Huang
Chuang Gan
GNN
87
480
0
07 Sep 2019
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
Evangelos Kazakos
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
62
337
0
22 Aug 2019
Learning Individual Styles of Conversational Gesture
Shiry Ginosar
Amir Bar
Gefen Kohavi
Caroline Chan
Andrew Owens
Jitendra Malik
SLR
45
332
0
10 Jun 2019
Speech2Face: Learning the Face Behind a Voice
Tae-Hyun Oh
Tali Dekel
Changil Kim
Inbar Mosseri
William T. Freeman
Michael Rubinstein
Wojciech Matusik
SSL
CVBM
104
163
0
23 May 2019
DeepGCNs: Can GCNs Go as Deep as CNNs?
Ge Li
Matthias Muller
Ali K. Thabet
Guohao Li
3DPC
GNN
130
1,349
0
07 Apr 2019
Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
Jia-Xing Zhong
Nannan Li
Weijie Kong
Shan Liu
Thomas H. Li
Ge Li
NoLa
SSL
114
406
0
18 Mar 2019
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
179
480
0
12 Dec 2018
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
131
709
0
06 Dec 2018
Compact Generalized Non-local Network
Kaiyu Yue
Ming Sun
Yuchen Yuan
Feng Zhou
Errui Ding
Fuxin Xu
49
162
0
31 Oct 2018
Exploring Visual Relationship for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
76
834
0
19 Sep 2018
Dual Attention Network for Scene Segmentation
J. Fu
Qingbin Liu
Haijie Tian
Yong Li
Yongjun Bao
Zhiwei Fang
Hanqing Lu
SSeg
322
5,108
0
09 Sep 2018
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
102
221
0
28 Jul 2018
W-TALC: Weakly-supervised Temporal Activity Localization and Classification
S. Paul
Sourya Roy
Amit K. Roy-Chowdhury
80
309
0
27 Jul 2018
Videos as Space-Time Region Graphs
Xinyu Wang
Abhinav Gupta
106
756
0
05 Jun 2018
Eye in the Sky: Real-time Drone Surveillance System (DSS) for Violent Individuals Identification using ScatterNet Hybrid Deep Learning Network
Amarjot Singh
Devendra Patil
SN Omkar
59
119
0
03 Jun 2018
Learning to Localize Sound Source in Visual Scenes
Arda Senocak
Tae-Hyun Oh
Junsik Kim
Ming-Hsuan Yang
In So Kweon
SSL
66
344
0
10 Mar 2018
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
Sijie Yan
Yuanjun Xiong
Dahua Lin
GNN
241
4,172
0
23 Jan 2018
Real-world Anomaly Detection in Surveillance Videos
Waqas Sultani
Chen Chen
M. Shah
AI4TS
177
1,488
0
12 Jan 2018
Relation Networks for Object Detection
Han Hu
Jiayuan Gu
Zheng Zhang
Jifeng Dai
Yichen Wei
ObjD
122
1,223
0
30 Nov 2017
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
98
1,039
0
22 Nov 2017
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
289
8,906
0
21 Nov 2017
Graph Attention Networks
Petar Velickovic
Guillem Cucurull
Arantxa Casanova
Adriana Romero
Pietro Lio
Yoshua Bengio
GNN
479
20,164
0
30 Oct 2017
Learning to Detect Violent Videos using Convolutional Long Short-Term Memory
Swathikiran Sudhakaran
Oswald Lanz
58
216
0
19 Sep 2017
See, Hear, and Read: Deep Aligned Representations
Y. Aytar
Carl Vondrick
Antonio Torralba
VLM
AI4TS
92
136
0
03 Jun 2017
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Chunhui Gu
Chen Sun
David A. Ross
Carl Vondrick
C. Pantofaru
...
G. Toderici
Susanna Ricco
Rahul Sukthankar
Cordelia Schmid
Jitendra Malik
VGen
107
1,030
0
23 May 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
235
8,019
0
22 May 2017
SoundNet: Learning Sound Representations from Unlabeled Video
Y. Aytar
Carl Vondrick
Antonio Torralba
SSL
117
1,044
0
27 Oct 2016
CNN Architectures for Large-Scale Audio Classification
Shawn Hershey
Sourish Chaudhuri
D. Ellis
J. Gemmeke
A. Jansen
...
Rif A. Saurous
Bryan Seybold
M. Slaney
Ron J. Weiss
K. Wilson
123
2,506
0
29 Sep 2016
Semi-Supervised Classification with Graph Convolutional Networks
Thomas Kipf
Max Welling
GNN
SSL
644
29,076
0
09 Sep 2016
Learning Temporal Regularity in Video Sequences
Mahmudul Hasan
Jonghyun Choi
J. Neumann
Amit K. Roy-Chowdhury
L. Davis
169
1,106
0
15 Apr 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,020
0
10 Dec 2015
1