ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2007.04687
  4. Cited By
Not only Look, but also Listen: Learning Multimodal Violence Detection
  under Weak Supervision
v1v2 (latest)

Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision

9 July 2020
Peng Wu
Jing Liu
Yujia Shi
Yujia Sun
Fang Shao
Zhaoyang Wu
Zhiwei Yang
ArXiv (abs)PDFHTML

Papers citing "Not only Look, but also Listen: Learning Multimodal Violence Detection under Weak Supervision"

44 / 44 papers shown
Title
The Role of Video Generation in Enhancing Data-Limited Action Understanding
The Role of Video Generation in Enhancing Data-Limited Action Understanding
Wei Li
Dezhao Luo
Dongbao Yang
Zhenhang Li
Weiping Wang
Yu Zhou
DiffMVGen
269
0
0
26 May 2025
Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection
Uncertainty-Weighted Image-Event Multimodal Fusion for Video Anomaly Detection
SungHeon Jeong
Jihong Park
Mohsen Imani
158
0
0
05 May 2025
Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization
Chain-of-Thought Textual Reasoning for Few-shot Temporal Action Localization
Hongwei Ji
Wulian Yun
Mengshi Qi
Huadong Ma
LRM
426
0
0
18 Apr 2025
Hoi2Anomaly: An Explainable Anomaly Detection Approach Guided by Human-Object Interaction
Hoi2Anomaly: An Explainable Anomaly Detection Approach Guided by Human-Object Interaction
Yuhan Wang
Cheng Liu
Daou Zhang
Weichao Wu
92
0
0
13 Mar 2025
Detection, Retrieval, and Explanation Unified: A Violence Detection System Based on Knowledge Graphs and GAT
Detection, Retrieval, and Explanation Unified: A Violence Detection System Based on Knowledge Graphs and GAT
Wen-Dong Jiang
Chih-Yung Chang
Diptendu Sinha Roy
128
0
0
07 Jan 2025
Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection
Cross-Modal Fusion and Attention Mechanism for Weakly Supervised Video Anomaly Detection
Ayush Ghadiya
P. Kar
Vishal M. Chudasama
Pankaj Wasnik
101
3
0
31 Dec 2024
Do Language Models Understand Time?
Do Language Models Understand Time?
Xi Ding
Lei Wang
272
1
0
18 Dec 2024
Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion
Learnable Expansion of Graph Operators for Multi-Modal Feature Fusion
Dexuan Ding
Lei Wang
Liyun Zhu
Tom Gedeon
Piotr Koniusz
111
9
0
02 Oct 2024
MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation
MissionGNN: Hierarchical Multimodal GNN-based Weakly Supervised Video Anomaly Recognition with Mission-Specific Knowledge Graph Generation
Sanggeon Yun
Ryozo Masukawa
Minhyoung Na
Mohsen Imani
99
8
0
27 Jun 2024
Distilling Aggregated Knowledge for Weakly-Supervised Video Anomaly Detection
Distilling Aggregated Knowledge for Weakly-Supervised Video Anomaly Detection
Jash Dalvi
Ali Dabouei
Gunjan Dhanuka
Min Xu
57
0
0
05 Jun 2024
Networking Systems for Video Anomaly Detection: A Tutorial and Survey
Networking Systems for Video Anomaly Detection: A Tutorial and Survey
Jing Liu
Yang Liu
Jieyu Lin
Jielin Li
Peng Sun
Bo Hu
Liang Song
Azzedine Boukerche
Victor C.M. Leung
Victor C.M. Leung
179
12
0
16 May 2024
A Closer Look at AUROC and AUPRC under Class Imbalance
A Closer Look at AUROC and AUPRC under Class Imbalance
Matthew B. A. McDermott
Lasse Hyldig Hansen
Haoran Zhang
Giovanni Angelotti
Jack Gallifant
121
32
0
11 Jan 2024
Graph Anomaly Detection in Time Series: A Survey
Graph Anomaly Detection in Time Series: A Survey
Thi Kieu Khanh Ho
Ali Karami
Narges Armanfard
AI4TS
110
7
0
31 Jan 2023
Graph Convolutional Networks for Temporal Action Localization
Graph Convolutional Networks for Temporal Action Localization
Runhao Zeng
Wenbing Huang
Mingkui Tan
Yu Rong
P. Zhao
Junzhou Huang
Chuang Gan
GNN
87
480
0
07 Sep 2019
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action
  Recognition
EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition
Evangelos Kazakos
Arsha Nagrani
Andrew Zisserman
Dima Damen
EgoV
62
337
0
22 Aug 2019
Learning Individual Styles of Conversational Gesture
Learning Individual Styles of Conversational Gesture
Shiry Ginosar
Amir Bar
Gefen Kohavi
Caroline Chan
Andrew Owens
Jitendra Malik
SLR
45
332
0
10 Jun 2019
Speech2Face: Learning the Face Behind a Voice
Speech2Face: Learning the Face Behind a Voice
Tae-Hyun Oh
Tali Dekel
Changil Kim
Inbar Mosseri
William T. Freeman
Michael Rubinstein
Wojciech Matusik
SSLCVBM
104
163
0
23 May 2019
DeepGCNs: Can GCNs Go as Deep as CNNs?
DeepGCNs: Can GCNs Go as Deep as CNNs?
Ge Li
Matthias Muller
Ali K. Thabet
Guohao Li
3DPCGNN
130
1,349
0
07 Apr 2019
Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action
  Classifier for Anomaly Detection
Graph Convolutional Label Noise Cleaner: Train a Plug-and-play Action Classifier for Anomaly Detection
Jia-Xing Zhong
Nannan Li
Weijie Kong
Shan Liu
Thomas H. Li
Ge Li
NoLaSSL
114
406
0
18 Mar 2019
Long-Term Feature Banks for Detailed Video Understanding
Long-Term Feature Banks for Detailed Video Understanding
Chao-Yuan Wu
Christoph Feichtenhofer
Haoqi Fan
Kaiming He
Philipp Krahenbuhl
Ross B. Girshick
179
480
0
12 Dec 2018
Video Action Transformer Network
Video Action Transformer Network
Rohit Girdhar
João Carreira
Carl Doersch
Andrew Zisserman
ViT
131
709
0
06 Dec 2018
Compact Generalized Non-local Network
Compact Generalized Non-local Network
Kaiyu Yue
Ming Sun
Yuchen Yuan
Feng Zhou
Errui Ding
Fuxin Xu
49
162
0
31 Oct 2018
Exploring Visual Relationship for Image Captioning
Exploring Visual Relationship for Image Captioning
Ting Yao
Yingwei Pan
Yehao Li
Tao Mei
76
834
0
19 Sep 2018
Dual Attention Network for Scene Segmentation
Dual Attention Network for Scene Segmentation
J. Fu
Qingbin Liu
Haijie Tian
Yong Li
Yongjun Bao
Zhiwei Fang
Hanqing Lu
SSeg
322
5,108
0
09 Sep 2018
Actor-Centric Relation Network
Actor-Centric Relation Network
Chen Sun
Abhinav Shrivastava
Carl Vondrick
Kevin Patrick Murphy
Rahul Sukthankar
Cordelia Schmid
102
221
0
28 Jul 2018
W-TALC: Weakly-supervised Temporal Activity Localization and
  Classification
W-TALC: Weakly-supervised Temporal Activity Localization and Classification
S. Paul
Sourya Roy
Amit K. Roy-Chowdhury
80
309
0
27 Jul 2018
Videos as Space-Time Region Graphs
Videos as Space-Time Region Graphs
Xinyu Wang
Abhinav Gupta
106
756
0
05 Jun 2018
Eye in the Sky: Real-time Drone Surveillance System (DSS) for Violent
  Individuals Identification using ScatterNet Hybrid Deep Learning Network
Eye in the Sky: Real-time Drone Surveillance System (DSS) for Violent Individuals Identification using ScatterNet Hybrid Deep Learning Network
Amarjot Singh
Devendra Patil
SN Omkar
59
119
0
03 Jun 2018
Learning to Localize Sound Source in Visual Scenes
Learning to Localize Sound Source in Visual Scenes
Arda Senocak
Tae-Hyun Oh
Junsik Kim
Ming-Hsuan Yang
In So Kweon
SSL
66
344
0
10 Mar 2018
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action
  Recognition
Spatial Temporal Graph Convolutional Networks for Skeleton-Based Action Recognition
Sijie Yan
Yuanjun Xiong
Dahua Lin
GNN
241
4,172
0
23 Jan 2018
Real-world Anomaly Detection in Surveillance Videos
Real-world Anomaly Detection in Surveillance Videos
Waqas Sultani
Chen Chen
M. Shah
AI4TS
177
1,488
0
12 Jan 2018
Relation Networks for Object Detection
Relation Networks for Object Detection
Han Hu
Jiayuan Gu
Zheng Zhang
Jifeng Dai
Yichen Wei
ObjD
122
1,223
0
30 Nov 2017
Temporal Relational Reasoning in Videos
Temporal Relational Reasoning in Videos
Bolei Zhou
A. Andonian
Aude Oliva
Antonio Torralba
NAI
98
1,039
0
22 Nov 2017
Non-local Neural Networks
Non-local Neural Networks
Xinyu Wang
Ross B. Girshick
Abhinav Gupta
Kaiming He
OffRL
289
8,906
0
21 Nov 2017
Graph Attention Networks
Graph Attention Networks
Petar Velickovic
Guillem Cucurull
Arantxa Casanova
Adriana Romero
Pietro Lio
Yoshua Bengio
GNN
479
20,164
0
30 Oct 2017
Learning to Detect Violent Videos using Convolutional Long Short-Term
  Memory
Learning to Detect Violent Videos using Convolutional Long Short-Term Memory
Swathikiran Sudhakaran
Oswald Lanz
58
216
0
19 Sep 2017
See, Hear, and Read: Deep Aligned Representations
See, Hear, and Read: Deep Aligned Representations
Y. Aytar
Carl Vondrick
Antonio Torralba
VLMAI4TS
92
136
0
03 Jun 2017
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual
  Actions
AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Chunhui Gu
Chen Sun
David A. Ross
Carl Vondrick
C. Pantofaru
...
G. Toderici
Susanna Ricco
Rahul Sukthankar
Cordelia Schmid
Jitendra Malik
VGen
107
1,030
0
23 May 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
235
8,019
0
22 May 2017
SoundNet: Learning Sound Representations from Unlabeled Video
SoundNet: Learning Sound Representations from Unlabeled Video
Y. Aytar
Carl Vondrick
Antonio Torralba
SSL
117
1,044
0
27 Oct 2016
CNN Architectures for Large-Scale Audio Classification
CNN Architectures for Large-Scale Audio Classification
Shawn Hershey
Sourish Chaudhuri
D. Ellis
J. Gemmeke
A. Jansen
...
Rif A. Saurous
Bryan Seybold
M. Slaney
Ron J. Weiss
K. Wilson
123
2,506
0
29 Sep 2016
Semi-Supervised Classification with Graph Convolutional Networks
Semi-Supervised Classification with Graph Convolutional Networks
Thomas Kipf
Max Welling
GNNSSL
644
29,076
0
09 Sep 2016
Learning Temporal Regularity in Video Sequences
Learning Temporal Regularity in Video Sequences
Mahmudul Hasan
Jonghyun Choi
J. Neumann
Amit K. Roy-Chowdhury
L. Davis
169
1,106
0
15 Apr 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,020
0
10 Dec 2015
1