2212.01638
VLG: General Video Recognition with Web Textual Knowledge
3 December 2022
Jintao Lin
Zhaoyang Liu
Wenhai Wang
Wayne Wu
Limin Wang
Papers citing
"VLG: General Video Recognition with Web Textual Knowledge"
28 / 78 papers shown
Learning Imbalanced Datasets with Label-Distribution-Aware Margin Loss
Kaidi Cao
Colin Wei
Adrien Gaidon
Nikos Arechiga
Tengyu Ma
107
1,600
0
18 Jun 2019
Specifying Weight Priors in Bayesian Deep Neural Networks with Empirical Bayes
R. Krishnan
Mahesh Subedar
Omesh Tickoo
BDL
39
47
0
12 Jun 2019
Towards VQA Models That Can Read
Amanpreet Singh
Vivek Natarajan
Meet Shah
Yu Jiang
Xinlei Chen
Dhruv Batra
Devi Parikh
Marcus Rohrbach
EgoV
69
1,215
0
18 Apr 2019
Large-Scale Long-Tailed Recognition in an Open World
Ziwei Liu
Zhongqi Miao
Xiaohang Zhan
Jiayun Wang
Boqing Gong
Stella X. Yu
142
1,158
0
10 Apr 2019
ODN: Opening the Deep Network for Open-set Action Recognition
Yu Shu
Yemin Shi
Yaowei Wang
Yixiong Zou
Qingsheng Yuan
Yonghong Tian
45
47
0
23 Jan 2019
Class-Balanced Loss Based on Effective Number of Samples
Yin Cui
Menglin Jia
Tsung-Yi Lin
Yang Song
Serge J. Belongie
183
2,276
0
16 Jan 2019
D3D: Distilled 3D Networks for Video Action Recognition
Jonathan C. Stroud
David A. Ross
Chen Sun
Jia Deng
Rahul Sukthankar
3DPC
54
160
0
19 Dec 2018
SlowFast Networks for Video Recognition
Christoph Feichtenhofer
Haoqi Fan
Jitendra Malik
Kaiming He
162
3,262
0
10 Dec 2018
TSM: Temporal Shift Module for Efficient Video Understanding
Ji Lin
Chuang Gan
Song Han
85
1,688
0
20 Nov 2018
BAR: Bayesian Activity Recognition using variational inference
R. Krishnan
Mahesh Subedar
S. Bhatnagar
BDL
UQCV
33
20
0
08 Nov 2018
Spatio-Temporal Channel Correlation Networks for Action Classification
Ali Diba
Mohsen Fayyaz
Vivek Sharma
M. M. Arzani
Rahman Yousefzadeh
Juergen Gall
Luc Van Gool
3DPC
59
181
0
19 Jun 2018
Moments in Time Dataset: one million videos for event understanding
Mathew Monfort
A. Andonian
Bolei Zhou
K. Ramakrishnan
Sarah Adel Bargal
...
L. Brown
Quanfu Fan
Dan Gutfreund
Carl Vondrick
A. Oliva
92
548
0
09 Jan 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
137
1,328
0
13 Dec 2017
A Closer Look at Spatiotemporal Convolutions for Action Recognition
Du Tran
Heng Wang
Lorenzo Torresani
Jamie Ray
Yann LeCun
Manohar Paluri
196
3,029
0
30 Nov 2017
A systematic study of the class imbalance problem in convolutional neural networks
Mateusz Buda
A. Maki
Maciej A. Mazurowski
207
2,362
0
15 Oct 2017
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
82
1,529
0
13 Jun 2017
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset
João Carreira
Andrew Zisserman
219
8,012
0
22 May 2017
The Kinetics Human Action Video Dataset
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
...
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
231
3,801
0
19 May 2017
Categorical Reparameterization with Gumbel-Softmax
Eric Jang
S. Gu
Ben Poole
BDL
297
5,364
0
03 Nov 2016
SGDR: Stochastic Gradient Descent with Warm Restarts
I. Loshchilov
Frank Hutter
ODL
288
8,114
0
13 Aug 2016
Temporal Segment Networks: Towards Good Practices for Deep Action Recognition
Limin Wang
Yuanjun Xiong
Zhe Wang
Yu Qiao
Dahua Lin
Xiaoou Tang
Luc Van Gool
ViT
98
3,831
0
02 Aug 2016
Convolutional Two-Stream Network Fusion for Video Action Recognition
Christoph Feichtenhofer
A. Pinz
Andrew Zisserman
156
2,610
0
22 Apr 2016
Relay Backpropagation for Effective Learning of Deep Convolutional Neural Networks
Li Shen
Zhouchen Lin
Qingming Huang
60
291
0
18 Dec 2015
Towards Open Set Deep Networks
Abhijit Bendale
Terrance Boult
BDL
EDL
101
1,426
0
19 Nov 2015
Cost Sensitive Learning of Deep Feature Representations from Imbalanced Data
Salman H. Khan
Munawar Hayat
Bennamoun
Ferdous Sohel
R. Togneri
72
882
0
14 Aug 2015
Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan
Andrew Zisserman
237
7,526
0
09 Jun 2014
UCF101: A Dataset of 101 Human Actions Classes From Videos in The Wild
K. Soomro
Amir Zamir
M. Shah
CLIP
VGen
137
6,145
0
03 Dec 2012
SMOTE: Synthetic Minority Over-sampling Technique
Nitesh Chawla
Kevin W. Bowyer
Lawrence Hall
W. Kegelmeyer
AI4TS
350
25,621
0
09 Jun 2011