2311.15599
Cited By
UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition
27 November 2023
Xiaohan Ding
Yiyuan Zhang
Yixiao Ge
Sijie Zhao
Lin Song
Xiangyu Yue
Ying Shan
VLM
AI4TS
SSL
Papers citing
"UniRepLKNet: A Universal Perception Large-Kernel ConvNet for Audio, Video, Point Cloud, Time-Series and Image Recognition"
38 / 38 papers shown
Vision-Centric Representation-Efficient Fine-Tuning for Robust Universal Foreground Segmentation
Guoyi Zhang
Siyang Chen
Guangsheng Xu
Han Wang
Xiaohu Zhang
34
0
0
20 Apr 2025
LSNet: See Large, Focus Small
Ao Wang
Hui Chen
Zijia Lin
J. Han
Guiguang Ding
42
0
0
29 Mar 2025
Frequency Dynamic Convolution for Dense Image Prediction
Linwei Chen
Lin Gu
Liang Li
C. Yan
Ying Fu
44
0
0
24 Mar 2025
OverLoCK: An Overview-first-Look-Closely-next ConvNet with Context-Mixing Dynamic Kernels
Meng Lou
Yizhou Yu
115
1
0
27 Feb 2025
TimeDART: A Diffusion Autoregressive Transformer for Self-Supervised Time Series Representation
Daoyu Wang
Mingyue Cheng
Z. Liu
Q. Liu
Enhong Chen
AI4TS
DiffM
47
1
0
24 Feb 2025
RecConv: Efficient Recursive Convolutions for Multi-Frequency Representations
Mingshu Zhao
Yi Luo
Yong Ouyang
38
0
0
27 Dec 2024
Learning Dynamic Local Context Representations for Infrared Small Target Detection
Guoyi Zhang
Guangsheng Xu
Han Wang
Siyang Chen
Yunxiao Shan
Xiaohu Zhang
29
1
0
23 Dec 2024
DREAM: Domain-agnostic Reverse Engineering Attributes of Black-box Model
Rongqing Li
Jiaqi Yu
Changsheng Li
Wenhan Luo
Ye Yuan
Guoren Wang
MLAU
78
0
0
08 Dec 2024
Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines
Zhixin Zhang
Yiyuan Zhang
Xiaohan Ding
Xiangyu Yue
24
3
0
28 Oct 2024
ContextDet: Temporal Action Detection with Adaptive Context Aggregation
Ning Wang
Yun Xiao
Xiaopeng Peng
Xiaojun Chang
Xuanhong Wang
Dingyi Fang
26
2
0
20 Oct 2024
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention
Nguyen Huu Bao Long
Chenyu Zhang
Yuzhi Shi
Tsubasa Hirakawa
Takayoshi Yamashita
Tohgoroh Matsui
H. Fujiyoshi
31
2
0
11 Oct 2024
Unveiling the Backbone-Optimizer Coupling Bias in Visual Representation Learning
Siyuan Li
Juanxi Tian
Zedong Wang
Luyuan Zhang
Zicheng Liu
Weiyang Jin
Yang Liu
Baigui Sun
Stan Z. Li
34
0
0
08 Oct 2024
Timer-XL: Long-Context Transformers for Unified Time Series Forecasting
Yong Liu
Guo Qin
Xiangdong Huang
Jianmin Wang
Mingsheng Long
AI4TS
29
6
0
07 Oct 2024
Reparameterized Multi-Resolution Convolutions for Long Sequence Modelling
Harry Jake Cunningham
Giorgio Giannone
Mingtian Zhang
M. Deisenroth
28
0
0
18 Aug 2024
Siamese Multiple Attention Temporal Convolution Networks for Human Mobility Signature Identification
Zhipeng Zheng
Yuchen Jiang
Shiyao Zhang
Xuetao Wei
22
0
0
17 Aug 2024
RoadFormer+: Delivering RGB-X Scene Parsing through Scale-Aware Information Decoupling and Advanced Heterogeneous Feature Fusion
Jianxin Huang
Jiahang Li
Ning Jia
Yuxiang Sun
Chengju Liu
Qijun Chen
Rui Fan
ViT
51
8
0
31 Jul 2024
Real-Time 3D Occupancy Prediction via Geometric-Semantic Disentanglement
Yulin He
Wei Chen
Tianci Xun
Yusong Tan
3DPC
47
0
0
18 Jul 2024
Revealing the Dark Secrets of Extremely Large Kernel ConvNets on Robustness
Honghao Chen
Yurong Zhang
Xiaokun Feng
Xiangxiang Chu
Kaiqi Huang
AAML
40
5
0
12 Jul 2024
RepNeXt: A Fast Multi-Scale CNN using Structural Reparameterization
Mingshu Zhao
Yi Luo
Yong Ouyang
34
2
0
23 Jun 2024
Explore the Limits of Omni-modal Pretraining at Scale
Yiyuan Zhang
Handong Li
Jing Liu
Xiangyu Yue
VLM
LRM
47
1
0
13 Jun 2024
GrootVL: Tree Topology is All You Need in State Space Model
Yicheng Xiao
Lin Song
Shaoli Huang
Jiangshan Wang
Siyu Song
Yixiao Ge
Xiu Li
Ying Shan
Mamba
41
10
0
04 Jun 2024
StarLKNet: Star Mixup with Large Kernel Networks for Palm Vein Identification
Xin Jin
Hongyu Zhu
M. El-Yacoubi
Hongchao Liao
Huafeng Qin
Yun Jiang
37
6
0
21 May 2024
Partial Large Kernel CNNs for Efficient Super-Resolution
Dongheon Lee
Seokju Yun
Youngmin Ro
SupR
36
1
0
18 Apr 2024
InteractiveVideo: User-Centric Controllable Video Generation with Synergistic Multimodal Instructions
Yiyuan Zhang
Yuhao Kang
Zhixin Zhang
Xiaohan Ding
Sanyuan Zhao
Xiangyu Yue
VGen
57
4
0
05 Feb 2024
Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities
Yiyuan Zhang
Xiaohan Ding
Kaixiong Gong
Yixiao Ge
Ying Shan
Xiangyu Yue
ViT
16
7
0
25 Jan 2024
Advancing Vision Transformers with Group-Mix Attention
Chongjian Ge
Xiaohan Ding
Zhan Tong
Li Yuan
Jiangliu Wang
Yibing Song
Ping Luo
112
16
0
26 Nov 2023
Multi-view learning for automatic classification of multi-wavelength auroral images
Qiuju Yang
Hang Su
Lili Liu
Yixuan Wang
Ze-Jun Hu
16
2
0
06 Nov 2023
Make Explicit Calibration Implicit: Calibrate Denoiser Instead of the Noise Model
Xin Jin
Jianqiang Xiao
Linghao Han
Chunle Guo
Xialei Liu
Chongyi Li
Ruixun Zhang
24
3
0
07 Aug 2023
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions
Wenhai Wang
Enze Xie
Xiang Li
Deng-Ping Fan
Kaitao Song
Ding Liang
Tong Lu
Ping Luo
Ling Shao
ViT
277
3,622
0
24 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
298
3,693
0
11 Feb 2021
Is Space-Time Attention All You Need for Video Understanding?
Gedas Bertasius
Heng Wang
Lorenzo Torresani
ViT
280
1,981
0
09 Feb 2021
PSLA: Improving Audio Tagging with Pretraining, Sampling, Labeling, and Aggregation
Yuan Gong
Yu-An Chung
James R. Glass
VLM
101
144
0
02 Feb 2021
Bottleneck Transformers for Visual Recognition
A. Srinivas
Tsung-Yi Lin
Niki Parmar
Jonathon Shlens
Pieter Abbeel
Ashish Vaswani
SLR
290
979
0
27 Jan 2021
RepVGG: Making VGG-style ConvNets Great Again
Xiaohan Ding
X. Zhang
Ningning Ma
Jungong Han
Guiguang Ding
Jian-jun Sun
136
1,546
0
11 Jan 2021
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,561
0
17 Apr 2017
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
C. Qi
Hao Su
Kaichun Mo
Leonidas J. Guibas
3DH
3DPC
3DV
PINN
222
14,099
0
02 Dec 2016
Densely Connected Convolutional Networks
Gao Huang
Zhuang Liu
L. van der Maaten
Kilian Q. Weinberger
PINN
3DV
252
36,362
0
25 Aug 2016
Semantic Understanding of Scenes through the ADE20K Dataset
Bolei Zhou
Hang Zhao
Xavier Puig
Tete Xiao
Sanja Fidler
Adela Barriuso
Antonio Torralba
SSeg
253
1,827
0
18 Aug 2016