1711.05101
Decoupled Weight Decay Regularization
14 November 2017
I. Loshchilov
Frank Hutter
OffRL
Papers citing
"Decoupled Weight Decay Regularization"
50 / 381 papers shown
Title
LOANet: A Lightweight Network Using Object Attention for Extracting Buildings and Roads from UAV Aerial Remote Sensing Images
Xiaoxiang Han
Yiman Liu
Gang Liu
Yuanjie Lin
Qiaohong Liu
27
11
0
16 Dec 2022
Co-training 2^L Submodels for Visual Recognition
Hugo Touvron
Matthieu Cord
Maxime Oquab
Piotr Bojanowski
Jakob Verbeek
Hervé Jégou
VLM
35
9
0
09 Dec 2022
Discovering Latent Knowledge in Language Models Without Supervision
Collin Burns
Haotian Ye
Dan Klein
Jacob Steinhardt
70
327
0
07 Dec 2022
Estimation of fibre architecture and scar in myocardial tissue using electrograms: an in-silico study
Konstantinos Ntagiantas
E. Pignatelli
N. Peters
C. Cantwell
R. Chowdhury
Anil A. Bharath
21
1
0
06 Dec 2022
Pretrained Diffusion Models for Unified Human Motion Synthesis
Jianxin Ma
Shuai Bai
Chang Zhou
DiffM
VGen
AI4CE
33
31
0
06 Dec 2022
Automatic Generation of Factual News Headlines in Finnish
Maximilian Koppatz
Khalid Alnajjar
Mika Hämäläinen
Thierry Poibeau
29
2
0
05 Dec 2022
FREDSR: Fourier Residual Efficient Diffusive GAN for Single Image Super Resolution
Kyoungwan Woo
Achyuta Rajaram
DiffM
27
1
0
30 Nov 2022
STAGE: Span Tagging and Greedy Inference Scheme for Aspect Sentiment Triplet Extraction
Shuo Liang
Wei Wei
Xian-Ling Mao
Yuanyuan Fu
Rui Fang
Dangyang Chen
40
40
0
28 Nov 2022
SPCXR: Self-supervised Pretraining using Chest X-rays Towards a Domain Specific Foundation Model
Syed Muhammad Anwar
Abhijeet Parida
Sara Atito
Muhammad Awais
G. Nino
Josef Kittler
M. Linguraru
ViT
SSL
OOD
29
6
0
23 Nov 2022
Unifying Tracking and Image-Video Object Detection
Peirong Liu
Rui Wang
Pengchuan Zhang
Omid Poursaeed
Yipin Zhou
Xuefei Cao
Sreya Dutta Roy
Ashish Shah
Ser-Nam Lim
21
0
0
20 Nov 2022
D^3ETR: Decoder Distillation for Detection Transformer
Xiaokang Chen
Jiahui Chen
Yong-Jin Liu
Gang Zeng
42
16
0
17 Nov 2022
UniFormerV2: Spatiotemporal Learning by Arming Image ViTs with Video UniFormer
Kunchang Li
Yali Wang
Yinan He
Yizhuo Li
Yi Wang
Limin Wang
Yu Qiao
ViT
30
107
0
17 Nov 2022
Easy to Decide, Hard to Agree: Reducing Disagreements Between Saliency Methods
Josip Jukić
Martin Tutek
Jan Snajder
FAtt
21
0
0
15 Nov 2022
Gradient Imitation Reinforcement Learning for General Low-Resource Information Extraction
Xuming Hu
Shiao Meng
Chenwei Zhang
Xiangli Yang
Lijie Wen
Irwin King
Philip S. Yu
52
0
0
11 Nov 2022
Active Relation Discovery: Towards General and Label-aware Open Relation Extraction
Yong Li
Hai-Tao Zheng
Xi Chen
Haitao Zheng
Ying Shen
Hong-Gee Kim
VLM
11
14
0
08 Nov 2022
MuMIC -- Multimodal Embedding for Multi-label Image Classification with Tempered Sigmoid
Feng Wang
Sarai Mizrachi
Moran Beladev
Guy Nadav
Gil Amsalem
Karen Lastmann Assaraf
Hadas Harush Boker
VLM
22
13
0
02 Nov 2022
FADO: Feedback-Aware Double COntrolling Network for Emotional Support Conversation
Wei Peng
Ziyuan Qin
Yue Hu
Yuqiang Xie
Yunpeng Li
27
29
0
01 Nov 2022
A simple, efficient and scalable contrastive masked autoencoder for learning visual representations
Shlok Kumar Mishra
Joshua Robinson
Huiwen Chang
David Jacobs
Aaron Sarna
Aaron Maschinot
Dilip Krishnan
DiffM
43
30
0
30 Oct 2022
Pair DETR: Contrastive Learning Speeds Up DETR Training
M. Iranmanesh
Xiaotong Chen
Kuo-Chin Lien
ViT
21
0
0
29 Oct 2022
Universal and Independent: Multilingual Probing Framework for Exhaustive Model Interpretation and Evaluation
O. Serikov
Vitaly Protasov
E. Voloshina
V. Knyazkova
Tatiana Shavrina
35
3
0
24 Oct 2022
fMRI from EEG is only Deep Learning away: the use of interpretable DL to unravel EEG-fMRI relationships
A. Kovalev
Ilia Mikheev
A. Ossadtchi
15
3
0
23 Oct 2022
A BERT-based Deep Learning Approach for Reputation Analysis in Social Media
Mohammad Wali Ur Rahman
Sicong Shao
Pratik Satam
Salim Hariri
Chris Padilla
Zoe Taylor
C. Nevarez
22
5
0
23 Oct 2022
Scene Text Recognition with Semantics
Joshua Cesare Placidi
Yishu Miao
Zixu Wang
Lucia Specia
21
1
0
19 Oct 2022
Linguistic Rules-Based Corpus Generation for Native Chinese Grammatical Error Correction
Shirong Ma
Hai-Tao Zheng
Rongyi Sun
Qingyu Zhou
Shulin Huang
...
Ruiyang Liu
Zhongli Li
Yunbo Cao
Haitao Zheng
Ying Shen
26
25
0
19 Oct 2022
Parameter-Efficient Masking Networks
Yue Bai
Huan Wang
Xu Ma
Yitian Zhang
Zhiqiang Tao
Yun Fu
23
10
0
13 Oct 2022
Improving Multi-turn Emotional Support Dialogue Generation with Lookahead Strategy Planning
Yi Cheng
Wenge Liu
Wenjie Li
Jiashuo Wang
Ruihui Zhao
Bang Liu
Xiaodan Liang
Yefeng Zheng
35
50
0
09 Oct 2022
Multi-Scale Wavelet Transformer for Face Forgery Detection
Jie Liu
Jingjing Wang
Peng Zhang
Chunmao Wang
Di Xie
Shiliang Pu
ViT
CVBM
41
8
0
08 Oct 2022
Bridged Transformer for Vision and Point Cloud 3D Object Detection
Yikai Wang
Tengqi Ye
Lele Cao
Wen-bing Huang
Gang Hua
Fengxiang He
Dacheng Tao
ViT
45
34
0
04 Oct 2022
SmallCap: Lightweight Image Captioning Prompted with Retrieval Augmentation
R. Ramos
Bruno Martins
Desmond Elliott
Yova Kementchedjhieva
VLM
30
86
0
30 Sep 2022
Audio Barlow Twins: Self-Supervised Audio Representation Learning
Jonah Anton
H. Coppock
Pancham Shukla
Bjorn W. Schuller
BDL
SSL
43
8
0
28 Sep 2022
Learned Force Fields Are Ready For Ground State Catalyst Discovery
Michael Schaarschmidt
M. Rivière
A. Ganose
J. Spencer
Alex Gaunt
J. Kirkpatrick
Simon Axelrod
Peter W. Battaglia
Jonathan Godwin
26
10
0
26 Sep 2022
Leveraging Self-Supervised Training for Unintentional Action Recognition
Enea Duka
Anna Kukleva
Bernt Schiele
30
1
0
23 Sep 2022
ET5: A Novel End-to-end Framework for Conversational Machine Reading Comprehension
Xiao Zhang
Heyan Huang
Zewen Chi
Xian-Ling Mao
LRM
40
2
0
23 Sep 2022
Scope of Pre-trained Language Models for Detecting Conflicting Health Information
Josepho D. Gatto
Madhusudan Basak
S. Preum
32
7
0
22 Sep 2022
Predicting Brain Multigraph Population From a Single Graph Template for Boosting One-Shot Classification
Furkan Pala
I. Rekik
38
2
0
13 Sep 2022
Enhancing Semantic Understanding with Self-supervised Methods for Abstractive Dialogue Summarization
Hyun-Yong Lee
Jaewoong Yun
Hyunjin Choi
Seongho Joe
Youngjune Gwon
21
3
0
01 Sep 2022
SB-SSL: Slice-Based Self-Supervised Transformers for Knee Abnormality Classification from MRI
Sara Atito
Syed Muhammad Anwar
Muhammad Awais
Josef Kittler
ViT
MedIm
29
12
0
29 Aug 2022
Minkowski Tracker: A Sparse Spatio-Temporal R-CNN for Joint Object Detection and Tracking
JunYoung Gwak
Silvio Savarese
Jeannette Bohg
VOT
27
13
0
22 Aug 2022
SSDPT: Self-Supervised Dual-Path Transformer for Anomalous Sound Detection in Machine Condition Monitoring
Jisheng Bai
Jianfeng Chen
Mou Wang
Muhammad Saad Ayub
Qingli Yan
54
15
0
06 Aug 2022
Multi-Feature Vision Transformer via Self-Supervised Representation Learning for Improvement of COVID-19 Diagnosis
Xiao Qi
D. Foran
J. Nosher
I. Hacihaliloglu
ViT
MedIm
30
3
0
03 Aug 2022
Automatically Discovering Novel Visual Categories with Self-supervised Prototype Learning
Lu Zhang
Lu Qi
Xu Yang
Hong Qiao
Ming Yang
Zhiyong Liu
SSL
38
3
0
01 Aug 2022
Group DETR: Fast DETR Training with Group-Wise One-to-Many Assignment
Qiang Chen
Xiaokang Chen
Jian Wang
Shan Zhang
Kun Yao
Haocheng Feng
Junyu Han
Errui Ding
Gang Zeng
Jingdong Wang
ViT
49
120
0
26 Jul 2022
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models
Huy Ha
Shuran Song
LM&Ro
VLM
43
102
0
23 Jul 2022
On the cross-lingual transferability of multilingual prototypical models across NLU tasks
Oralie Cattan
Christophe Servan
S. Rosset
27
9
0
19 Jul 2022
Time Is MattEr: Temporal Self-supervision for Video Transformers
Sukmin Yun
Jaehyung Kim
Dongyoon Han
Hwanjun Song
Jung-Woo Ha
Jinwoo Shin
ViT
19
12
0
19 Jul 2022
Conditional DETR V2: Efficient Detection Transformer with Box Queries
Xiaokang Chen
Fangyun Wei
Gang Zeng
Jingdong Wang
ViT
30
33
0
18 Jul 2022
Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding
Quan Liu
Youpeng Wen
Jianhua Han
Chunjing Xu
Hang Xu
Xiaodan Liang
VLM
26
67
0
18 Jul 2022
Neighbor Correspondence Matching for Flow-based Video Frame Synthesis
Zhaoyang Jia
Yan-Heng Lu
Houqiang Li
22
14
0
14 Jul 2022
VidConv: A modernized 2D ConvNet for Efficient Video Recognition
Chuong H. Nguyen
Su Huynh
Vinh Nguyen
Ngoc-Khanh Nguyen
ViT
27
3
0
08 Jul 2022
Transformers discover an elementary calculation system exploiting local attention and grid-like problem representation
Samuel Cognolato
Alberto Testolin
42
7
0
06 Jul 2022