1901.10430
Pay Less Attention with Lightweight and Dynamic Convolutions
29 January 2019
Felix Wu
Angela Fan
Alexei Baevski
Yann N. Dauphin
Michael Auli
Papers citing
"Pay Less Attention with Lightweight and Dynamic Convolutions"
50 / 241 papers shown
Title
A Call for Clarity in Beam Search: How It Works and When It Stops
Jungo Kasai
Keisuke Sakaguchi
Ronan Le Bras
Dragomir R. Radev
Yejin Choi
Noah A. Smith
104
9
0
11 Apr 2022
MAESTRO: Matched Speech Text Representations through Modality Matching
Zhehuai Chen
Yu Zhang
Andrew Rosenberg
Bhuvana Ramabhadran
Pedro J. Moreno
Ankur Bapna
Heiga Zen
98
108
0
07 Apr 2022
MaxViT: Multi-Axis Vision Transformer
Zhengzhong Tu
Hossein Talebi
Han Zhang
Feng Yang
P. Milanfar
A. Bovik
Yinxiao Li
ViT
163
676
0
04 Apr 2022
COOL, a Context Outlooker, and its Application to Question Answering and other Natural Language Processing Tasks
Fangyi Zhu
See-Kiong Ng
S. Bressan
LRM
58
1
0
01 Apr 2022
Logit Normalization for Long-tail Object Detection
Liang Zhao
Yao Teng
Limin Wang
91
11
0
31 Mar 2022
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs
Xiaohan Ding
Xinming Zhang
Yi Zhou
Jungong Han
Guiguang Ding
Jian Sun
VLM
153
554
0
13 Mar 2022
Look Backward and Forward: Self-Knowledge Distillation with Bidirectional Decoder for Neural Machine Translation
Xuan Zhang
Libin Shen
Disheng Pan
Liangguo Wang
Yanjun Miao
47
1
0
10 Mar 2022
EdgeFormer: A Parameter-Efficient Transformer for On-Device Seq2seq Generation
Tao Ge
Si-Qing Chen
Furu Wei
MoE
91
23
0
16 Feb 2022
General-purpose, long-context autoregressive modeling with Perceiver AR
Curtis Hawthorne
Andrew Jaegle
Cătălina Cangea
Sebastian Borgeaud
C. Nash
...
Hannah R. Sheahan
Neil Zeghidour
Jean-Baptiste Alayrac
João Carreira
Jesse Engel
115
66
0
15 Feb 2022
Improving Neural Machine Translation by Denoising Training
Liang Ding
Keqin Peng
Dacheng Tao
VLM
AI4CE
90
6
0
19 Jan 2022
PAEG: Phrase-level Adversarial Example Generation for Neural Machine Translation
Juncheng Wan
Jian Yang
Shuming Ma
Dongdong Zhang
Weinan Zhang
Yong Yu
Zhoujun Li
SILM
AAML
71
5
0
06 Jan 2022
GAT-CADNet: Graph Attention Network for Panoptic Symbol Spotting in CAD Drawings
Zhaohua Zheng
Jianfang Li
Lingjie Zhu
Honghua Li
F. Petzold
Ping Tan
47
15
0
03 Jan 2022
Spatio-temporal Relation Modeling for Few-shot Action Recognition
Anirudh Thatipelli
Sanath Narayan
Salman Khan
Rao Muhammad Anwer
Fahad Shahbaz Khan
Guohao Li
ViT
83
92
0
09 Dec 2021
3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis
Jianhui Yu
Chaoyi Zhang
Heng Wang
Dingxin Zhang
Yang Song
Tiange Xiang
Dongnan Liu
Weidong (Tom) Cai
ViT
MedIm
81
32
0
09 Dec 2021
OW-DETR: Open-world Detection Transformer
Akshita Gupta
Sanath Narayan
K. J. Joseph
Salman Khan
Fahad Shahbaz Khan
M. Shah
ViT
96
175
0
02 Dec 2021
Dynamic Parameterized Network for CTR Prediction
Jian Zhu
Congcong Liu
Pei Wang
Xiwei Zhao
Guangpeng Chen
Junsheng Jin
Changping Peng
Zhangang Lin
Jingping Shao
54
2
0
09 Nov 2021
Mixed Transformer U-Net For Medical Image Segmentation
Hongyi Wang
Shiao Xie
Lanfen Lin
Yutaro Iwamoto
X. Han
Yenwei Chen
Ruofeng Tong
ViT
MedIm
76
255
0
08 Nov 2021
Direct Multi-view Multi-person 3D Pose Estimation
Tao Wang
Jianfeng Zhang
Yujun Cai
Shuicheng Yan
Jiashi Feng
3DH
92
95
0
07 Nov 2021
Towards Building ASR Systems for the Next Billion Users
Tahir Javed
Sumanth Doddapaneni
A. Raman
Kaushal Bhogale
Gowtham Ramesh
Anoop Kunchukuttan
Pratyush Kumar
Mitesh M. Khapra
84
55
0
06 Nov 2021
Efficiently Modeling Long Sequences with Structured State Spaces
Albert Gu
Karan Goel
Christopher Ré
287
1,859
0
31 Oct 2021
Empirical Analysis of Korean Public AI Hub Parallel Corpora and in-depth Analysis using LIWC
Chanjun Park
Midan Shim
Sugyeong Eo
Seolhwa Lee
Jaehyung Seo
Hyeonseok Moon
Heuiseok Lim
28
8
0
28 Oct 2021
GNN-LM: Language Modeling based on Global Contexts via GNN
Yuxian Meng
Shi Zong
Xiaoya Li
Xiaofei Sun
Tianwei Zhang
Leilei Gan
Jiwei Li
LRM
127
39
0
17 Oct 2021
Taming Sparsely Activated Transformer with Stochastic Experts
Simiao Zuo
Xiaodong Liu
Jian Jiao
Young Jin Kim
Hany Hassan
Ruofei Zhang
T. Zhao
Jianfeng Gao
MoE
125
115
0
08 Oct 2021
The NiuTrans System for WNGT 2020 Efficiency Task
Chi Hu
Bei Li
Ye Lin
Yinqiao Li
Yanyang Li
Chenglong Wang
Tong Xiao
Jingbo Zhu
33
7
0
16 Sep 2021
Improving Neural Machine Translation by Bidirectional Training
Liang Ding
Di Wu
Dacheng Tao
81
30
0
16 Sep 2021
Topic-Aware Contrastive Learning for Abstractive Dialogue Summarization
Junpeng Liu
Yanyan Zou
Hainan Zhang
Hongshen Chen
Zhuoye Ding
Caixia Yuan
Xiaojie Wang
65
66
0
10 Sep 2021
BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation
Haoran Xu
Benjamin Van Durme
Kenton W. Murray
110
62
0
09 Sep 2021
Low-Resource Dialogue Summarization with Domain-Agnostic Multi-Source Pretraining
Yicheng Zou
Bolin Zhu
Xingwu Hu
Tao Gui
Qi Zhang
141
32
0
09 Sep 2021
Survey of Low-Resource Machine Translation
Barry Haddow
Rachel Bawden
Antonio Valerio Miceli Barone
Jindřich Helcl
Alexandra Birch
AIMat
118
164
0
01 Sep 2021
Lightweight Self-Attentive Sequential Recommendation
Yang Li
Tong Chen
Pengfei Zhang
Hongzhi Yin
HAI
AI4TS
81
109
0
25 Aug 2021
Discriminative Region-based Multi-Label Zero-Shot Learning
Sanath Narayan
Akshita Gupta
Salman Khan
Fahad Shahbaz Khan
Ling Shao
M. Shah
VLM
120
48
0
20 Aug 2021
Adaptive Graph Convolution for Point Cloud Analysis
Hao Zhou
Yidan Feng
Mingsheng Fang
Mingqiang Wei
J. Qin
Tong Lu
3DPC
108
142
0
18 Aug 2021
Fast Convergence of DETR with Spatially Modulated Co-Attention
Peng Gao
Minghang Zheng
Xiaogang Wang
Jifeng Dai
Hongsheng Li
ViT
93
308
0
05 Aug 2021
Dialogue Summarization with Supporting Utterance Flow Modeling and Fact Regularization
Wang Chen
Piji Li
Hou Pong Chan
Irwin King
HILM
AI4TS
53
10
0
03 Aug 2021
A Survey on Deep Domain Adaptation and Tiny Object Detection Challenges, Techniques and Datasets
Muhammed Muzammul
Xi Li
ObjD
101
11
0
16 Jul 2021
AutoBERT-Zero: Evolving BERT Backbone from Scratch
Jiahui Gao
Hang Xu
Han Shi
Xiaozhe Ren
Philip L. H. Yu
Xiaodan Liang
Xin Jiang
Zhenguo Li
85
37
0
15 Jul 2021
Visual Parser: Representing Part-whole Hierarchies with Transformers
Shuyang Sun
Xiaoyu Yue
S. Bai
Philip Torr
128
27
0
13 Jul 2021
ABD-Net: Attention Based Decomposition Network for 3D Point Cloud Decomposition
Siddharth Katageri
S. V. Kudari
Akshaykumar Gunari
R. Tabib
U. Mudenagudi
3DPC
53
5
0
09 Jul 2021
A Survey on Dialogue Summarization: Recent Advances and New Frontiers
Xiachong Feng
Xiaocheng Feng
Bing Qin
80
103
0
07 Jul 2021
VOLO: Vision Outlooker for Visual Recognition
Li-xin Yuan
Qibin Hou
Zihang Jiang
Jiashi Feng
Shuicheng Yan
ViT
135
328
0
24 Jun 2021
Probabilistic Attention for Interactive Segmentation
Prasad Gabbur
Manjot Bilkhu
J. Movellan
103
13
0
23 Jun 2021
Eigen Analysis of Self-Attention and its Reconstruction from Partial Computation
Srinadh Bhojanapalli
Ayan Chakrabarti
Himanshu Jain
Sanjiv Kumar
Michal Lukasik
Andreas Veit
65
8
0
16 Jun 2021
HELP: Hardware-Adaptive Efficient Latency Prediction for NAS via Meta-Learning
Hayeon Lee
Sewoong Lee
Song Chong
Sung Ju Hwang
83
26
0
16 Jun 2021
Coreference-Aware Dialogue Summarization
Zhengyuan Liu
Ke Shi
Nancy F. Chen
82
60
0
16 Jun 2021
GroupBERT: Enhanced Transformer Architecture with Efficient Grouped Structures
Ivan Chelombiev
Daniel Justus
Douglas Orr
A. Dietrich
Frithjof Gressmann
A. Koliousis
Carlo Luschi
60
5
0
10 Jun 2021
Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models
Tyler A. Chang
Yifan Xu
Weijian Xu
Zhuowen Tu
ViT
57
15
0
10 Jun 2021
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
202
1,150
0
08 Jun 2021
On the Connection between Local Attention and Dynamic Depth-wise Convolution
Qi Han
Zejia Fan
Qi Dai
Lei-huan Sun
Ming-Ming Cheng
Jiaying Liu
Jingdong Wang
ViT
123
112
0
08 Jun 2021
On the Language Coverage Bias for Neural Machine Translation
Shuo Wang
Zhaopeng Tu
Zhixing Tan
Shuming Shi
Maosong Sun
Yang Liu
54
21
0
07 Jun 2021
Attention mechanisms and deep learning for machine vision: A survey of the state of the art
A. M. Hafiz
S. A. Parah
R. A. Bhat
93
45
0
03 Jun 2021