ResearchTrend.AI
Pay Less Attention with Lightweight and Dynamic Convolutions

29 January 2019
Felix Wu
Angela Fan
Alexei Baevski
Yann N. Dauphin
Michael Auli

Papers citing "Pay Less Attention with Lightweight and Dynamic Convolutions"

50 / 241 papers shown
Ensemble-Based Survival Models with the Self-Attended Beran Estimator Predictions
Lev V. Utkin
Semen P. Khomets
Vlada A. Efremenko
A. Konstantinov
Natalya M. Verbova
17
0
0
09 Jun 2025
Revisiting Backdoor Attacks on Time Series Classification in the Frequency Domain
Yuanmin Huang
Mi Zhang
Zhaoxiang Wang
Wenxuan Li
Min Yang
AAML AI4TS
100
1
0
12 Mar 2025
On the Performance Analysis of Momentum Method: A Frequency Domain Perspective
Xianliang Li
Jun Luo
Zhiwei Zheng
Hanxiao Wang
Li Luo
Lingkun Wen
Linlong Wu
Sheng Xu
185
0
0
29 Nov 2024
TransfoRhythm: A Transformer Architecture Conductive to Blood Pressure Estimation via Solo PPG Signal Capturing
Amir Arjomand
Amin Boudesh
Farnoush Bayatmakou
Kenneth B. Kent
Arash Mohammadi
130
0
0
15 Apr 2024
Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling
Mahdi Karami
Ali Ghodsi
VLM
116
6
0
28 Feb 2024
Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition
Lei Liu
Li Liu
Haizhou Li
82
7
0
31 Jan 2024
Gated Linear Attention Transformers with Hardware-Efficient Training
Aaron Courville
Bailin Wang
Songlin Yang
Yikang Shen
Yoon Kim
128
180
0
11 Dec 2023
Surveying the Landscape of Text Summarization with Deep Learning: A Comprehensive Review
Guanghua Wang
Weili Wu
AI4TS AILaw
91
4
0
13 Oct 2023
Spherical and Hyperbolic Toric Topology-Based Codes On Graph Embedding for Ising MRF Models: Classical and Quantum Topology Machine Learning
V. Usatyuk
Sergey Egorov
Denis Sapozhnikov
79
3
0
28 Jul 2023
Edge Guided GANs with Multi-Scale Contrastive Learning for Semantic Image Synthesis
Hao Tang
Guolei Sun
N. Sebe
Luc Van Gool
GAN
103
12
0
22 Jul 2023
A Quantitative Review on Language Model Efficiency Research
Meng Jiang
Hy Dang
Lingbo Tong
76
0
0
28 May 2023
Parallel Data Helps Neural Entity Coreference Resolution
Gongbo Tang
Christian Hardmeier
60
3
0
28 May 2023
Neural Machine Translation for Mathematical Formulae
Felix Petersen
M. Schubotz
André Greiner-Petter
Bela Gipp
74
7
0
25 May 2023
Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT Operator
Ziwei He
Meng Yang
Minwei Feng
Jingcheng Yin
Xiang Wang
Jingwen Leng
Zhouhan Lin
ViT
97
14
0
24 May 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Vilém Zouhar
Shehzaad Dhuliawala
Wangchunshu Zhou
Nico Daheim
Tom Kocmi
Yuchen Eleanor Jiang
Mrinmaya Sachan
66
11
0
21 Jan 2023
A Large-scale Film Style Dataset for Learning Multi-frequency Driven Film Enhancement
Zinuo Li
Xuhang Chen
Shuqiang Wang
Chi-Man Pun
65
23
0
21 Jan 2023
From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition
Chao-Han Huck Yang
Yue Liu
Yu Zhang
Nanxin Chen
Rohit Prabhavalkar
Tara N. Sainath
Trevor Strohman
68
30
0
19 Jan 2023
HanoiT: Enhancing Context-aware Translation via Selective Context
Jian Yang
Yuwei Yin
Shuming Ma
Liqun Yang
Hongcheng Guo
Haoyang Huang
Dongdong Zhang
Yutao Zeng
Zhoujun Li
Furu Wei
85
5
0
17 Jan 2023
Exploring the Approximation Capabilities of Multiplicative Neural Networks for Smooth Functions
Ido Ben-Shaul
Tomer Galanti
S. Dekel
80
3
0
11 Jan 2023
Improving Continuous Sign Language Recognition with Consistency Constraints and Signer Removal
Ronglai Zuo
Brian Mak
SLR
113
21
0
26 Dec 2022
Findings of the WMT 2022 Shared Task on Translation Suggestion
Zhen Yang
Fandong Meng
Yingxue Zhang
Ernan Li
Jie Zhou
LRM
72
2
0
30 Nov 2022
Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Fenglin Liu
Xian Wu
Chenyu You
Shen Ge
Yuexian Zou
Xu Sun
95
25
0
22 Nov 2022
Domain Curricula for Code-Switched MT at MixMT 2022
Lekan Raheem
Maab Elrashid
52
1
0
31 Oct 2022
OTSeq2Set: An Optimal Transport Enhanced Sequence-to-Set Model for Extreme Multi-label Text Classification
Jie Cao
Yin Zhang
VLM
85
4
0
26 Oct 2022
MetaFormer Baselines for Vision
Weihao Yu
Chenyang Si
Pan Zhou
Mi Luo
Yichen Zhou
Jiashi Feng
Shuicheng Yan
Xinchao Wang
MoE
99
171
0
24 Oct 2022
Maestro-U: Leveraging joint speech-text representation learning for zero supervised speech ASR
Zhehuai Chen
Ankur Bapna
Andrew Rosenberg
Yu Zhang
Bhuvana Ramabhadran
Pedro J. Moreno
Nanxin Chen
107
17
0
18 Oct 2022
LSG Attention: Extrapolation of pretrained Transformers to long sequences
Charles Condevaux
S. Harispe
84
24
0
13 Oct 2022
Improved Data Augmentation for Translation Suggestion
Hongxiao Zhang
Siyu Lai
Songming Zhang
Hui Huang
Jinan Xu
Jian Liu
67
1
0
12 Oct 2022
Mixture of Attention Heads: Selecting Attention Heads Per Token
Xiaofeng Zhang
Songlin Yang
Zeyu Huang
Jie Zhou
Wenge Rong
Zhang Xiong
MoE
171
48
0
11 Oct 2022
E-Branchformer: Branchformer with Enhanced merging for speech recognition
Kwangyoun Kim
Felix Wu
Yifan Peng
Jing Pan
Prashant Sridhar
Kyu Jeong Han
Shinji Watanabe
181
117
0
30 Sep 2022
Lightweight Transformers for Human Activity Recognition on Mobile Devices
Sannara Ek
François Portet
P. Lalanda
83
32
0
22 Sep 2022
Relaxed Attention for Transformer Models
Timo Lohrenz
Björn Möller
Zhengyang Li
Tim Fingscheidt
KELM
59
12
0
20 Sep 2022
Pre-Training a Graph Recurrent Network for Language Representation
Yile Wang
Linyi Yang
Zhiyang Teng
M. Zhou
Yue Zhang
GNN
81
1
0
08 Sep 2022
Mind the Gap! Injecting Commonsense Knowledge for Abstractive Dialogue Summarization
Seungone Kim
Se June Joo
Hyungjoo Chae
Chaehyeong Kim
Seung-won Hwang
Jinyoung Yeo
54
20
0
02 Sep 2022
Real-time 3D Single Object Tracking with Transformer
Jiayao Shan
Sifan Zhou
Yubo Cui
Zheng Fang
ViT
78
50
0
02 Sep 2022
PointConvFormer: Revenge of the Point-based Convolution
Wenxuan Wu
Li Fuxin
Qi Shan
3DPC
84
32
0
04 Aug 2022
Scaling Laws vs Model Architectures: How does Inductive Bias Influence Scaling?
Yi Tay
Mostafa Dehghani
Samira Abnar
Hyung Won Chung
W. Fedus
J. Rao
Sharan Narang
Vinh Q. Tran
Dani Yogatama
Donald Metzler
AI4CE
125
107
0
21 Jul 2022
SeedFormer: Patch Seeds based Point Cloud Completion with Upsample Transformer
Hao Zhou
Yun Cao
Wenqing Chu
Junwei Zhu
Tong Lu
Ying Tai
Chengjie Wang
3DPC ViT
89
113
0
21 Jul 2022
Forming Trees with Treeformers
Nilay Patel
Jeffrey Flanigan
AI4CE
87
3
0
14 Jul 2022
Attention and Self-Attention in Random Forests
Lev V. Utkin
A. Konstantinov
73
7
0
09 Jul 2022
Cross-receptive Focused Inference Network for Lightweight Image Super-Resolution
Wenjie Li
Juncheng Li
Guangwei Gao
Jiantao Zhou
Jian Yang
Guo-Jun Qi
SupR
119
35
0
06 Jul 2022
Wav2Vec-Aug: Improved self-supervised training with limited data
Anuroop Sriram
Michael Auli
Alexei Baevski
SSL VLM
43
15
0
27 Jun 2022
Learning Multiscale Transformer Models for Sequence Generation
Bei Li
Tong Zheng
Yi Jing
Chengbo Jiao
Tong Xiao
Jingbo Zhu
70
9
0
19 Jun 2022
AGConv: Adaptive Graph Convolution on 3D Point Clouds
Mingqiang Wei
Zeyong Wei
Hao Zhou
Fei-Jiang Hu
Huajian Si
...
Jingbo Qiu
Xu Yan
Yan Guo
Jun Wang
J. Qin
3DPC
110
40
0
09 Jun 2022
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Tri Dao
Daniel Y. Fu
Stefano Ermon
Atri Rudra
Christopher Ré
VLM
445
2,299
0
27 May 2022
A Template-based Method for Constrained Neural Machine Translation
Shuo Wang
Peng Li
Zhixing Tan
Zhaopeng Tu
Maosong Sun
Yang Liu
BDL
48
2
0
23 May 2022
ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks
Haoran You
Baopu Li
Huihong Shi
Y. Fu
Yingyan Lin
122
17
0
17 May 2022
Efficient dynamic filter for robust and low computational feature extraction
Donghyeon Kim
Gwantae Kim
Bokyeung Lee
Jeong-gi Kwak
D. Han
Hanseok Ko
60
3
0
03 May 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
116
117
0
25 Apr 2022
BLISS: Robust Sequence-to-Sequence Learning via Self-Supervised Input Representation
Zheng Zhang
Liang Ding
Dazhao Cheng
Xuebo Liu
Min Zhang
Dacheng Tao
79
11
0
16 Apr 2022