ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.03059
  4. Cited By
Depthwise Separable Convolutions for Neural Machine Translation

Depthwise Separable Convolutions for Neural Machine Translation

9 June 2017
Lukasz Kaiser
Aidan Gomez
François Chollet
ArXivPDFHTML

Papers citing "Depthwise Separable Convolutions for Neural Machine Translation"

38 / 38 papers shown
Title
Pluggable Style Representation Learning for Multi-Style Transfer
Pluggable Style Representation Learning for Multi-Style Transfer
Hongda Liu
Longguang Wang
Weijun Guan
Ye Zhang
Yulan Guo
75
1
0
26 Mar 2025
NdLinear Is All You Need for Representation Learning
NdLinear Is All You Need for Representation Learning
Alex Reneau
Jerry Yao-Chieh Hu
Zhongfang Zhuang
Ting-Chun Liu
HAI
44
0
0
21 Mar 2025
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech
  Enhancement
Geometry-Constrained EEG Channel Selection for Brain-Assisted Speech Enhancement
Keying Zuo
Qingtian Xu
Jie Zhang
Zhenhua Ling
39
0
0
19 Sep 2024
COVID-Net Assistant: A Deep Learning-Driven Virtual Assistant for
  COVID-19 Symptom Prediction and Recommendation
COVID-Net Assistant: A Deep Learning-Driven Virtual Assistant for COVID-19 Symptom Prediction and Recommendation
Peng Shi
Yuetong Wang
S. Abbasi
Alexander Wong
21
0
0
22 Nov 2022
Learning on tree architectures outperforms a convolutional feedforward
  network
Learning on tree architectures outperforms a convolutional feedforward network
Yuval Meir
Itamar Ben-Noam
Yarden Tzach
Shiri Hodassman
Ido Kanter
AI4CE
11
6
0
21 Nov 2022
PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person
  Search
PS-ARM: An End-to-End Attention-aware Relation Mixer Network for Person Search
M. Fiaz
Hisham Cholakkal
Sanath Narayan
Rao Muhammad Anwer
Fahad Shahbaz Khan
36
4
0
07 Oct 2022
DeepLSS: breaking parameter degeneracies in large scale structure with
  deep learning analysis of combined probes
DeepLSS: breaking parameter degeneracies in large scale structure with deep learning analysis of combined probes
T. Kacprzak
J. Fluri
22
12
0
17 Mar 2022
SmartSplit: Latency-Energy-Memory Optimisation for CNN Splitting on
  Smartphone Environment
SmartSplit: Latency-Energy-Memory Optimisation for CNN Splitting on Smartphone Environment
I. Prakash
Aniruddh Bansal
Rohit Verma
R. Shorey
27
8
0
01 Nov 2021
Rethinking Token-Mixing MLP for MLP-based Vision Backbone
Rethinking Token-Mixing MLP for MLP-based Vision Backbone
Tan Yu
Xu Li
Yunfeng Cai
Mingming Sun
Ping Li
45
26
0
28 Jun 2021
Scalable Transformers for Neural Machine Translation
Scalable Transformers for Neural Machine Translation
Peng Gao
Shijie Geng
Ping Luo
Xiaogang Wang
Jifeng Dai
Hongsheng Li
31
13
0
04 Jun 2021
Security Vulnerability Detection Using Deep Learning Natural Language
  Processing
Security Vulnerability Detection Using Deep Learning Natural Language Processing
Noah Ziems
Shaoen Wu
19
55
0
06 May 2021
BeamLearning: an end-to-end Deep Learning approach for the angular
  localization of sound sources using raw multichannel acoustic pressure data
BeamLearning: an end-to-end Deep Learning approach for the angular localization of sound sources using raw multichannel acoustic pressure data
Hadrien Pujol
Éric Bavu
Alexandre Garcia
44
22
0
27 Apr 2021
Neural Machine Translation: A Review of Methods, Resources, and Tools
Neural Machine Translation: A Review of Methods, Resources, and Tools
Zhixing Tan
Shuo Wang
Zonghan Yang
Gang Chen
Xuancheng Huang
Maosong Sun
Yang Liu
3DV
AI4TS
22
105
0
31 Dec 2020
Speech Command Recognition in Computationally Constrained Environments
  with a Quadratic Self-organized Operational Layer
Speech Command Recognition in Computationally Constrained Environments with a Quadratic Self-organized Operational Layer
M. Soltanian
Junaid Malik
Jenni Raitoharju
Alexandros Iosifidis
S. Kiranyaz
Denmark
22
11
0
23 Nov 2020
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network
  for Voice Activity Detection
MarbleNet: Deep 1D Time-Channel Separable Convolutional Neural Network for Voice Activity Detection
Fei Jia
Somshubra Majumdar
Boris Ginsburg
19
48
0
26 Oct 2020
Lightweight End-to-End Speech Recognition from Raw Audio Data Using
  Sinc-Convolutions
Lightweight End-to-End Speech Recognition from Raw Audio Data Using Sinc-Convolutions
Ludwig Kurzinger
Nicolas Lindae
Palle Klewitz
Gerhard Rigoll
27
5
0
15 Oct 2020
Efficient Transformers: A Survey
Efficient Transformers: A Survey
Yi Tay
Mostafa Dehghani
Dara Bahri
Donald Metzler
VLM
114
1,103
0
14 Sep 2020
ConvBERT: Improving BERT with Span-based Dynamic Convolution
ConvBERT: Improving BERT with Span-based Dynamic Convolution
Zihang Jiang
Weihao Yu
Daquan Zhou
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
43
157
0
06 Aug 2020
Improving Robustness using Joint Attention Network For Detecting Retinal
  Degeneration From Optical Coherence Tomography Images
Improving Robustness using Joint Attention Network For Detecting Retinal Degeneration From Optical Coherence Tomography Images
Sharif Amit Kamran
Alireza Tavakkoli
S. Zuckerbrod
15
25
0
16 May 2020
Lite Transformer with Long-Short Range Attention
Lite Transformer with Long-Short Range Attention
Zhanghao Wu
Zhijian Liu
Ji Lin
Yujun Lin
Song Han
23
317
0
24 Apr 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
22
138
0
18 Feb 2020
Are Transformers universal approximators of sequence-to-sequence
  functions?
Are Transformers universal approximators of sequence-to-sequence functions?
Chulhee Yun
Srinadh Bhojanapalli
A. S. Rawat
Sashank J. Reddi
Sanjiv Kumar
11
335
0
20 Dec 2019
Neural Machine Translation: A Review and Survey
Neural Machine Translation: A Review and Survey
Felix Stahlberg
3DV
AI4TS
MedIm
20
312
0
04 Dec 2019
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Spatio-Temporal FAST 3D Convolutions for Human Action Recognition
Alexandros Stergiou
R. Poppe
3DH
20
19
0
30 Sep 2019
On NMT Search Errors and Model Errors: Cat Got Your Tongue?
On NMT Search Errors and Model Errors: Cat Got Your Tongue?
Felix Stahlberg
Bill Byrne
LRM
13
152
0
27 Aug 2019
Deep Learning Based Chatbot Models
Deep Learning Based Chatbot Models
Richard Csaky
29
46
0
23 Aug 2019
TVQA+: Spatio-Temporal Grounding for Video Question Answering
TVQA+: Spatio-Temporal Grounding for Video Question Answering
Jie Lei
Licheng Yu
Tamara L. Berg
Joey Tianyi Zhou
31
227
0
25 Apr 2019
Depthwise Convolution is All You Need for Learning Multiple Visual
  Domains
Depthwise Convolution is All You Need for Learning Multiple Visual Domains
Yunhui Guo
Yandong Li
Rogerio Feris
Liqiang Wang
Tajana Simunic
OOD
26
145
0
03 Feb 2019
Accurate, Data-Efficient, Unconstrained Text Recognition with
  Convolutional Neural Networks
Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks
Mohamed Yousef
K. Hussain
U. S. Mohammed
3DV
26
124
0
31 Dec 2018
C3: Concentrated-Comprehensive Convolution and its application to
  semantic segmentation
C3: Concentrated-Comprehensive Convolution and its application to semantic segmentation
Hyojin Park
Y. Yoo
Geonseok Seo
Dongyoon Han
Sangdoo Yun
Nojun Kwak
SSeg
24
9
0
12 Dec 2018
Phrase-Based Attentions
Phrase-Based Attentions
Phi Xuan Nguyen
Chenyu You
14
8
0
30 Sep 2018
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for
  Speech Separation
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation
Yi Luo
N. Mesgarani
54
1,750
0
20 Sep 2018
Universal Transformers
Universal Transformers
Mostafa Dehghani
Stephan Gouws
Oriol Vinyals
Jakob Uszkoreit
Lukasz Kaiser
37
742
0
10 Jul 2018
QANet: Combining Local Convolution with Global Self-Attention for
  Reading Comprehension
QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension
Adams Wei Yu
David Dohan
Minh-Thang Luong
Rui Zhao
Kai Chen
Mohammad Norouzi
Quoc V. Le
RALM
AIMat
35
1,092
0
23 Apr 2018
Tensor2Tensor for Neural Machine Translation
Tensor2Tensor for Neural Machine Translation
Ashish Vaswani
Samy Bengio
E. Brevdo
François Chollet
Aidan Gomez
...
Nal Kalchbrenner
Niki Parmar
Ryan Sepassi
Noam M. Shazeer
Jakob Uszkoreit
60
527
0
16 Mar 2018
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling
  Approaches for Large-scale Video Classification
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification
Yunlong Bian
Chuang Gan
Xiao-Chang Liu
Fu Li
Xiang Long
Yandong Li
Heng Qi
Jie Zhou
Shilei Wen
Yuanqing Lin
18
48
0
12 Aug 2017
One Model To Learn Them All
One Model To Learn Them All
Lukasz Kaiser
Aidan Gomez
Noam M. Shazeer
Ashish Vaswani
Niki Parmar
Llion Jones
Jakob Uszkoreit
VLM
ViT
28
333
0
16 Jun 2017
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Z. Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
716
6,748
0
26 Sep 2016
1