Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.03906
Cited By
Massive Exploration of Neural Machine Translation Architectures
11 March 2017
D. Britz
Anna Goldie
Minh-Thang Luong
Quoc V. Le
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Massive Exploration of Neural Machine Translation Architectures"
50 / 72 papers shown
Title
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
71
5
0
11 Apr 2024
An Analysis of BPE Vocabulary Trimming in Neural Machine Translation
Marco Cognetta
Tatsuya Hiraoka
Naoaki Okazaki
Rico Sennrich
Yuval Pinter
34
2
0
30 Mar 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
51
7
0
28 Mar 2024
Integrating Human Vision Perception in Vision Transformers for Classifying Waste Items
Akshat Shrivastava
Tapan K. Gandhi
27
1
0
19 Dec 2023
HOT: Higher-Order Dynamic Graph Representation Learning with Efficient Transformers
Maciej Besta
Afonso Claudino Catarino
Lukas Gianinazzi
Nils Blach
Piotr Nyczyk
H. Niewiadomski
Torsten Hoefler
35
6
0
30 Nov 2023
ViPE: Visualise Pretty-much Everything
Hassan Shahmohammadi
Adhiraj Ghosh
Hendrik P. A. Lensch
DiffM
31
1
0
16 Oct 2023
An Ensemble Approach to Question Classification: Integrating Electra Transformer, GloVe, and LSTM
Sanad Aburass
O. Dorgham
Maha Abu Rumman
32
3
0
13 Aug 2023
Data-Driven Approach for Formality-Sensitive Machine Translation: Language-Specific Handling and Synthetic Data Generation
Seugnjun Lee
Hyeonseok Moon
Chanjun Park
Heu-Jeoung Lim
32
0
0
26 Jun 2023
Does Attention Mechanism Possess the Feature of Human Reading? A Perspective of Sentiment Classification Task
Leilei Zhao
Yingyi Zhang
Chengzhi Zhang
35
2
0
08 Sep 2022
A General Survey on Attention Mechanisms in Deep Learning
Gianni Brauwers
Flavius Frasincar
31
298
0
27 Mar 2022
The Ecological Footprint of Neural Machine Translation Systems
D. Shterionov
Eva Vanmassenhove
40
3
0
04 Feb 2022
DRF Codes: Deep SNR-Robust Feedback Codes
Mahdi Boloursaz Mashhadi
Deniz Gunduz
A. Perotti
B. Popović
27
10
0
22 Dec 2021
Capitalization and Punctuation Restoration: a Survey
V. Pais
D. Tufis
19
19
0
21 Nov 2021
A Review of Text Style Transfer using Deep Learning
Martina Toshevska
Sonja Gievska
CLIP
48
43
0
30 Sep 2021
Transfer Learning in Electronic Health Records through Clinical Concept Embedding
J. R. A. Solares
Yajie Zhu
A. Hassaine
Shishir Rao
Yikuan Li
M. Mamouei
D. Canoy
K. Rahimi
G. Salimi-Khorshidi
34
6
0
27 Jul 2021
A Survey on Data Augmentation for Text Classification
Markus Bayer
M. Kaufhold
Christian A. Reuter
38
336
0
07 Jul 2021
On Adversarial Robustness of Synthetic Code Generation
Mrinal Anand
Pratik Kayal
M. Singh
34
3
0
22 Jun 2021
Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
Danielle Saunders
AI4CE
27
86
0
14 Apr 2021
On Automatic Parsing of Log Records
Jared Rand
A. Miranskyy
29
6
0
12 Feb 2021
TextBox: A Unified, Modularized, and Extensible Framework for Text Generation
Junyi Li
Tianyi Tang
Gaole He
Jinhao Jiang
Xiaoxuan Hu
Puzhao Xie
Zhipeng Chen
Zhuohao Yu
Wayne Xin Zhao
Ji-Rong Wen
37
25
0
06 Jan 2021
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
Yujeong Choi
Yunseong Kim
Minsoo Rhu
24
66
0
25 Oct 2020
What Do Position Embeddings Learn? An Empirical Study of Pre-Trained Language Model Positional Encoding
Yu-An Wang
Yun-Nung Chen
SSL
10
94
0
10 Oct 2020
Clustering-based Unsupervised Generative Relation Extraction
Chenhan Yuan
Ryan Rossi
Andrew Katz
Hoda Eldardiry
19
4
0
26 Sep 2020
DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement
S. Iizuka
E. Simo-Serra
27
39
0
18 Sep 2020
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
36
79
0
17 Sep 2020
Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation
Tao He
Lianli Gao
Jingkuan Song
Jianfei Cai
Yuan-Fang Li
24
30
0
13 Jun 2020
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing
Nikita Klyuchnikov
I. Trofimov
Ekaterina Artemova
Mikhail Salnikov
M. Fedorov
Evgeny Burnaev
VLM
21
101
0
12 Jun 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
22
138
0
18 Feb 2020
BatchEnsemble: An Alternative Approach to Efficient Ensemble and Lifelong Learning
Yeming Wen
Dustin Tran
Jimmy Ba
OOD
FedML
UQCV
32
483
0
17 Feb 2020
DeepMutation: A Neural Mutation Tool
Michele Tufano
J. Kimko
Shiya Wang
Cody Watson
Gabriele Bavota
M. D. Penta
Denys Poshyvanyk
23
20
0
12 Feb 2020
Teaching Machines to Converse
Jiwei Li
29
4
0
31 Jan 2020
Trajectron++: Dynamically-Feasible Trajectory Forecasting With Heterogeneous Data
Tim Salzmann
Boris Ivanovic
Punarjay Chakravarty
Marco Pavone
33
130
0
09 Jan 2020
DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators
Lu Lu
Pengzhan Jin
George Karniadakis
43
2,029
0
08 Oct 2019
NeMo: a toolkit for building AI applications using Neural Modules
Oleksii Kuchaiev
Jason Chun Lok Li
Huyen Nguyen
Oleksii Hrinchuk
Ryan Leary
...
Jack Cook
P. Castonguay
Mariya Popova
Jocelyn Huang
Jonathan M. Cohen
211
296
0
14 Sep 2019
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu
Chenwei Zhang
Xiaohui Yan
Yi-Ju Chang
Philip S. Yu
30
18
0
13 Aug 2019
RobustTP: End-to-End Trajectory Prediction for Heterogeneous Road-Agents in Dense Traffic with Noisy Sensor Inputs
Rohan Chandra
Uttaran Bhattacharya
Christian Roncal
Aniket Bera
Tianyi Zhou
35
59
0
20 Jul 2019
Quick, Stat!: A Statistical Analysis of the Quick, Draw! Dataset
Jennifer J. Gago
J. Victores
Ugo Pattacini
C. Balaguer
28
10
0
15 Jul 2019
Lost in Translation: Loss and Decay of Linguistic Richness in Machine Translation
Eva Vanmassenhove
D. Shterionov
Andy Way
22
91
0
28 Jun 2019
Sharing Attention Weights for Fast Transformer
Tong Xiao
Yinqiao Li
Jingbo Zhu
Zhengtao Yu
Tongran Liu
17
50
0
26 Jun 2019
Bag of Color Features For Color Constancy
Firas Laakom
Nikolaos Passalis
Jenni Raitoharju
Jarno Nikkanen
Anastasios Tefas
Alexandros Iosifidis
Moncef Gabbouj
24
33
0
11 Jun 2019
Discrete Flows: Invertible Generative Models of Discrete Data
Dustin Tran
Keyon Vafa
Kumar Krishna Agrawal
Laurent Dinh
Ben Poole
DRL
24
114
0
24 May 2019
A CNN-RNN Architecture for Multi-Label Weather Recognition
Bin Zhao
Xuelong Li
Xiaoqiang Lu
Zhigang Wang
22
109
0
24 Apr 2019
Context-Aware Self-Attention Networks
Baosong Yang
Jian Li
Derek F. Wong
Lidia S. Chao
Xing Wang
Zhaopeng Tu
39
113
0
15 Feb 2019
On Learning Meaningful Code Changes via Neural Machine Translation
Michele Tufano
Jevgenija Pantiuchina
Cody Watson
Gabriele Bavota
Denys Poshyvanyk
27
203
0
25 Jan 2019
Evaluating the State-of-the-Art of End-to-End Natural Language Generation: The E2E NLG Challenge
Ondrej Dusek
Jekaterina Novikova
Verena Rieser
ELM
46
232
0
23 Jan 2019
How Much Does Tokenization Affect Neural Machine Translation?
Miguel Domingo
Mercedes García-Martínez
A. Helle
F. Casacuberta
Manuel Herranz
20
55
0
20 Dec 2018
TraPHic: Trajectory Prediction in Dense and Heterogeneous Traffic Using Weighted Interactions
Rohan Chandra
Uttaran Bhattacharya
Aniket Bera
Tianyi Zhou
28
256
0
12 Dec 2018
Data-parallel distributed training of very large models beyond GPU capacity
Samuel Matzek
M. Grossman
Minsik Cho
Anar Yusifov
Bryant Nelson
A. Juneja
GNN
22
3
0
29 Nov 2018
Scene Text Detection and Recognition: The Deep Learning Era
Shangbang Long
Xin He
Cong Yao
VLM
44
389
0
10 Nov 2018
Area Attention
Yang Li
Lukasz Kaiser
Samy Bengio
Si Si
31
20
0
23 Oct 2018
1
2
Next