ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1703.03906
  4. Cited By
Massive Exploration of Neural Machine Translation Architectures

Massive Exploration of Neural Machine Translation Architectures

11 March 2017
D. Britz
Anna Goldie
Minh-Thang Luong
Quoc V. Le
ArXivPDFHTML

Papers citing "Massive Exploration of Neural Machine Translation Architectures"

50 / 72 papers shown
Title
High-Dimension Human Value Representation in Large Language Models
High-Dimension Human Value Representation in Large Language Models
Samuel Cahyawijaya
Delong Chen
Yejin Bang
Leila Khalatbari
Bryan Wilie
Ziwei Ji
Etsuko Ishii
Pascale Fung
71
5
0
11 Apr 2024
An Analysis of BPE Vocabulary Trimming in Neural Machine Translation
An Analysis of BPE Vocabulary Trimming in Neural Machine Translation
Marco Cognetta
Tatsuya Hiraoka
Naoaki Okazaki
Rico Sennrich
Yuval Pinter
34
2
0
30 Mar 2024
Enhancing Efficiency in Vision Transformer Networks: Design Techniques
  and Insights
Enhancing Efficiency in Vision Transformer Networks: Design Techniques and Insights
Moein Heidari
Reza Azad
Sina Ghorbani Kolahi
René Arimond
Leon Niggemeier
...
Afshin Bozorgpour
Ehsan Khodapanah Aghdam
A. Kazerouni
I. Hacihaliloglu
Dorit Merhof
51
7
0
28 Mar 2024
Integrating Human Vision Perception in Vision Transformers for
  Classifying Waste Items
Integrating Human Vision Perception in Vision Transformers for Classifying Waste Items
Akshat Shrivastava
Tapan K. Gandhi
27
1
0
19 Dec 2023
HOT: Higher-Order Dynamic Graph Representation Learning with Efficient
  Transformers
HOT: Higher-Order Dynamic Graph Representation Learning with Efficient Transformers
Maciej Besta
Afonso Claudino Catarino
Lukas Gianinazzi
Nils Blach
Piotr Nyczyk
H. Niewiadomski
Torsten Hoefler
35
6
0
30 Nov 2023
ViPE: Visualise Pretty-much Everything
ViPE: Visualise Pretty-much Everything
Hassan Shahmohammadi
Adhiraj Ghosh
Hendrik P. A. Lensch
DiffM
31
1
0
16 Oct 2023
An Ensemble Approach to Question Classification: Integrating Electra
  Transformer, GloVe, and LSTM
An Ensemble Approach to Question Classification: Integrating Electra Transformer, GloVe, and LSTM
Sanad Aburass
O. Dorgham
Maha Abu Rumman
32
3
0
13 Aug 2023
Data-Driven Approach for Formality-Sensitive Machine Translation:
  Language-Specific Handling and Synthetic Data Generation
Data-Driven Approach for Formality-Sensitive Machine Translation: Language-Specific Handling and Synthetic Data Generation
Seugnjun Lee
Hyeonseok Moon
Chanjun Park
Heu-Jeoung Lim
32
0
0
26 Jun 2023
Does Attention Mechanism Possess the Feature of Human Reading? A
  Perspective of Sentiment Classification Task
Does Attention Mechanism Possess the Feature of Human Reading? A Perspective of Sentiment Classification Task
Leilei Zhao
Yingyi Zhang
Chengzhi Zhang
35
2
0
08 Sep 2022
A General Survey on Attention Mechanisms in Deep Learning
A General Survey on Attention Mechanisms in Deep Learning
Gianni Brauwers
Flavius Frasincar
31
298
0
27 Mar 2022
The Ecological Footprint of Neural Machine Translation Systems
The Ecological Footprint of Neural Machine Translation Systems
D. Shterionov
Eva Vanmassenhove
40
3
0
04 Feb 2022
DRF Codes: Deep SNR-Robust Feedback Codes
DRF Codes: Deep SNR-Robust Feedback Codes
Mahdi Boloursaz Mashhadi
Deniz Gunduz
A. Perotti
B. Popović
27
10
0
22 Dec 2021
Capitalization and Punctuation Restoration: a Survey
Capitalization and Punctuation Restoration: a Survey
V. Pais
D. Tufis
19
19
0
21 Nov 2021
A Review of Text Style Transfer using Deep Learning
A Review of Text Style Transfer using Deep Learning
Martina Toshevska
Sonja Gievska
CLIP
48
43
0
30 Sep 2021
Transfer Learning in Electronic Health Records through Clinical Concept
  Embedding
Transfer Learning in Electronic Health Records through Clinical Concept Embedding
J. R. A. Solares
Yajie Zhu
A. Hassaine
Shishir Rao
Yikuan Li
M. Mamouei
D. Canoy
K. Rahimi
G. Salimi-Khorshidi
34
6
0
27 Jul 2021
A Survey on Data Augmentation for Text Classification
A Survey on Data Augmentation for Text Classification
Markus Bayer
M. Kaufhold
Christian A. Reuter
38
336
0
07 Jul 2021
On Adversarial Robustness of Synthetic Code Generation
On Adversarial Robustness of Synthetic Code Generation
Mrinal Anand
Pratik Kayal
M. Singh
34
3
0
22 Jun 2021
Domain Adaptation and Multi-Domain Adaptation for Neural Machine
  Translation: A Survey
Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey
Danielle Saunders
AI4CE
27
86
0
14 Apr 2021
On Automatic Parsing of Log Records
On Automatic Parsing of Log Records
Jared Rand
A. Miranskyy
29
6
0
12 Feb 2021
TextBox: A Unified, Modularized, and Extensible Framework for Text
  Generation
TextBox: A Unified, Modularized, and Extensible Framework for Text Generation
Junyi Li
Tianyi Tang
Gaole He
Jinhao Jiang
Xiaoxuan Hu
Puzhao Xie
Zhipeng Chen
Zhuohao Yu
Wayne Xin Zhao
Ji-Rong Wen
37
25
0
06 Jan 2021
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning
  Inference
LazyBatching: An SLA-aware Batching System for Cloud Machine Learning Inference
Yujeong Choi
Yunseong Kim
Minsoo Rhu
24
66
0
25 Oct 2020
What Do Position Embeddings Learn? An Empirical Study of Pre-Trained
  Language Model Positional Encoding
What Do Position Embeddings Learn? An Empirical Study of Pre-Trained Language Model Positional Encoding
Yu-An Wang
Yun-Nung Chen
SSL
10
94
0
10 Oct 2020
Clustering-based Unsupervised Generative Relation Extraction
Clustering-based Unsupervised Generative Relation Extraction
Chenhan Yuan
Ryan Rossi
Andrew Katz
Hoda Eldardiry
19
4
0
26 Sep 2020
DeepRemaster: Temporal Source-Reference Attention Networks for
  Comprehensive Video Enhancement
DeepRemaster: Temporal Source-Reference Attention Networks for Comprehensive Video Enhancement
S. Iizuka
E. Simo-Serra
27
39
0
18 Sep 2020
Review: Deep Learning in Electron Microscopy
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
36
79
0
17 Sep 2020
Learning from the Scene and Borrowing from the Rich: Tackling the Long
  Tail in Scene Graph Generation
Learning from the Scene and Borrowing from the Rich: Tackling the Long Tail in Scene Graph Generation
Tao He
Lianli Gao
Jingkuan Song
Jianfei Cai
Yuan-Fang Li
24
30
0
13 Jun 2020
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language
  Processing
NAS-Bench-NLP: Neural Architecture Search Benchmark for Natural Language Processing
Nikita Klyuchnikov
I. Trofimov
Ekaterina Artemova
Mikhail Salnikov
M. Fedorov
Evgeny Burnaev
VLM
21
101
0
12 Jun 2020
A Survey of Deep Learning Techniques for Neural Machine Translation
A Survey of Deep Learning Techniques for Neural Machine Translation
Shu Yang
Yuxin Wang
Xiaowen Chu
VLM
AI4TS
AI4CE
22
138
0
18 Feb 2020
BatchEnsemble: An Alternative Approach to Efficient Ensemble and
  Lifelong Learning
BatchEnsemble: An Alternative Approach to Efficient Ensemble and Lifelong Learning
Yeming Wen
Dustin Tran
Jimmy Ba
OOD
FedML
UQCV
32
483
0
17 Feb 2020
DeepMutation: A Neural Mutation Tool
DeepMutation: A Neural Mutation Tool
Michele Tufano
J. Kimko
Shiya Wang
Cody Watson
Gabriele Bavota
M. D. Penta
Denys Poshyvanyk
23
20
0
12 Feb 2020
Teaching Machines to Converse
Teaching Machines to Converse
Jiwei Li
29
4
0
31 Jan 2020
Trajectron++: Dynamically-Feasible Trajectory Forecasting With
  Heterogeneous Data
Trajectron++: Dynamically-Feasible Trajectory Forecasting With Heterogeneous Data
Tim Salzmann
Boris Ivanovic
Punarjay Chakravarty
Marco Pavone
33
130
0
09 Jan 2020
DeepONet: Learning nonlinear operators for identifying differential
  equations based on the universal approximation theorem of operators
DeepONet: Learning nonlinear operators for identifying differential equations based on the universal approximation theorem of operators
Lu Lu
Pengzhan Jin
George Karniadakis
43
2,029
0
08 Oct 2019
NeMo: a toolkit for building AI applications using Neural Modules
NeMo: a toolkit for building AI applications using Neural Modules
Oleksii Kuchaiev
Jason Chun Lok Li
Huyen Nguyen
Oleksii Hrinchuk
Ryan Leary
...
Jack Cook
P. Castonguay
Mariya Popova
Jocelyn Huang
Jonathan M. Cohen
211
296
0
14 Sep 2019
Generative Question Refinement with Deep Reinforcement Learning in
  Retrieval-based QA System
Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System
Ye Liu
Chenwei Zhang
Xiaohui Yan
Yi-Ju Chang
Philip S. Yu
30
18
0
13 Aug 2019
RobustTP: End-to-End Trajectory Prediction for Heterogeneous Road-Agents
  in Dense Traffic with Noisy Sensor Inputs
RobustTP: End-to-End Trajectory Prediction for Heterogeneous Road-Agents in Dense Traffic with Noisy Sensor Inputs
Rohan Chandra
Uttaran Bhattacharya
Christian Roncal
Aniket Bera
Tianyi Zhou
35
59
0
20 Jul 2019
Quick, Stat!: A Statistical Analysis of the Quick, Draw! Dataset
Quick, Stat!: A Statistical Analysis of the Quick, Draw! Dataset
Jennifer J. Gago
J. Victores
Ugo Pattacini
C. Balaguer
28
10
0
15 Jul 2019
Lost in Translation: Loss and Decay of Linguistic Richness in Machine
  Translation
Lost in Translation: Loss and Decay of Linguistic Richness in Machine Translation
Eva Vanmassenhove
D. Shterionov
Andy Way
22
91
0
28 Jun 2019
Sharing Attention Weights for Fast Transformer
Sharing Attention Weights for Fast Transformer
Tong Xiao
Yinqiao Li
Jingbo Zhu
Zhengtao Yu
Tongran Liu
17
50
0
26 Jun 2019
Bag of Color Features For Color Constancy
Bag of Color Features For Color Constancy
Firas Laakom
Nikolaos Passalis
Jenni Raitoharju
Jarno Nikkanen
Anastasios Tefas
Alexandros Iosifidis
Moncef Gabbouj
24
33
0
11 Jun 2019
Discrete Flows: Invertible Generative Models of Discrete Data
Discrete Flows: Invertible Generative Models of Discrete Data
Dustin Tran
Keyon Vafa
Kumar Krishna Agrawal
Laurent Dinh
Ben Poole
DRL
24
114
0
24 May 2019
A CNN-RNN Architecture for Multi-Label Weather Recognition
A CNN-RNN Architecture for Multi-Label Weather Recognition
Bin Zhao
Xuelong Li
Xiaoqiang Lu
Zhigang Wang
22
109
0
24 Apr 2019
Context-Aware Self-Attention Networks
Context-Aware Self-Attention Networks
Baosong Yang
Jian Li
Derek F. Wong
Lidia S. Chao
Xing Wang
Zhaopeng Tu
39
113
0
15 Feb 2019
On Learning Meaningful Code Changes via Neural Machine Translation
On Learning Meaningful Code Changes via Neural Machine Translation
Michele Tufano
Jevgenija Pantiuchina
Cody Watson
Gabriele Bavota
Denys Poshyvanyk
27
203
0
25 Jan 2019
Evaluating the State-of-the-Art of End-to-End Natural Language
  Generation: The E2E NLG Challenge
Evaluating the State-of-the-Art of End-to-End Natural Language Generation: The E2E NLG Challenge
Ondrej Dusek
Jekaterina Novikova
Verena Rieser
ELM
46
232
0
23 Jan 2019
How Much Does Tokenization Affect Neural Machine Translation?
How Much Does Tokenization Affect Neural Machine Translation?
Miguel Domingo
Mercedes García-Martínez
A. Helle
F. Casacuberta
Manuel Herranz
20
55
0
20 Dec 2018
TraPHic: Trajectory Prediction in Dense and Heterogeneous Traffic Using
  Weighted Interactions
TraPHic: Trajectory Prediction in Dense and Heterogeneous Traffic Using Weighted Interactions
Rohan Chandra
Uttaran Bhattacharya
Aniket Bera
Tianyi Zhou
28
256
0
12 Dec 2018
Data-parallel distributed training of very large models beyond GPU
  capacity
Data-parallel distributed training of very large models beyond GPU capacity
Samuel Matzek
M. Grossman
Minsik Cho
Anar Yusifov
Bryant Nelson
A. Juneja
GNN
22
3
0
29 Nov 2018
Scene Text Detection and Recognition: The Deep Learning Era
Scene Text Detection and Recognition: The Deep Learning Era
Shangbang Long
Xin He
Cong Yao
VLM
44
389
0
10 Nov 2018
Area Attention
Area Attention
Yang Li
Lukasz Kaiser
Samy Bengio
Si Si
31
20
0
23 Oct 2018
12
Next