Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1905.13324
Cited By
A Lightweight Recurrent Network for Sequence Modeling
30 May 2019
Biao Zhang
Rico Sennrich
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Lightweight Recurrent Network for Sequence Modeling"
30 / 30 papers shown
Title
Simplifying Neural Machine Translation with Addition-Subtraction Twin-Gated Recurrent Networks
Biao Zhang
Deyi Xiong
Jinsong Su
Qian Lin
Huiji Zhang
39
12
0
30 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.2K
93,936
0
11 Oct 2018
Rational Recurrences
Hao Peng
Roy Schwartz
Sam Thomson
Noah A. Smith
AI4CE
44
39
0
28 Aug 2018
Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information
Seonhoon Kim
Inho Kang
Nojun Kwak
47
217
0
29 May 2018
Accelerating Neural Transformer via an Average Attention Network
Biao Zhang
Deyi Xiong
Jinsong Su
55
120
0
02 May 2018
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation
Mengzhao Chen
Orhan Firat
Ankur Bapna
Melvin Johnson
Wolfgang Macherey
...
Niki Parmar
M. Schuster
Zhifeng Chen
Yonghui Wu
Macduff Hughes
AIMat
52
457
0
26 Apr 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
139
11,520
0
15 Feb 2018
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model
Zhilin Yang
Zihang Dai
Ruslan Salakhutdinov
William W. Cohen
BDL
49
367
0
10 Nov 2017
Simple Recurrent Units for Highly Parallelizable Recurrence
Tao Lei
Yu Zhang
Sida I. Wang
Huijing Dai
Yoav Artzi
LRM
69
271
0
08 Sep 2017
Adversarial Examples for Evaluating Reading Comprehension Systems
Robin Jia
Percy Liang
AAML
ELM
182
1,594
0
23 Jul 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
514
129,831
0
12 Jun 2017
Deriving Neural Architectures from Sequence and Graph Kernels
Tao Lei
Wengong Jin
Regina Barzilay
Tommi Jaakkola
GNN
72
137
0
25 May 2017
Recurrent Additive Networks
Kenton Lee
Omer Levy
Luke Zettlemoyer
GNN
AI4CE
41
38
0
21 May 2017
Factorization tricks for LSTM networks
Oleksii Kuchaiev
Boris Ginsburg
48
113
0
31 Mar 2017
Quasi-Recurrent Neural Networks
James Bradbury
Stephen Merity
Caiming Xiong
R. Socher
105
441
0
05 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
830
6,768
0
26 Sep 2016
Pointer Sentinel Mixture Models
Stephen Merity
Caiming Xiong
James Bradbury
R. Socher
RALM
213
2,814
0
26 Sep 2016
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
300
10,412
0
21 Jul 2016
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
180
8,067
0
16 Jun 2016
Optimizing Performance of Recurrent Neural Networks on GPUs
J. Appleyard
Tomás Kociský
Phil Blunsom
39
92
0
07 Apr 2016
Neural Architectures for Named Entity Recognition
Guillaume Lample
Miguel Ballesteros
Sandeep Subramanian
Kazuya Kawakami
Chris Dyer
209
4,006
0
04 Mar 2016
Strongly-Typed Recurrent Neural Networks
David Balduzzi
Muhammad Ghifary
PINN
49
60
0
06 Feb 2016
Reasoning about Entailment with Neural Attention
Tim Rocktaschel
Edward Grefenstette
Karl Moritz Hermann
Tomás Kociský
Phil Blunsom
NAI
46
761
0
22 Sep 2015
Character-level Convolutional Networks for Text Classification
Xiang Zhang
Jiaqi Zhao
Yann LeCun
215
6,077
0
04 Sep 2015
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
174
7,683
0
31 Aug 2015
A large annotated corpus for learning natural language inference
Samuel R. Bowman
Gabor Angeli
Christopher Potts
Christopher D. Manning
245
4,268
0
21 Aug 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.1K
149,474
0
22 Dec 2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
716
23,235
0
03 Jun 2014
ADADELTA: An Adaptive Learning Rate Method
Matthew D. Zeiler
ODL
117
6,619
0
22 Dec 2012
On the difficulty of training Recurrent Neural Networks
Razvan Pascanu
Tomas Mikolov
Yoshua Bengio
ODL
159
5,318
0
21 Nov 2012
1