ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1905.13324
  4. Cited By
A Lightweight Recurrent Network for Sequence Modeling

A Lightweight Recurrent Network for Sequence Modeling

30 May 2019
Biao Zhang
Rico Sennrich
ArXivPDFHTML

Papers citing "A Lightweight Recurrent Network for Sequence Modeling"

30 / 30 papers shown
Title
Simplifying Neural Machine Translation with Addition-Subtraction
  Twin-Gated Recurrent Networks
Simplifying Neural Machine Translation with Addition-Subtraction Twin-Gated Recurrent Networks
Biao Zhang
Deyi Xiong
Jinsong Su
Qian Lin
Huiji Zhang
39
12
0
30 Oct 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.2K
93,936
0
11 Oct 2018
Rational Recurrences
Rational Recurrences
Hao Peng
Roy Schwartz
Sam Thomson
Noah A. Smith
AI4CE
44
39
0
28 Aug 2018
Semantic Sentence Matching with Densely-connected Recurrent and
  Co-attentive Information
Semantic Sentence Matching with Densely-connected Recurrent and Co-attentive Information
Seonhoon Kim
Inho Kang
Nojun Kwak
49
217
0
29 May 2018
Accelerating Neural Transformer via an Average Attention Network
Accelerating Neural Transformer via an Average Attention Network
Biao Zhang
Deyi Xiong
Jinsong Su
55
120
0
02 May 2018
The Best of Both Worlds: Combining Recent Advances in Neural Machine
  Translation
The Best of Both Worlds: Combining Recent Advances in Neural Machine Translation
Mengzhao Chen
Orhan Firat
Ankur Bapna
Melvin Johnson
Wolfgang Macherey
...
Niki Parmar
M. Schuster
Zhifeng Chen
Yonghui Wu
Macduff Hughes
AIMat
52
457
0
26 Apr 2018
Deep contextualized word representations
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
141
11,520
0
15 Feb 2018
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model
Breaking the Softmax Bottleneck: A High-Rank RNN Language Model
Zhilin Yang
Zihang Dai
Ruslan Salakhutdinov
William W. Cohen
BDL
51
367
0
10 Nov 2017
Simple Recurrent Units for Highly Parallelizable Recurrence
Simple Recurrent Units for Highly Parallelizable Recurrence
Tao Lei
Yu Zhang
Sida I. Wang
Huijing Dai
Yoav Artzi
LRM
73
271
0
08 Sep 2017
Adversarial Examples for Evaluating Reading Comprehension Systems
Adversarial Examples for Evaluating Reading Comprehension Systems
Robin Jia
Percy Liang
AAML
ELM
185
1,594
0
23 Jul 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
519
129,831
0
12 Jun 2017
Deriving Neural Architectures from Sequence and Graph Kernels
Deriving Neural Architectures from Sequence and Graph Kernels
Tao Lei
Wengong Jin
Regina Barzilay
Tommi Jaakkola
GNN
72
137
0
25 May 2017
Recurrent Additive Networks
Recurrent Additive Networks
Kenton Lee
Omer Levy
Luke Zettlemoyer
GNN
AI4CE
41
38
0
21 May 2017
Factorization tricks for LSTM networks
Factorization tricks for LSTM networks
Oleksii Kuchaiev
Boris Ginsburg
48
113
0
31 Mar 2017
Quasi-Recurrent Neural Networks
Quasi-Recurrent Neural Networks
James Bradbury
Stephen Merity
Caiming Xiong
R. Socher
105
441
0
05 Nov 2016
Google's Neural Machine Translation System: Bridging the Gap between
  Human and Machine Translation
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation
Yonghui Wu
M. Schuster
Zhiwen Chen
Quoc V. Le
Mohammad Norouzi
...
Alex Rudnick
Oriol Vinyals
G. Corrado
Macduff Hughes
J. Dean
AIMat
832
6,768
0
26 Sep 2016
Pointer Sentinel Mixture Models
Pointer Sentinel Mixture Models
Stephen Merity
Caiming Xiong
James Bradbury
R. Socher
RALM
233
2,814
0
26 Sep 2016
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
305
10,412
0
21 Jul 2016
SQuAD: 100,000+ Questions for Machine Comprehension of Text
SQuAD: 100,000+ Questions for Machine Comprehension of Text
Pranav Rajpurkar
Jian Zhang
Konstantin Lopyrev
Percy Liang
RALM
182
8,067
0
16 Jun 2016
Optimizing Performance of Recurrent Neural Networks on GPUs
Optimizing Performance of Recurrent Neural Networks on GPUs
J. Appleyard
Tomás Kociský
Phil Blunsom
39
92
0
07 Apr 2016
Neural Architectures for Named Entity Recognition
Neural Architectures for Named Entity Recognition
Guillaume Lample
Miguel Ballesteros
Sandeep Subramanian
Kazuya Kawakami
Chris Dyer
209
4,006
0
04 Mar 2016
Strongly-Typed Recurrent Neural Networks
Strongly-Typed Recurrent Neural Networks
David Balduzzi
Muhammad Ghifary
PINN
49
60
0
06 Feb 2016
Reasoning about Entailment with Neural Attention
Reasoning about Entailment with Neural Attention
Tim Rocktaschel
Edward Grefenstette
Karl Moritz Hermann
Tomás Kociský
Phil Blunsom
NAI
48
761
0
22 Sep 2015
Character-level Convolutional Networks for Text Classification
Character-level Convolutional Networks for Text Classification
Xiang Zhang
Jiaqi Zhao
Yann LeCun
217
6,077
0
04 Sep 2015
Neural Machine Translation of Rare Words with Subword Units
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich
Barry Haddow
Alexandra Birch
174
7,683
0
31 Aug 2015
A large annotated corpus for learning natural language inference
A large annotated corpus for learning natural language inference
Samuel R. Bowman
Gabor Angeli
Christopher Potts
Christopher D. Manning
247
4,268
0
21 Aug 2015
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
1.1K
149,474
0
22 Dec 2014
Learning Phrase Representations using RNN Encoder-Decoder for
  Statistical Machine Translation
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
716
23,235
0
03 Jun 2014
ADADELTA: An Adaptive Learning Rate Method
ADADELTA: An Adaptive Learning Rate Method
Matthew D. Zeiler
ODL
117
6,619
0
22 Dec 2012
On the difficulty of training Recurrent Neural Networks
On the difficulty of training Recurrent Neural Networks
Razvan Pascanu
Tomas Mikolov
Yoshua Bengio
ODL
164
5,318
0
21 Nov 2012
1