Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1709.04696
Cited By
v1
v2
v3 (latest)
DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding
14 September 2017
Tao Shen
Dinesh Manocha
Guodong Long
Jing Jiang
Shirui Pan
Chengqi Zhang
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding"
39 / 39 papers shown
Title
LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding
Junlong Tong
Jinlan Fu
Zixuan Lin
Yingqi Fan
Anhao Zhao
Hui Su
Xiaoyu Shen
93
0
0
22 May 2025
LOD1 3D City Model from LiDAR: The Impact of Segmentation Accuracy on Quality of Urban 3D Modeling and Morphology Extraction
Fatemeh Chajaei
Hossein Bagheri
3DV
3DPC
AI4CE
189
0
0
20 May 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference
Duc Hau Nguyen
Duc Hau Nguyen
Pascale Sébillot
120
5
0
23 Jan 2025
A Multi-Modal Explainability Approach for Human-Aware Robots in Multi-Party Conversation
Iveta Becková
Stefan Pócos
Giulia Belgiovine
Marco Matarese
A. Sciutti
Carlo Mazzola
Carlo Mazzola
119
0
0
20 May 2024
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
808
132,725
0
12 Jun 2017
Learning Structured Text Representations
Yang Liu
Mirella Lapata
99
153
0
25 May 2017
Reinforced Mnemonic Reader for Machine Reading Comprehension
Minghao Hu
Yuxing Peng
Zhen Huang
Xipeng Qiu
Furu Wei
Ming Zhou
RALM
AIMat
80
69
0
08 May 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference
Adina Williams
Nikita Nangia
Samuel R. Bowman
526
4,497
0
18 Apr 2017
Structured Attention Networks
Yoon Kim
Carl Denton
Luong Hoang
Alexander M. Rush
125
463
0
03 Feb 2017
Structural Attention Neural Networks for improved sentiment analysis
Alexandros Potamianos
Filippos Kokkinos
76
71
0
07 Jan 2017
Bidirectional Tree-Structured LSTM with Head Lexicalization
Zhiyang Teng
Yue Zhang
49
23
0
21 Nov 2016
Bidirectional Attention Flow for Machine Comprehension
Minjoon Seo
Aniruddha Kembhavi
Ali Farhadi
Hannaneh Hajishirzi
135
2,091
0
05 Nov 2016
Neural Tree Indexers for Text Understanding
Tsendsuren Munkhdalai
Hong-ye Yu
79
104
0
15 Jul 2016
Neural Semantic Encoders
Tsendsuren Munkhdalai
Hong-ye Yu
280
134
0
14 Jul 2016
Stochastic Function Norm Regularization of Deep Networks
Amal Rannen Triki
Anderson C. A. Nascimento
67
2
0
30 May 2016
A Fast Unified Model for Parsing and Sentence Understanding
Samuel R. Bowman
Jon Gauthier
Abhinav Rastogi
Raghav Gupta
Christopher D. Manning
Christopher Potts
67
314
0
19 Mar 2016
Natural Language Inference by Tree-Based Convolution and Heuristic Matching
Lili Mou
Rui Men
Ge Li
Yan Xu
Lu Zhang
Rui Yan
Zhi Jin
86
353
0
28 Dec 2015
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)
Djork-Arné Clevert
Thomas Unterthiner
Sepp Hochreiter
307
5,539
0
23 Nov 2015
Order-Embeddings of Images and Language
Ivan Vendrov
Ryan Kiros
Sanja Fidler
R. Urtasun
118
548
0
19 Nov 2015
A Neural Attention Model for Abstractive Sentence Summarization
Alexander M. Rush
S. Chopra
Jason Weston
CVBM
186
2,703
0
02 Sep 2015
Character-Aware Neural Language Models
Yoon Kim
Yacine Jernite
David Sontag
Alexander M. Rush
113
1,670
0
26 Aug 2015
A large annotated corpus for learning natural language inference
Samuel R. Bowman
Gabor Angeli
Christopher Potts
Christopher D. Manning
338
4,297
0
21 Aug 2015
Molding CNNs for text: non-linear, non-consecutive convolutions
Tao Lei
Regina Barzilay
Tommi Jaakkola
100
146
0
17 Aug 2015
Effective Approaches to Attention-based Neural Machine Translation
Thang Luong
Hieu H. Pham
Christopher D. Manning
420
7,971
0
17 Aug 2015
Skip-Thought Vectors
Ryan Kiros
Yukun Zhu
Ruslan Salakhutdinov
R. Zemel
Antonio Torralba
R. Urtasun
Sanja Fidler
SSL
226
2,412
0
22 Jun 2015
Teaching Machines to Read and Comprehend
Karl Moritz Hermann
Tomás Kociský
Edward Grefenstette
L. Espeholt
W. Kay
Mustafa Suleyman
Phil Blunsom
355
3,555
0
10 Jun 2015
Self-Adaptive Hierarchical Sentence Model
Haiying Zhao
Zhengdong Lu
Pascal Poupart
100
191
0
20 Apr 2015
Neural Responding Machine for Short-Text Conversation
Lifeng Shang
Zhengdong Lu
Hang Li
121
1,146
0
09 Mar 2015
When Are Tree Structures Necessary for Deep Learning of Representations?
Jiwei Li
Thang Luong
Dan Jurafsky
Eduard H. Hovy
101
1
0
28 Feb 2015
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks
Kai Sheng Tai
R. Socher
Christopher D. Manning
AIMat
146
3,122
0
28 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,433
0
22 Dec 2014
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
Junyoung Chung
Çağlar Gülçehre
Kyunghyun Cho
Yoshua Bengio
607
12,745
0
11 Dec 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
450
20,606
0
10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
582
27,338
0
01 Sep 2014
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
648
13,438
0
25 Aug 2014
A Convolutional Neural Network for Modelling Sentences
Nal Kalchbrenner
Edward Grefenstette
Phil Blunsom
109
3,559
0
08 Apr 2014
Distributed Representations of Words and Phrases and their Compositionality
Tomas Mikolov
Ilya Sutskever
Kai Chen
G. Corrado
J. Dean
NAI
OCL
406
33,573
0
16 Oct 2013
Efficient Estimation of Word Representations in Vector Space
Tomas Mikolov
Kai Chen
G. Corrado
J. Dean
3DV
693
31,571
0
16 Jan 2013
ADADELTA: An Adaptive Learning Rate Method
Matthew D. Zeiler
ODL
165
6,635
0
22 Dec 2012
1