v1v2v3 (latest)

DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding

14 September 2017

Papers citing "DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding"

39 / 39 papers shown

Title
LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding Junlong Tong Jinlan Fu Zixuan Lin Yingqi Fan Anhao Zhao Hui Su Xiaoyu Shen 93 0 0 22 May 2025
LOD1 3D City Model from LiDAR: The Impact of Segmentation Accuracy on Quality of Urban 3D Modeling and Morphology Extraction Fatemeh Chajaei Hossein Bagheri 3DV 3DPC AI4CE 189 0 0 20 May 2025
A Study of the Plausibility of Attention between RNN Encoders in Natural Language Inference Duc Hau Nguyen Duc Hau Nguyen Pascale Sébillot 120 5 0 23 Jan 2025
A Multi-Modal Explainability Approach for Human-Aware Robots in Multi-Party Conversation Iveta Becková Stefan Pócos Giulia Belgiovine Marco Matarese A. Sciutti Carlo Mazzola Carlo Mazzola 119 0 0 20 May 2024
Attention Is All You Need Ashish Vaswani Noam M. Shazeer Niki Parmar Jakob Uszkoreit Llion Jones Aidan Gomez Lukasz Kaiser Illia Polosukhin 3DV 808 132,725 0 12 Jun 2017
Learning Structured Text Representations Yang Liu Mirella Lapata 99 153 0 25 May 2017
Reinforced Mnemonic Reader for Machine Reading Comprehension Minghao Hu Yuxing Peng Zhen Huang Xipeng Qiu Furu Wei Ming Zhou RALM AIMat 80 69 0 08 May 2017
A Broad-Coverage Challenge Corpus for Sentence Understanding through Inference Adina Williams Nikita Nangia Samuel R. Bowman 526 4,497 0 18 Apr 2017
Structured Attention Networks Yoon Kim Carl Denton Luong Hoang Alexander M. Rush 125 463 0 03 Feb 2017
Structural Attention Neural Networks for improved sentiment analysis Alexandros Potamianos Filippos Kokkinos 76 71 0 07 Jan 2017
Bidirectional Tree-Structured LSTM with Head Lexicalization Zhiyang Teng Yue Zhang 49 23 0 21 Nov 2016
Bidirectional Attention Flow for Machine Comprehension Minjoon Seo Aniruddha Kembhavi Ali Farhadi Hannaneh Hajishirzi 135 2,091 0 05 Nov 2016
Neural Tree Indexers for Text Understanding Tsendsuren Munkhdalai Hong-ye Yu 79 104 0 15 Jul 2016
Neural Semantic Encoders Tsendsuren Munkhdalai Hong-ye Yu 280 134 0 14 Jul 2016
Stochastic Function Norm Regularization of Deep Networks Amal Rannen Triki Anderson C. A. Nascimento 67 2 0 30 May 2016
A Fast Unified Model for Parsing and Sentence Understanding Samuel R. Bowman Jon Gauthier Abhinav Rastogi Raghav Gupta Christopher D. Manning Christopher Potts 67 314 0 19 Mar 2016
Natural Language Inference by Tree-Based Convolution and Heuristic Matching Lili Mou Rui Men Ge Li Yan Xu Lu Zhang Rui Yan Zhi Jin 86 353 0 28 Dec 2015
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs) Djork-Arné Clevert Thomas Unterthiner Sepp Hochreiter 307 5,539 0 23 Nov 2015
Order-Embeddings of Images and Language Ivan Vendrov Ryan Kiros Sanja Fidler R. Urtasun 118 548 0 19 Nov 2015
A Neural Attention Model for Abstractive Sentence Summarization Alexander M. Rush S. Chopra Jason Weston CVBM 186 2,703 0 02 Sep 2015
Character-Aware Neural Language Models Yoon Kim Yacine Jernite David Sontag Alexander M. Rush 113 1,670 0 26 Aug 2015
A large annotated corpus for learning natural language inference Samuel R. Bowman Gabor Angeli Christopher Potts Christopher D. Manning 338 4,297 0 21 Aug 2015
Molding CNNs for text: non-linear, non-consecutive convolutions Tao Lei Regina Barzilay Tommi Jaakkola 100 146 0 17 Aug 2015
Effective Approaches to Attention-based Neural Machine Translation Thang Luong Hieu H. Pham Christopher D. Manning 420 7,971 0 17 Aug 2015
Skip-Thought Vectors Ryan Kiros Yukun Zhu Ruslan Salakhutdinov R. Zemel Antonio Torralba R. Urtasun Sanja Fidler SSL 226 2,412 0 22 Jun 2015
Teaching Machines to Read and Comprehend Karl Moritz Hermann Tomás Kociský Edward Grefenstette L. Espeholt W. Kay Mustafa Suleyman Phil Blunsom 355 3,555 0 10 Jun 2015
Self-Adaptive Hierarchical Sentence Model Haiying Zhao Zhengdong Lu Pascal Poupart 100 191 0 20 Apr 2015
Neural Responding Machine for Short-Text Conversation Lifeng Shang Zhengdong Lu Hang Li 121 1,146 0 09 Mar 2015
When Are Tree Structures Necessary for Deep Learning of Representations? Jiwei Li Thang Luong Dan Jurafsky Eduard H. Hovy 101 1 0 28 Feb 2015
Improved Semantic Representations From Tree-Structured Long Short-Term Memory Networks Kai Sheng Tai R. Socher Christopher D. Manning AIMat 146 3,122 0 28 Feb 2015
Adam: A Method for Stochastic Optimization Diederik P. Kingma Jimmy Ba ODL 2.1K 150,433 0 22 Dec 2014
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling Junyoung Chung Çağlar Gülçehre Kyunghyun Cho Yoshua Bengio 607 12,745 0 11 Dec 2014
Sequence to Sequence Learning with Neural Networks Ilya Sutskever Oriol Vinyals Quoc V. Le AIMat 450 20,606 0 10 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate Dzmitry Bahdanau Kyunghyun Cho Yoshua Bengio AIMat 582 27,338 0 01 Sep 2014
Convolutional Neural Networks for Sentence Classification Yoon Kim AILaw VLM 648 13,438 0 25 Aug 2014
A Convolutional Neural Network for Modelling Sentences Nal Kalchbrenner Edward Grefenstette Phil Blunsom 109 3,559 0 08 Apr 2014
Distributed Representations of Words and Phrases and their Compositionality Tomas Mikolov Ilya Sutskever Kai Chen G. Corrado J. Dean NAI OCL 406 33,573 0 16 Oct 2013
Efficient Estimation of Word Representations in Vector Space Tomas Mikolov Kai Chen G. Corrado J. Dean 3DV 693 31,571 0 16 Jan 2013
ADADELTA: An Adaptive Learning Rate Method Matthew D. Zeiler ODL 165 6,635 0 22 Dec 2012