Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.11959
Cited By
FineText: Text Classification via Attention-based Language Model Fine-tuning
25 October 2019
Yunzhe Tao
Saurabh Gupta
Satyapriya Krishna
Xiong Zhou
Orchid Majumder
Vineet Khare
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"FineText: Text Classification via Attention-based Language Model Fine-tuning"
19 / 19 papers shown
Title
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,114
0
11 Oct 2018
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
224
11,565
0
15 Feb 2018
DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding
Tao Shen
Dinesh Manocha
Guodong Long
Jing Jiang
Shirui Pan
Chengqi Zhang
64
755
0
14 Sep 2017
Regularizing and Optimizing LSTM Language Models
Stephen Merity
N. Keskar
R. Socher
166
1,096
0
07 Aug 2017
Learned in Translation: Contextualized Word Vectors
Bryan McCann
James Bradbury
Caiming Xiong
R. Socher
121
909
0
01 Aug 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
725
132,199
0
12 Jun 2017
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
206
1,558
0
11 May 2017
Semi-supervised sequence tagging with bidirectional language models
Matthew E. Peters
Bridger Waleed Ammar
Chandra Bhagavatula
Russell Power
84
635
0
29 Apr 2017
Selective Encoding for Abstractive Sentence Summarization
Qingyu Zhou
Nan Yang
Furu Wei
M. Zhou
CVBM
85
260
0
24 Apr 2017
Learning to Generate Reviews and Discovering Sentiment
Alec Radford
Rafal Jozefowicz
Ilya Sutskever
97
510
0
05 Apr 2017
Pointer Sentinel Mixture Models
Stephen Merity
Caiming Xiong
James Bradbury
R. Socher
RALM
328
2,895
0
26 Sep 2016
Convolutional Neural Networks for Text Categorization: Shallow Word-level vs. Deep Character-level
Rie Johnson
Tong Zhang
43
48
0
31 Aug 2016
A Decomposable Attention Model for Natural Language Inference
Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
346
1,375
0
06 Jun 2016
Long Short-Term Memory-Networks for Machine Reading
Jianpeng Cheng
Li Dong
Mirella Lapata
AIMat
RALM
107
1,123
0
25 Jan 2016
Character-level Convolutional Networks for Text Classification
Xiang Zhang
Jiaqi Zhao
Yann LeCun
268
6,130
0
04 Sep 2015
A Neural Attention Model for Abstractive Sentence Summarization
Alexander M. Rush
S. Chopra
Jason Weston
CVBM
186
2,701
0
02 Sep 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
463
43,328
0
11 Feb 2015
Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition
Hasim Sak
A. Senior
F. Beaufays
99
1,052
0
05 Feb 2014
One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling
Ciprian Chelba
Tomas Mikolov
M. Schuster
Qi Ge
T. Brants
P. Koehn
T. Robinson
188
1,108
0
11 Dec 2013
1