ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.11959
  4. Cited By
FineText: Text Classification via Attention-based Language Model
  Fine-tuning

FineText: Text Classification via Attention-based Language Model Fine-tuning

25 October 2019
Yunzhe Tao
Saurabh Gupta
Satyapriya Krishna
Xiong Zhou
Orchid Majumder
Vineet Khare
ArXiv (abs)PDFHTML

Papers citing "FineText: Text Classification via Attention-based Language Model Fine-tuning"

19 / 19 papers shown
Title
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
1.8K
95,114
0
11 Oct 2018
Deep contextualized word representations
Deep contextualized word representations
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
224
11,565
0
15 Feb 2018
DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language
  Understanding
DiSAN: Directional Self-Attention Network for RNN/CNN-Free Language Understanding
Tao Shen
Dinesh Manocha
Guodong Long
Jing Jiang
Shirui Pan
Chengqi Zhang
64
755
0
14 Sep 2017
Regularizing and Optimizing LSTM Language Models
Regularizing and Optimizing LSTM Language Models
Stephen Merity
N. Keskar
R. Socher
166
1,096
0
07 Aug 2017
Learned in Translation: Contextualized Word Vectors
Learned in Translation: Contextualized Word Vectors
Bryan McCann
James Bradbury
Caiming Xiong
R. Socher
121
909
0
01 Aug 2017
Attention Is All You Need
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
725
132,199
0
12 Jun 2017
A Deep Reinforced Model for Abstractive Summarization
A Deep Reinforced Model for Abstractive Summarization
Romain Paulus
Caiming Xiong
R. Socher
AI4TS
206
1,558
0
11 May 2017
Semi-supervised sequence tagging with bidirectional language models
Semi-supervised sequence tagging with bidirectional language models
Matthew E. Peters
Bridger Waleed Ammar
Chandra Bhagavatula
Russell Power
84
635
0
29 Apr 2017
Selective Encoding for Abstractive Sentence Summarization
Selective Encoding for Abstractive Sentence Summarization
Qingyu Zhou
Nan Yang
Furu Wei
M. Zhou
CVBM
85
260
0
24 Apr 2017
Learning to Generate Reviews and Discovering Sentiment
Learning to Generate Reviews and Discovering Sentiment
Alec Radford
Rafal Jozefowicz
Ilya Sutskever
97
510
0
05 Apr 2017
Pointer Sentinel Mixture Models
Pointer Sentinel Mixture Models
Stephen Merity
Caiming Xiong
James Bradbury
R. Socher
RALM
328
2,895
0
26 Sep 2016
Convolutional Neural Networks for Text Categorization: Shallow
  Word-level vs. Deep Character-level
Convolutional Neural Networks for Text Categorization: Shallow Word-level vs. Deep Character-level
Rie Johnson
Tong Zhang
43
48
0
31 Aug 2016
A Decomposable Attention Model for Natural Language Inference
A Decomposable Attention Model for Natural Language Inference
Ankur P. Parikh
Oscar Täckström
Dipanjan Das
Jakob Uszkoreit
346
1,375
0
06 Jun 2016
Long Short-Term Memory-Networks for Machine Reading
Long Short-Term Memory-Networks for Machine Reading
Jianpeng Cheng
Li Dong
Mirella Lapata
AIMatRALM
107
1,123
0
25 Jan 2016
Character-level Convolutional Networks for Text Classification
Character-level Convolutional Networks for Text Classification
Xiang Zhang
Jiaqi Zhao
Yann LeCun
268
6,130
0
04 Sep 2015
A Neural Attention Model for Abstractive Sentence Summarization
A Neural Attention Model for Abstractive Sentence Summarization
Alexander M. Rush
S. Chopra
Jason Weston
CVBM
186
2,701
0
02 Sep 2015
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
463
43,328
0
11 Feb 2015
Long Short-Term Memory Based Recurrent Neural Network Architectures for
  Large Vocabulary Speech Recognition
Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition
Hasim Sak
A. Senior
F. Beaufays
99
1,052
0
05 Feb 2014
One Billion Word Benchmark for Measuring Progress in Statistical
  Language Modeling
One Billion Word Benchmark for Measuring Progress in Statistical Language Modeling
Ciprian Chelba
Tomas Mikolov
M. Schuster
Qi Ge
T. Brants
P. Koehn
T. Robinson
188
1,108
0
11 Dec 2013
1