ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1903.05987
  4. Cited By
To Tune or Not to Tune? Adapting Pretrained Representations to Diverse
  Tasks

To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks

14 March 2019
Matthew E. Peters
Sebastian Ruder
Noah A. Smith
ArXivPDFHTML

Papers citing "To Tune or Not to Tune? Adapting Pretrained Representations to Diverse Tasks"

29 / 229 papers shown
Title
Parameter Space Factorization for Zero-Shot Learning across Tasks and
  Languages
Parameter Space Factorization for Zero-Shot Learning across Tasks and Languages
Edoardo Ponti
Ivan Vulić
Ryan Cotterell
Marinela Parović
Roi Reichart
Anna Korhonen
BDL
29
29
0
30 Jan 2020
TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in
  (Un-)Supervised NLP
TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP
Nils Rethmeier
V. Saxena
Isabelle Augenstein
FAtt
25
2
0
02 Dec 2019
Towards non-toxic landscapes: Automatic toxic comment detection using
  DNN
Towards non-toxic landscapes: Automatic toxic comment detection using DNN
Ashwin Geet D'Sa
Irina Illina
Dominique Fohr
11
22
0
19 Nov 2019
Improving BERT Fine-tuning with Embedding Normalization
Wenxuan Zhou
Junyi Du
Xiang Ren
13
6
0
10 Nov 2019
Learning to Few-Shot Learn Across Diverse Natural Language
  Classification Tasks
Learning to Few-Shot Learn Across Diverse Natural Language Classification Tasks
Trapit Bansal
Rishikesh Jha
Andrew McCallum
SSL
21
118
0
10 Nov 2019
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language
  Models through Principled Regularized Optimization
SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization
Haoming Jiang
Pengcheng He
Weizhu Chen
Xiaodong Liu
Jianfeng Gao
T. Zhao
38
559
0
08 Nov 2019
On the Linguistic Representational Power of Neural Machine Translation
  Models
On the Linguistic Representational Power of Neural Machine Translation Models
Yonatan Belinkov
Nadir Durrani
Fahim Dalvi
Hassan Sajjad
James R. Glass
MILM
33
68
0
01 Nov 2019
What does BERT Learn from Multiple-Choice Reading Comprehension
  Datasets?
What does BERT Learn from Multiple-Choice Reading Comprehension Datasets?
Chenglei Si
Shuohang Wang
Min-Yen Kan
Jing Jiang
42
53
0
28 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text
  Transformer
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
126
19,493
0
23 Oct 2019
Extraction of Complex DNN Models: Real Threat or Boogeyman?
Extraction of Complex DNN Models: Real Threat or Boogeyman?
B. Atli
S. Szyller
Mika Juuti
Samuel Marchal
Nadarajah Asokan
MLAU
MIACV
33
45
0
11 Oct 2019
Conversational Transfer Learning for Emotion Recognition
Conversational Transfer Learning for Emotion Recognition
Devamanyu Hazarika
Soujanya Poria
Roger Zimmermann
Rada Mihalcea
29
17
0
11 Oct 2019
Neural Word Decomposition Models for Abusive Language Detection
Neural Word Decomposition Models for Abusive Language Detection
S. Bodapati
Spandana Gella
Kasturi Bhattacharjee
Yaser Al-Onaizan
17
28
0
02 Oct 2019
TalkDown: A Corpus for Condescension Detection in Context
TalkDown: A Corpus for Condescension Detection in Context
Zijian Wang
Christopher Potts
8
51
0
25 Sep 2019
Portuguese Named Entity Recognition using BERT-CRF
Portuguese Named Entity Recognition using BERT-CRF
Fábio Souza
Rodrigo Nogueira
R. Lotufo
22
251
0
23 Sep 2019
Dependency-Guided LSTM-CRF for Named Entity Recognition
Dependency-Guided LSTM-CRF for Named Entity Recognition
Zhanming Jie
Wei Lu
17
95
0
23 Sep 2019
Back to the Future -- Sequential Alignment of Text Representations
Back to the Future -- Sequential Alignment of Text Representations
Johannes Bjerva
Wouter M. Kouw
Isabelle Augenstein
AI4TS
6
9
0
08 Sep 2019
To lemmatize or not to lemmatize: how word normalisation affects ELMo
  performance in word sense disambiguation
To lemmatize or not to lemmatize: how word normalisation affects ELMo performance in word sense disambiguation
Andrey Kutuzov
E. Kuzmenko
11
22
0
06 Sep 2019
Show Your Work: Improved Reporting of Experimental Results
Show Your Work: Improved Reporting of Experimental Results
Jesse Dodge
Suchin Gururangan
Dallas Card
Roy Schwartz
Noah A. Smith
19
250
0
06 Sep 2019
When Low Resource NLP Meets Unsupervised Language Model:
  Meta-pretraining Then Meta-learning for Few-shot Text Classification
When Low Resource NLP Meets Unsupervised Language Model: Meta-pretraining Then Meta-learning for Few-shot Text Classification
Shumin Deng
Ningyu Zhang
Zhanlin Sun
Jiaoyan Chen
Huajun Chen
VLM
8
45
0
22 Aug 2019
Visualizing and Understanding the Effectiveness of BERT
Visualizing and Understanding the Effectiveness of BERT
Y. Hao
Li Dong
Furu Wei
Ke Xu
27
181
0
15 Aug 2019
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
Leveraging Pre-trained Checkpoints for Sequence Generation Tasks
S. Rothe
Shashi Narayan
Aliaksei Severyn
SILM
69
433
0
29 Jul 2019
To Tune or Not To Tune? How About the Best of Both Worlds?
To Tune or Not To Tune? How About the Best of Both Worlds?
Ran A. Wang
Haibo Su
Chunye Wang
Kailin Ji
J. Ding
VLM
33
17
0
09 Jul 2019
Transfer Learning for Risk Classification of Social Media Posts: Model
  Evaluation Study
Transfer Learning for Risk Classification of Social Media Posts: Model Evaluation Study
Derek Howard
M. Maslej
Justin Lee
Jacob Ritchie
G. Woollard
L. French
AI4MH
18
30
0
04 Jul 2019
Transfer Learning for Causal Sentence Detection
Transfer Learning for Causal Sentence Detection
Manolis Kyriakakis
Ion Androutsopoulos
Joan Ginés i Ametllé
Artur Saudabayev
11
25
0
18 Jun 2019
Pre-Training Graph Neural Networks for Generic Structural Feature
  Extraction
Pre-Training Graph Neural Networks for Generic Structural Feature Extraction
Ziniu Hu
Changjun Fan
Ting-Li Chen
Kai-Wei Chang
Yizhou Sun
32
43
0
31 May 2019
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
Evaluating the Underlying Gender Bias in Contextualized Word Embeddings
Christine Basta
Marta R. Costa-jussá
Noe Casas
16
189
0
18 Apr 2019
Linguistic Knowledge and Transferability of Contextual Representations
Linguistic Knowledge and Transferability of Contextual Representations
Nelson F. Liu
Matt Gardner
Yonatan Belinkov
Matthew E. Peters
Noah A. Smith
52
718
0
21 Mar 2019
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
Convolutional Neural Networks for Sentence Classification
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
270
13,368
0
25 Aug 2014
Previous
12345