ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05365
  4. Cited By
Deep contextualized word representations
v1v2 (latest)

Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI
ArXiv (abs)PDFHTML

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Title
"Average" Approximates "First Principal Component"? An Empirical
  Analysis on Representations from Neural Language Models
"Average" Approximates "First Principal Component"? An Empirical Analysis on Representations from Neural Language Models
Zihan Wang
Chengyu Dong
Jingbo Shang
FAtt
143
4
0
18 Apr 2021
Worst of Both Worlds: Biases Compound in Pre-trained Vision-and-Language
  Models
Worst of Both Worlds: Biases Compound in Pre-trained Vision-and-Language Models
Tejas Srinivasan
Yonatan Bisk
VLM
83
56
0
18 Apr 2021
UPB at SemEval-2021 Task 5: Virtual Adversarial Training for Toxic Spans
  Detection
UPB at SemEval-2021 Task 5: Virtual Adversarial Training for Toxic Spans Detection
Andrei Paraschiv
Dumitru-Clementin Cercel
M. Dascalu
65
1
0
17 Apr 2021
Multi-source Neural Topic Modeling in Multi-view Embedding Spaces
Multi-source Neural Topic Modeling in Multi-view Embedding Spaces
Pankaj Gupta
Yatin Chaudhary
Hinrich Schütze
BDL
60
9
0
17 Apr 2021
DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages
DWUG: A large Resource of Diachronic Word Usage Graphs in Four Languages
Dominik Schlechtweg
Nina Tahmasebi
Simon Hengchen
Haim Dubossarsky
Barbara McGillivray
79
49
0
17 Apr 2021
The Topic Confusion Task: A Novel Scenario for Authorship Attribution
The Topic Confusion Task: A Novel Scenario for Authorship Attribution
Malik H. Altakrori
Jackie C.K. Cheung
Benjamin C. M. Fung
65
19
0
17 Apr 2021
Learning to Share by Masking the Non-shared for Multi-domain Sentiment
  Classification
Learning to Share by Masking the Non-shared for Multi-domain Sentiment Classification
Jianhua Yuan
Yanyan Zhao
Bing Qin
Ting Liu
83
17
0
17 Apr 2021
Memorisation versus Generalisation in Pre-trained Language Models
Memorisation versus Generalisation in Pre-trained Language Models
Michael Tänzer
Sebastian Ruder
Marek Rei
126
51
0
16 Apr 2021
AMMU : A Survey of Transformer-based Biomedical Pretrained Language
  Models
AMMU : A Survey of Transformer-based Biomedical Pretrained Language Models
Katikapalli Subramanyam Kalyan
A. Rajasekharan
S. Sangeetha
LM&MAMedIm
120
170
0
16 Apr 2021
Condenser: a Pre-training Architecture for Dense Retrieval
Condenser: a Pre-training Architecture for Dense Retrieval
Luyu Gao
Jamie Callan
AI4CE
71
269
0
16 Apr 2021
proScript: Partially Ordered Scripts Generation via Pre-trained Language
  Models
proScript: Partially Ordered Scripts Generation via Pre-trained Language Models
Keisuke Sakaguchi
Chandrasekhar Bhagavatula
Ronan Le Bras
Niket Tandon
Peter Clark
Yejin Choi
63
25
0
16 Apr 2021
LU-BZU at SemEval-2021 Task 2: Word2Vec and Lemma2Vec performance in
  Arabic Word-in-Context disambiguation
LU-BZU at SemEval-2021 Task 2: Word2Vec and Lemma2Vec performance in Arabic Word-in-Context disambiguation
Moustafa Al-Hajj
Mustafa Jarrar
72
15
0
16 Apr 2021
Probing Across Time: What Does RoBERTa Know and When?
Probing Across Time: What Does RoBERTa Know and When?
Leo Z. Liu
Yizhong Wang
Jungo Kasai
Hannaneh Hajishirzi
Noah A. Smith
KELM
124
88
0
16 Apr 2021
Translational NLP: A New Paradigm and General Principles for Natural
  Language Processing Research
Translational NLP: A New Paradigm and General Principles for Natural Language Processing Research
Denis R. Newman-Griffis
J. Lehman
Carolyn Rose
H. Hochheiser
58
20
0
16 Apr 2021
ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep
  Learning
ZeRO-Infinity: Breaking the GPU Memory Wall for Extreme Scale Deep Learning
Samyam Rajbhandari
Olatunji Ruwase
Jeff Rasley
Shaden Smith
Yuxiong He
GNN
116
393
0
16 Apr 2021
A Masked Segmental Language Model for Unsupervised Natural Language
  Segmentation
A Masked Segmental Language Model for Unsupervised Natural Language Segmentation
C.M. Downey
Fei Xia
Gina-Anne Levow
Shane Steinert-Threlkeld
51
13
0
16 Apr 2021
Sublanguage: A Serious Issue Affects Pretrained Models in Legal Domain
Sublanguage: A Serious Issue Affects Pretrained Models in Legal Domain
Nguyen Ha Thanh
Le-Minh Nguyen
ELMAILaw
19
2
0
15 Apr 2021
Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling
Syntax-Aware Graph-to-Graph Transformer for Semantic Role Labelling
Alireza Mohammadshahi
James Henderson
74
12
0
15 Apr 2021
Generating Datasets with Pretrained Language Models
Generating Datasets with Pretrained Language Models
Timo Schick
Hinrich Schütze
178
235
0
15 Apr 2021
Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via
  Multi-Task Training
Learning Zero-Shot Multifaceted Visually Grounded Word Embeddings via Multi-Task Training
Hassan Shahmohammadi
Hendrik P. A. Lensch
R. Baayen
80
19
0
15 Apr 2021
Unmasking the Mask -- Evaluating Social Biases in Masked Language Models
Unmasking the Mask -- Evaluating Social Biases in Masked Language Models
Masahiro Kaneko
Danushka Bollegala
77
72
0
15 Apr 2021
Effect of Post-processing on Contextualized Word Representations
Effect of Post-processing on Contextualized Word Representations
Hassan Sajjad
Firoj Alam
Fahim Dalvi
Nadir Durrani
66
9
0
15 Apr 2021
Emotion Dynamics Modeling via BERT
Emotion Dynamics Modeling via BERT
Haiqing Yang
Jianping Shen
88
12
0
15 Apr 2021
COIL: Revisit Exact Lexical Match in Information Retrieval with
  Contextualized Inverted List
COIL: Revisit Exact Lexical Match in Information Retrieval with Contextualized Inverted List
Luyu Gao
Zhuyun Dai
Jamie Callan
89
220
0
15 Apr 2021
Disentangling Representations of Text by Masking Transformers
Disentangling Representations of Text by Masking Transformers
Xiongyi Zhang
Jan-Willem van de Meent
Byron C. Wallace
DRL
68
21
0
14 Apr 2021
Static Embeddings as Efficient Knowledge Bases?
Static Embeddings as Efficient Knowledge Bases?
Philipp Dufter
Nora Kassner
Hinrich Schütze
67
19
0
14 Apr 2021
Modeling Human Mental States with an Entity-based Narrative Graph
Modeling Human Mental States with an Entity-based Narrative Graph
I-Ta Lee
Maria Leonor Pacheco
Dan Goldwasser
45
4
0
14 Apr 2021
K-PLUG: Knowledge-injected Pre-trained Language Model for Natural
  Language Understanding and Generation in E-Commerce
K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce
Song Xu
Haoran Li
Peng Yuan
Yujia Wang
Youzheng Wu
Xiaodong He
Ying Liu
Bowen Zhou
KELM
94
24
0
14 Apr 2021
The Surprising Performance of Simple Baselines for Misinformation
  Detection
The Surprising Performance of Simple Baselines for Misinformation Detection
Kellin Pelrine
Jacob Danovitch
Reihaneh Rabbany
82
66
0
14 Apr 2021
Masked Language Modeling and the Distributional Hypothesis: Order Word
  Matters Pre-training for Little
Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little
Koustuv Sinha
Robin Jia
Dieuwke Hupkes
J. Pineau
Adina Williams
Douwe Kiela
140
249
0
14 Apr 2021
Learning How to Ask: Querying LMs with Mixtures of Soft Prompts
Learning How to Ask: Querying LMs with Mixtures of Soft Prompts
Guanghui Qin
J. Eisner
82
551
0
14 Apr 2021
Zero-Resource Multi-Dialectal Arabic Natural Language Understanding
Zero-Resource Multi-Dialectal Arabic Natural Language Understanding
Muhammad Khalifa
Hesham A. Hassan
A. Fahmy
63
7
0
14 Apr 2021
Should Semantic Vector Composition be Explicit? Can it be Linear?
Should Semantic Vector Composition be Explicit? Can it be Linear?
Dominic Widdows
Kristen Howell
T. Cohen
CoGe
68
3
0
13 Apr 2021
Large-Scale Contextualised Language Modelling for Norwegian
Large-Scale Contextualised Language Modelling for Norwegian
Andrey Kutuzov
Jeremy Barnes
Erik Velldal
Lilja Ovrelid
Stephan Oepen
84
38
0
13 Apr 2021
Zhestyatsky at SemEval-2021 Task 2: ReLU over Cosine Similarity for BERT
  Fine-tuning
Zhestyatsky at SemEval-2021 Task 2: ReLU over Cosine Similarity for BERT Fine-tuning
Boris Zhestiankin
Maria Ponomareva
32
5
0
13 Apr 2021
GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised
  Named Entity Recognition
GLaRA: Graph-based Labeling Rule Augmentation for Weakly Supervised Named Entity Recognition
Xinyan Zhao
Haibo Ding
Z. Feng
65
23
0
13 Apr 2021
On the Impact of Knowledge-based Linguistic Annotations in the Quality
  of Scientific Embeddings
On the Impact of Knowledge-based Linguistic Annotations in the Quality of Scientific Embeddings
Andrés García-Silva
R. Denaux
José Manuél Gómez-Pérez
122
3
0
13 Apr 2021
Understanding Transformers for Bot Detection in Twitter
Understanding Transformers for Bot Detection in Twitter
Andrés García-Silva
Cristian Berrío
José Manuél Gómez-Pérez
44
4
0
13 Apr 2021
Semantic maps and metrics for science Semantic maps and metrics for
  science using deep transformer encoders
Semantic maps and metrics for science Semantic maps and metrics for science using deep transformer encoders
Brendan Chambers
James A. Evans
MedIm
53
0
0
13 Apr 2021
DirectProbe: Studying Representations without Classifiers
DirectProbe: Studying Representations without Classifiers
Yichu Zhou
Vivek Srikumar
97
29
0
13 Apr 2021
Evaluating Pre-Trained Models for User Feedback Analysis in Software
  Engineering: A Study on Classification of App-Reviews
Evaluating Pre-Trained Models for User Feedback Analysis in Software Engineering: A Study on Classification of App-Reviews
M. Hadi
Fatemeh H. Fard
63
33
0
12 Apr 2021
Relational World Knowledge Representation in Contextual Language Models:
  A Review
Relational World Knowledge Representation in Contextual Language Models: A Review
Tara Safavi
Danai Koutra
KELM
100
51
0
12 Apr 2021
Few-shot Intent Classification and Slot Filling with Retrieved Examples
Few-shot Intent Classification and Slot Filling with Retrieved Examples
Dian Yu
Luheng He
Yuan Zhang
Xinya Du
Panupong Pasupat
Qi Li
VLM
71
54
0
12 Apr 2021
DATE: Detecting Anomalies in Text via Self-Supervision of Transformers
DATE: Detecting Anomalies in Text via Self-Supervision of Transformers
Andrei Manolache
Florin Brad
Elena Burceanu
UQCV
70
34
0
12 Apr 2021
Survey on reinforcement learning for language processing
Survey on reinforcement learning for language processing
Víctor Uc Cetina
Nicolás Navarro-Guerrero
A. Martín-González
C. Weber
S. Wermter
OffRL
97
111
0
12 Apr 2021
Self-Training with Weak Supervision
Self-Training with Weak Supervision
Giannis Karamanolakis
Subhabrata Mukherjee
Guoqing Zheng
Ahmed Hassan Awadallah
NoLa
65
86
0
12 Apr 2021
Stay Together: A System for Single and Split-antecedent Anaphora
  Resolution
Stay Together: A System for Single and Split-antecedent Anaphora Resolution
Juntao Yu
N. Moosavi
Silviu Paun
Massimo Poesio
41
14
0
12 Apr 2021
Better Feature Integration for Named Entity Recognition
Better Feature Integration for Named Entity Recognition
Lu Xu
Zhanming Jie
Wei Lu
Lidong Bing
72
38
0
12 Apr 2021
Learning to Remove: Towards Isotropic Pre-trained BERT Embedding
Learning to Remove: Towards Isotropic Pre-trained BERT Embedding
Y. Liang
Rui Cao
Jie Zheng
Jie Ren
Ling Gao
SSL
183
28
0
12 Apr 2021
Unsupervised Learning of Explainable Parse Trees for Improved
  Generalisation
Unsupervised Learning of Explainable Parse Trees for Improved Generalisation
Atul Sahay
Ayush Maheshwari
Ritesh Kumar
Ganesh Ramakrishnan
M. Hanawal
K. Arya
LRM
76
1
0
11 Apr 2021
Previous
123...363738...899091
Next