Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Schrödinger's Tree -- On Syntax and Neural Language Models
Artur Kulmizev
Joakim Nivre
77
6
0
17 Oct 2021
An LSTM-based Plagiarism Detection via Attention Mechanism and a Population-based Approach for Pre-Training Parameters with imbalanced Classes
Seyed Vahid Moravvej
Seyed Jalaleddin Mousavirad
M. H. Moghadam
Mehrdad Saadatmand
29
35
0
17 Oct 2021
FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metrics for Automatic Text Generation
Moussa Kamal Eddine
Guokan Shang
A. Tixier
Michalis Vazirgiannis
77
28
0
16 Oct 2021
PAGnol: An Extra-Large French Generative Model
Julien Launay
E. L. Tommasone
B. Pannier
François Boniface
A. Chatelain
Alessandro Cappelli
Iacopo Poli
Djamé Seddah
AILaw MoE AI4CE
87
8
0
16 Oct 2021
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models
Nicholas Meade
Elinor Poole-Dayan
Siva Reddy
113
131
0
16 Oct 2021
Multimodal Dialogue Response Generation
Qingfeng Sun
Yujing Wang
Can Xu
Kai Zheng
Yaming Yang
Huang Hu
Fei Xu
Jessica Zhang
Xiubo Geng
Daxin Jiang
106
49
0
16 Oct 2021
AugmentedCode: Examining the Effects of Natural Language Resources in Code Retrieval Models
M. Bahrami
N. Shrikanth
Yuji Mizobuchi
Lei Liu
M. Fukuyori
Wei-Peng Chen
Kazuki Munakata
49
3
0
16 Oct 2021
Improving Compositional Generalization with Self-Training for Data-to-Text Generation
Sanket Vaibhav Mehta
J. Rao
Yi Tay
Mihir Kale
Ankur P. Parikh
Emma Strubell
AI4CE
96
30
0
16 Oct 2021
A Short Study on Compressing Decoder-Based Language Models
Tianda Li
Yassir El Mesbahi
I. Kobyzev
Ahmad Rashid
A. Mahmud
Nithin Anchuri
Habib Hajimolahoseini
Yang Liu
Mehdi Rezagholizadeh
151
25
0
16 Oct 2021
The Dangers of Underclaiming: Reasons for Caution When Reporting How NLP Systems Fail
Sam Bowman
OffRL
117
45
0
15 Oct 2021
ASPECTNEWS: Aspect-Oriented Summarization of News Documents
Ojas Ahuja
Jiacheng Xu
A. Gupta
Kevin Horecka
Greg Durrett
107
46
0
15 Oct 2021
Don't speak too fast: The impact of data bias on self-supervised speech models
Yen Meng
Yi-Hui Chou
Andy T. Liu
Hung-yi Lee
97
27
0
15 Oct 2021
Plug-Tagger: A Pluggable Sequence Labeling Framework Using Language Models
Xin Zhou
Ruotian Ma
Tao Gui
Y. Tan
Qi Zhang
Xuanjing Huang
VLM
73
5
0
14 Oct 2021
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
Junyi Ao
Rui Wang
Long Zhou
Chengyi Wang
Shuo Ren
...
Yu Zhang
Zhihua Wei
Yao Qian
Jinyu Li
Furu Wei
175
203
0
14 Oct 2021
Rethinking Self-Supervision Objectives for Generalizable Coherence Modeling
Prathyusha Jwalapuram
Shafiq Joty
Xiang Lin
111
16
0
14 Oct 2021
Bag-of-Vectors Autoencoders for Unsupervised Conditional Text Generation
Florian Mai
James Henderson
47
2
0
13 Oct 2021
Automated Essay Scoring Using Transformer Models
Sabrina Ludwig
Christian W. F. Mayer
Christopher Hansen
Kerstin Eilers
Steffen Brandt
85
40
0
13 Oct 2021
Mengzi: Towards Lightweight yet Ingenious Pre-trained Models for Chinese
Zhuosheng Zhang
Hanqing Zhang
Keming Chen
Yuhang Guo
Jingyun Hua
Yulong Wang
Ming Zhou
VLM
110
72
0
13 Oct 2021
MDERank: A Masked Document Embedding Rank Approach for Unsupervised Keyphrase Extraction
Linhan Zhang
Qian Chen
Wen Wang
Chong Deng
Shiliang Zhang
Bing Li
Wei Wang
Xin Cao
78
59
0
13 Oct 2021
The Dawn of Quantum Natural Language Processing
R. Sipio
Jia-Hong Huang
Samuel Yen-Chi Chen
Stefano Mangini
Marcel Worring
131
86
0
13 Oct 2021
Fake News Detection in Spanish Using Deep Learning Techniques
Kevin Martínez-Gallego
Andrés M. Álvarez-Ortiz
Julián D. Arias-Londoño
SyDa
123
14
0
13 Oct 2021
Learning Compact Metrics for MT
Amy Pu
Hyung Won Chung
Ankur P. Parikh
Sebastian Gehrmann
Thibault Sellam
91
101
0
12 Oct 2021
The Rich Get Richer: Disparate Impact of Semi-Supervised Learning
Zhaowei Zhu
Tianyi Luo
Yang Liu
243
40
0
12 Oct 2021
A Survey on Legal Question Answering Systems
J. Martinez-Gil
AILaw ELM
92
29
0
12 Oct 2021
Regionalized models for Spanish language variations based on Twitter
Eric Sadit Tellez
Daniela Moctezuma
Sabino Miranda
Mario Graff
Guillermo Ruiz
87
3
0
12 Oct 2021
Investigation on Data Adaptation Techniques for Neural Named Entity Recognition
Evgeniia Tokarchuk
David Thulke
Weiyue Wang
Christian Dugast
Hermann Ney
53
2
0
12 Oct 2021
We've had this conversation before: A Novel Approach to Measuring Dialog Similarity
Ofer Lavi
Ella Rabinovich
Segev Shlomov
David Boaz
Inbal Ronen
Ateret Anaby-Tavor
95
5
0
12 Oct 2021
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric
Suyoun Kim
Duc Le
Weiyi Zheng
Tarun Singh
Abhinav Arora
Xiaoyu Zhai
Christian Fuegen
Ozlem Kalinli
M. Seltzer
58
16
0
11 Oct 2021
Improving Gender Fairness of Pre-Trained Language Models without Catastrophic Forgetting
Zahra Fatemi
Chen Xing
Wenhao Liu
Caiming Xiong
CLL
85
34
0
11 Oct 2021
A Comprehensive Comparison of Word Embeddings in Event & Entity Coreference Resolution
Judicael Poumay
A. Ittoo
28
2
0
11 Oct 2021
Pre-trained Language Models in Biomedical Domain: A Systematic Survey
Benyou Wang
Qianqian Xie
Jiahuan Pei
Zhihong Chen
Prayag Tiwari
Zhao Li
Jie Fu
LM&MA AI4CE
154
171
0
11 Oct 2021
Advances in Multi-turn Dialogue Comprehension: A Survey
Zhuosheng Zhang
Hai Zhao
103
21
0
11 Oct 2021
CoRGi: Content-Rich Graph Neural Networks with Attention
Jooyeon Kim
A. Lamb
Simon Woodhead
Simon L. Peyton Jones
Cheng Zheng
Miltiadis Allamanis
73
6
0
10 Oct 2021
Enhance Long Text Understanding via Distilled Gist Detector from Abstractive Summarization
Yang Liu
Yazheng Yang
64
6
0
10 Oct 2021
Learning to Follow Language Instructions with Compositional Policies
Vanya Cohen
Geraud Nangue Tasse
N. Gopalan
Steven D. James
Matthew C. Gombolay
Benjamin Rosman
57
4
0
09 Oct 2021
An Isotropy Analysis in the Multilingual BERT Embedding Space
S. Rajaee
Mohammad Taher Pilehvar
123
34
0
09 Oct 2021
Towards a Unified View of Parameter-Efficient Transfer Learning
Junxian He
Chunting Zhou
Xuezhe Ma
Taylor Berg-Kirkpatrick
Graham Neubig
AAML
202
958
0
08 Oct 2021
VieSum: How Robust Are Transformer-based Models on Vietnamese Summarization?
Hieu Duy Nguyen
Long Phan
J. Anibal
Alec Peltekian
H. Tran
71
5
0
08 Oct 2021
Hierarchical Conditional End-to-End ASR with CTC and Multi-Granular Subword Units
Yosuke Higuchi
Keita Karube
Tetsuji Ogawa
Tetsunori Kobayashi
56
24
0
08 Oct 2021
On the Generalization of Models Trained with SGD: Information-Theoretic Bounds and Implications
Ziqiao Wang
Yongyi Mao
FedML MLT
124
26
0
07 Oct 2021
Self-Supervised Knowledge Assimilation for Expert-Layman Text Style Transfer
Wenda Xu
Michael Stephen Saxon
Misha Sra
Wenjie Wang
MedIm
81
13
0
06 Oct 2021
Using Optimal Transport as Alignment Objective for fine-tuning Multilingual Contextualized Embeddings
Sawsan Alqahtani
Garima Lalwani
Yi Zhang
Salvatore Romeo
Saab Mansour
OT
72
25
0
06 Oct 2021
BadPre: Task-agnostic Backdoor Attacks to Pre-trained NLP Foundation Models
Kangjie Chen
Yuxian Meng
Xiaofei Sun
Shangwei Guo
Tianwei Zhang
Jiwei Li
Chun Fan
SILM
87
111
0
06 Oct 2021
Word Acquisition in Neural Language Models
Tyler A. Chang
Benjamin Bergen
90
40
0
05 Oct 2021
BERT Attends the Conversation: Improving Low-Resource Conversational ASR
Pablo Ortiz
Simen Burud
55
5
0
05 Oct 2021
Learning Sense-Specific Static Embeddings using Contextualised Word Embeddings as a Proxy
Yi Zhou
Danushka Bollegala
79
9
0
05 Oct 2021
ASR Rescoring and Confidence Estimation with ELECTRA
Hayato Futami
Hirofumi Inaguma
Masato Mimura
S. Sakai
Tatsuya Kawahara
KELM
104
21
0
05 Oct 2021
Attention Augmented Convolutional Transformer for Tabular Time-series
Sharath M. Shankaranarayana
D. Runje
LMTD AI4TS
110
8
0
05 Oct 2021
A Survey On Neural Word Embeddings
Erhan Sezerer
Selma Tekir
AI4TS
86
13
0
05 Oct 2021
Low Frequency Names Exhibit Bias and Overfitting in Contextualizing Language Models
Robert Wolfe
Aylin Caliskan
125
51
0
01 Oct 2021