ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05365
  4. Cited By
Deep contextualized word representations
v1v2 (latest)

Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI
ArXiv (abs)PDFHTML

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Title
Robust Layout-aware IE for Visually Rich Documents with Pre-trained
  Language Models
Robust Layout-aware IE for Visually Rich Documents with Pre-trained Language Models
Mengxi Wei
Yifan He
Qiong Zhang
VLM
50
41
0
22 May 2020
Med-BERT: pre-trained contextualized embeddings on large-scale
  structured electronic health records for disease prediction
Med-BERT: pre-trained contextualized embeddings on large-scale structured electronic health records for disease prediction
L. Rasmy
Yang Xiang
Z. Xie
Cui Tao
Degui Zhi
AI4MHLM&MA
129
705
0
22 May 2020
Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient
  Clipping
Stochastic Optimization with Heavy-Tailed Noise via Accelerated Gradient Clipping
Eduard A. Gorbunov
Marina Danilova
Alexander Gasnikov
94
123
0
21 May 2020
Fluent Response Generation for Conversational Question Answering
Fluent Response Generation for Conversational Question Answering
Ashutosh Baheti
Alan Ritter
Kevin Small
90
29
0
21 May 2020
Stance Prediction and Claim Verification: An Arabic Perspective
Stance Prediction and Claim Verification: An Arabic Perspective
Jude Khouja
73
62
0
21 May 2020
Pretraining with Contrastive Sentence Objectives Improves Discourse
  Performance of Language Models
Pretraining with Contrastive Sentence Objectives Improves Discourse Performance of Language Models
Dan Iter
Kelvin Guu
L. Lansing
Dan Jurafsky
96
78
0
20 May 2020
Leveraging Graph to Improve Abstractive Multi-Document Summarization
Leveraging Graph to Improve Abstractive Multi-Document Summarization
Wei Li
Xinyan Xiao
Jiachen Liu
Hua Wu
Haifeng Wang
Junping Du
92
136
0
20 May 2020
Graph-based, Self-Supervised Program Repair from Diagnostic Feedback
Graph-based, Self-Supervised Program Repair from Diagnostic Feedback
Michihiro Yasunaga
Percy Liang
LRM
140
176
0
20 May 2020
Cross-lingual Approaches for Task-specific Dialogue Act Recognition
Cross-lingual Approaches for Task-specific Dialogue Act Recognition
Jivrí Martínek
Christophe Cerisara
Pavel Král
Ladislav Lenc
20
0
0
19 May 2020
The Effect of Moderation on Online Mental Health Conversations
The Effect of Moderation on Online Mental Health Conversations
David Wadden
Tal August
Qisheng Li
Tim Althoff
AI4MH
72
46
0
19 May 2020
Contextual Embeddings: When Are They Worth It?
Contextual Embeddings: When Are They Worth It?
Simran Arora
Avner May
Jian Zhang
Christopher Ré
65
62
0
18 May 2020
Are All Languages Created Equal in Multilingual BERT?
Are All Languages Created Equal in Multilingual BERT?
Shijie Wu
Mark Dredze
105
326
0
18 May 2020
P-SIF: Document Embeddings Using Partition Averaging
P-SIF: Document Embeddings Using Partition Averaging
Vivek Gupta
A. Saw
Pegah Nokhiz
Praneeth Netrapalli
Piyush Rai
Partha P. Talukdar
57
25
0
18 May 2020
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio
  Representation
Audio ALBERT: A Lite BERT for Self-supervised Learning of Audio Representation
Po-Han Chi
Pei-Hung Chung
Tsung-Han Wu
Chun-Cheng Hsieh
Yen-Hao Chen
Shang-Wen Li
Hung-yi Lee
SSL
101
148
0
18 May 2020
Text Classification with Few Examples using Controlled Generalization
Text Classification with Few Examples using Controlled Generalization
A. Mahabal
Jason Baldridge
Burcu Karagol Ayan
Vincent Perot
Dan Roth
OODAI4CE
69
11
0
18 May 2020
Vector-Quantized Autoregressive Predictive Coding
Vector-Quantized Autoregressive Predictive Coding
Yu-An Chung
Hao Tang
James R. Glass
SSL
68
115
0
17 May 2020
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data
Pengcheng Yin
Graham Neubig
Wen-tau Yih
Sebastian Riedel
RALMLMTD
174
610
0
17 May 2020
Support-BERT: Predicting Quality of Question-Answer Pairs in MSDN using
  Deep Bidirectional Transformer
Support-BERT: Predicting Quality of Question-Answer Pairs in MSDN using Deep Bidirectional Transformer
Bhaskar Sen
Nikhil Gopal
Xinwei Xue
OOD
35
10
0
17 May 2020
SPot: A tool for identifying operating segments in financial tables
SPot: A tool for identifying operating segments in financial tables
Zhiqiang Ma
Steven Pomerville
Mingyang Di
Armineh Nourbakhsh
12
6
0
17 May 2020
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP
  Deep Learning Architectures on Commonsense Reasoning Task
CS-NLP team at SemEval-2020 Task 4: Evaluation of State-of-the-art NLP Deep Learning Architectures on Commonsense Reasoning Task
Sirwe Saeedi
Ali (Aliakbar) Panahi
Seyran Saeedi
A. Fong
ReLMELMLRM
78
12
0
17 May 2020
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Speech to Text Adaptation: Towards an Efficient Cross-Modal Distillation
Won Ik Cho
Donghyun Kwak
J. Yoon
N. Kim
84
26
0
17 May 2020
Cross-Lingual Low-Resource Set-to-Description Retrieval for Global
  E-Commerce
Cross-Lingual Low-Resource Set-to-Description Retrieval for Global E-Commerce
Juntao Li
Chang Liu
Jian Wang
Lidong Bing
Hongsong Li
Xiaozhong Liu
Dongyan Zhao
Rui Yan
48
12
0
17 May 2020
Encodings of Source Syntax: Similarities in NMT Representations Across
  Target Languages
Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages
Tyler A. Chang
Anna N. Rafferty
74
2
0
17 May 2020
Semi-Automating Knowledge Base Construction for Cancer Genetics
Semi-Automating Knowledge Base Construction for Cancer Genetics
Somin Wadhwa
K. Yin
K. Hughes
Byron C. Wallace
118
0
0
17 May 2020
RPD: A Distance Function Between Word Embeddings
RPD: A Distance Function Between Word Embeddings
Xuhui Zhou
Zaixiang Zheng
Shujian Huang
48
3
0
16 May 2020
Learning Probabilistic Sentence Representations from Paraphrases
Learning Probabilistic Sentence Representations from Paraphrases
Mingda Chen
Kevin Gimpel
32
2
0
16 May 2020
Rethinking and Improving Natural Language Generation with Layer-Wise
  Multi-View Decoding
Rethinking and Improving Natural Language Generation with Layer-Wise Multi-View Decoding
Fenglin Liu
Xuancheng Ren
Guangxiang Zhao
Chenyu You
Xuewei Ma
Xian Wu
Xu Sun
107
2
0
16 May 2020
KEIS@JUST at SemEval-2020 Task 12: Identifying Multilingual Offensive
  Tweets Using Weighted Ensemble and Fine-Tuned BERT
KEIS@JUST at SemEval-2020 Task 12: Identifying Multilingual Offensive Tweets Using Weighted Ensemble and Fine-Tuned BERT
Saja Khaled Tawalbeh
Mahmoud M. Hammad
Mohammad Al-Smadi
50
12
0
15 May 2020
A Scientific Information Extraction Dataset for Nature Inspired
  Engineering
A Scientific Information Extraction Dataset for Nature Inspired Engineering
Ruben Kruiper
J. Vincent
Jessica Chen-Burger
M. Desmulliez
Ioannis Konstas
73
6
0
15 May 2020
In Layman's Terms: Semi-Open Relation Extraction from Scientific Texts
In Layman's Terms: Semi-Open Relation Extraction from Scientific Texts
Ruben Kruiper
J. Vincent
Jessica Chen-Burger
M. Desmulliez
Ioannis Konstas
71
21
0
15 May 2020
Cross-lingual Transfer of Sentiment Classifiers
Cross-lingual Transfer of Sentiment Classifiers
Marko Robnik-Šikonja
Kristjan Reba
I. Mozetič
42
6
0
15 May 2020
NAT: Noise-Aware Training for Robust Neural Sequence Labeling
NAT: Noise-Aware Training for Robust Neural Sequence Labeling
Marcin Namysl
Sven Behnke
Joachim Kohler
NoLa
45
15
0
14 May 2020
Named Entity Recognition as Dependency Parsing
Named Entity Recognition as Dependency Parsing
Juntao Yu
Bernd Bohnet
Massimo Poesio
111
423
0
14 May 2020
An Efficient Shared-memory Parallel Sinkhorn-Knopp Algorithm to Compute
  the Word Mover's Distance
An Efficient Shared-memory Parallel Sinkhorn-Knopp Algorithm to Compute the Word Mover's Distance
Jesmin Jahan Tithi
Fabrizio Petrini
39
4
0
14 May 2020
Document-Level Event Role Filler Extraction using Multi-Granularity
  Contextualized Encoding
Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized Encoding
Xinya Du
Claire Cardie
69
101
0
13 May 2020
Deep Learning for Political Science
Deep Learning for Political Science
Kakia Chatsiou
Slava Jankin
AI4CE
75
13
0
13 May 2020
A Mixture of $h-1$ Heads is Better than $h$ Heads
A Mixture of h−1h-1h−1 Heads is Better than hhh Heads
Hao Peng
Roy Schwartz
Dianqi Li
Noah A. Smith
MoE
76
33
0
13 May 2020
The Unstoppable Rise of Computational Linguistics in Deep Learning
The Unstoppable Rise of Computational Linguistics in Deep Learning
James Henderson
AI4CE
84
28
0
13 May 2020
Machine Reading Comprehension: The Role of Contextualized Language
  Models and Beyond
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
Zhuosheng Zhang
Hai Zhao
Rui Wang
117
63
0
13 May 2020
MLSolv-A: A Novel Machine Learning-Based Prediction of Solvation Free
  Energies from Pairwise Atomistic Interactions
MLSolv-A: A Novel Machine Learning-Based Prediction of Solvation Free Energies from Pairwise Atomistic Interactions
Hyuntae Lim
YounJoon Jung
60
37
0
13 May 2020
Automated Extraction of Socio-political Events from News (AESPEN):
  Workshop and Shared Task Report
Automated Extraction of Socio-political Events from News (AESPEN): Workshop and Shared Task Report
Ali Hürriyetoǧlu
Vanni Zavarella
Hristo Tanev
E. Yoruk
Ali Safaya
Osman Mutlu
50
31
0
12 May 2020
MOReL : Model-Based Offline Reinforcement Learning
MOReL : Model-Based Offline Reinforcement Learning
Rahul Kidambi
Aravind Rajeswaran
Praneeth Netrapalli
Thorsten Joachims
OffRL
166
679
0
12 May 2020
A Report on the 2020 Sarcasm Detection Shared Task
A Report on the 2020 Sarcasm Detection Shared Task
Debanjan Ghosh
Avijit Vajpayee
Smaranda Muresan
66
61
0
12 May 2020
Do not let the history haunt you -- Mitigating Compounding Errors in
  Conversational Question Answering
Do not let the history haunt you -- Mitigating Compounding Errors in Conversational Question Answering
Angrosh Mandya
James OÑeill
Danushka Bollegala
Frans Coenen
39
8
0
12 May 2020
Dynamic Memory Induction Networks for Few-Shot Text Classification
Dynamic Memory Induction Networks for Few-Shot Text Classification
Ruiying Geng
Binhua Li
Yongbin Li
Jian Sun
Xiao-Dan Zhu
56
79
0
12 May 2020
On the Robustness of Language Encoders against Grammatical Errors
On the Robustness of Language Encoders against Grammatical Errors
Fan Yin
Quanyu Long
Tao Meng
Kai-Wei Chang
83
35
0
12 May 2020
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
Hao Tian
Can Gao
Xinyan Xiao
Hao Liu
Bolei He
Hua Wu
Haifeng Wang
Feng Wu
91
237
0
12 May 2020
Neural Polysynthetic Language Modelling
Neural Polysynthetic Language Modelling
Lane Schwartz
Francis M. Tyers
Lori S. Levin
Christo Kirov
Patrick Littell
...
Vasilisa Andriyanets
Aldrian Obaja Muis
Naoki Otani
J. Park
Zhisong Zhang
73
24
0
11 May 2020
A Deep Learning Approach for Automatic Detection of Fake News
A Deep Learning Approach for Automatic Detection of Fake News
Tanik Saikh
Arkadipta De
Asif Ekbal
P. Bhattacharyya
59
34
0
11 May 2020
schuBERT: Optimizing Elements of BERT
schuBERT: Optimizing Elements of BERT
A. Khetan
Zohar Karnin
93
30
0
09 May 2020
Previous
123...555657...899091
Next