ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05365
  4. Cited By
Deep contextualized word representations
v1v2 (latest)

Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI
ArXiv (abs)PDFHTML

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Title
A Latent-Variable Model for Intrinsic Probing
A Latent-Variable Model for Intrinsic Probing
Karolina Stañczak
Lucas Torroba Hennigen
Adina Williams
Ryan Cotterell
Isabelle Augenstein
119
4
0
20 Jan 2022
Linguistically-driven Multi-task Pre-training for Low-resource Neural
  Machine Translation
Linguistically-driven Multi-task Pre-training for Low-resource Neural Machine Translation
Zhuoyuan Mao
Chenhui Chu
Sadao Kurohashi
44
7
0
20 Jan 2022
AstBERT: Enabling Language Model for Financial Code Understanding with
  Abstract Syntax Trees
AstBERT: Enabling Language Model for Financial Code Understanding with Abstract Syntax Trees
Rong Liang
Tiehu Zhang
Y. Lu
Yuze Liu
Zhengqing Huang
Xin Chen
53
3
0
20 Jan 2022
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Towards a Cleaner Document-Oriented Multilingual Crawled Corpus
Julien Abadji
Pedro Ortiz Suarez
Laurent Romary
Benoît Sagot
CLL
99
159
0
17 Jan 2022
Millions of Co-purchases and Reviews Reveal the Spread of Polarization
  and Lifestyle Politics across Online Markets
Millions of Co-purchases and Reviews Reveal the Spread of Polarization and Lifestyle Politics across Online Markets
Alex Ruch
Ari Decter-Frain
Raghav Batra
34
2
0
17 Jan 2022
Transferability in Deep Learning: A Survey
Transferability in Deep Learning: A Survey
Junguang Jiang
Yang Shu
Jianmin Wang
Mingsheng Long
OOD
93
105
0
15 Jan 2022
Machine Learning for Food Review and Recommendation
Machine Learning for Food Review and Recommendation
Tan Le
S. Hui
18
4
0
15 Jan 2022
The Dark Side of the Language: Pre-trained Transformers in the DarkNet
The Dark Side of the Language: Pre-trained Transformers in the DarkNet
Leonardo Ranaldi
Aria Nourbakhsh
Arianna Patrizi
Elena Sofia Ruzzetti
Dario Onorati
Francesca Fallucchi
Fabio Massimo Zanzotto
VLM
63
21
0
14 Jan 2022
A Survey of Controllable Text Generation using Transformer-based
  Pre-trained Language Models
A Survey of Controllable Text Generation using Transformer-based Pre-trained Language Models
Hanqing Zhang
Haolin Song
Shaoyu Li
Ming Zhou
Dawei Song
143
230
0
14 Jan 2022
DapStep: Deep Assignee Prediction for Stack Trace Error rePresentation
DapStep: Deep Assignee Prediction for Stack Trace Error rePresentation
Denis Sushentsev
Aleksandr Khvorov
R. Vasiliev
Yaroslav Golubev
T. Bryksin
60
3
0
14 Jan 2022
A Feature Extraction based Model for Hate Speech Identification
A Feature Extraction based Model for Hate Speech Identification
Salar Mohtaj
Vera Schmitt
Sebastian Möller
38
4
0
11 Jan 2022
Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular
  Vision-Language Pre-training
Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training
Yehao Li
Jiahao Fan
Yingwei Pan
Ting Yao
Weiyao Lin
Tao Mei
MLLMObjD
81
19
0
11 Jan 2022
Head2Toe: Utilizing Intermediate Representations for Better Transfer
  Learning
Head2Toe: Utilizing Intermediate Representations for Better Transfer Learning
Utku Evci
Vincent Dumoulin
Hugo Larochelle
Michael C. Mozer
147
86
0
10 Jan 2022
Medication Error Detection Using Contextual Language Models
Medication Error Detection Using Contextual Language Models
Yu Jiang
C. Poellabauer
19
1
0
09 Jan 2022
Coherence-Based Distributed Document Representation Learning for
  Scientific Documents
Coherence-Based Distributed Document Representation Learning for Scientific Documents
Shicheng Tan
Shu Zhao
Yanping Zhang
37
2
0
08 Jan 2022
Automatic Related Work Generation: A Meta Study
Automatic Related Work Generation: A Meta Study
Xiangci Li
Jessica Ouyang
110
10
0
06 Jan 2022
Multi Document Reading Comprehension
Multi Document Reading Comprehension
Avi Chawla
96
0
0
05 Jan 2022
Discrete and continuous representations and processing in deep learning:
  Looking forward
Discrete and continuous representations and processing in deep learning: Looking forward
Ruben Cartuyvels
Graham Spinks
Marie-Francine Moens
OCL
95
20
0
04 Jan 2022
Sound and Visual Representation Learning with Multiple Pretraining Tasks
Sound and Visual Representation Learning with Multiple Pretraining Tasks
A. Vasudevan
Dengxin Dai
Luc Van Gool
SSL
85
6
0
04 Jan 2022
Which Student is Best? A Comprehensive Knowledge Distillation Exam for
  Task-Specific BERT Models
Which Student is Best? A Comprehensive Knowledge Distillation Exam for Task-Specific BERT Models
Made Nindyatama Nityasya
Haryo Akbarianto Wibowo
Rendi Chevi
Radityo Eko Prasojo
Alham Fikri Aji
86
6
0
03 Jan 2022
Learning with Latent Structures in Natural Language Processing: A Survey
Learning with Latent Structures in Natural Language Processing: A Survey
Zhaofeng Wu
BDLDRL
73
4
0
03 Jan 2022
Transformer Embeddings of Irregularly Spaced Events and Their
  Participants
Transformer Embeddings of Irregularly Spaced Events and Their Participants
Chenghao Yang
Hongyuan Mei
Jason Eisner
AI4TS
118
60
0
31 Dec 2021
Clustering Vietnamese Conversations From Facebook Page To Build Training
  Dataset For Chatbot
Clustering Vietnamese Conversations From Facebook Page To Build Training Dataset For Chatbot
Tri Nguyen
Thi-Kim-Ngoan Pham
T. Bui
Thanh-Quynh-Chau Nguyen
62
0
0
31 Dec 2021
What is Event Knowledge Graph: A Survey
What is Event Knowledge Graph: A Survey
Saiping Guan
Xueqi Cheng
Long Bai
Fu Zhang
Zixuan Li
Yutao Zeng
Xiaolong Jin
Jiafeng Guo
73
58
0
31 Dec 2021
A Survey on Gender Bias in Natural Language Processing
A Survey on Gender Bias in Natural Language Processing
Karolina Stañczak
Isabelle Augenstein
95
117
0
28 Dec 2021
"A Passage to India": Pre-trained Word Embeddings for Indian Languages
"A Passage to India": Pre-trained Word Embeddings for Indian Languages
Saurav Kumar
Saunack Kumar
Diptesh Kanojia
P. Bhattacharyya
118
31
0
27 Dec 2021
Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded
  Language from Percepts and Raw Speech
Bridging the Gap: Using Deep Acoustic Representations to Learn Grounded Language from Percepts and Raw Speech
Gaoussou Youssouf Kebe
Luke E. Richards
Edward Raff
Francis Ferraro
Cynthia Matuszek
SSL
92
5
0
27 Dec 2021
ArT: All-round Thinker for Unsupervised Commonsense Question-Answering
ArT: All-round Thinker for Unsupervised Commonsense Question-Answering
Jiawei Wang
Hai Zhao
LLMAGLRM
88
3
0
26 Dec 2021
PerCQA: Persian Community Question Answering Dataset
PerCQA: Persian Community Question Answering Dataset
Naghme Jamali
Yadollah Yaghoobzadeh
H. Faili
47
8
0
25 Dec 2021
Analyzing Scientific Publications using Domain-Specific Word Embedding
  and Topic Modelling
Analyzing Scientific Publications using Domain-Specific Word Embedding and Topic Modelling
Trisha Singhal
Junhua Liu
L. Blessing
Kwan Hui Lim
32
7
0
24 Dec 2021
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training
  for Language Understanding and Generation
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
Shuohuan Wang
Yu Sun
Yang Xiang
Zhihua Wu
Siyu Ding
...
Tian Wu
Wei Zeng
Ge Li
Wen Gao
Haifeng Wang
ELM
92
78
0
23 Dec 2021
Do Multi-Lingual Pre-trained Language Models Reveal Consistent Token
  Attributions in Different Languages?
Do Multi-Lingual Pre-trained Language Models Reveal Consistent Token Attributions in Different Languages?
Junxiang Wang
Xuchao Zhang
Bo Zong
Yanchi Liu
Wei Cheng
Jingchao Ni
Haifeng Chen
Liang Zhao
AAML
64
0
0
23 Dec 2021
A Label Dependence-aware Sequence Generation Model for Multi-level
  Implicit Discourse Relation Recognition
A Label Dependence-aware Sequence Generation Model for Multi-level Implicit Discourse Relation Recognition
Changxing Wu
Liuwen Cao
Yubin Ge
Yang Liu
Min Zhang
Jinsong Su
57
32
0
22 Dec 2021
A Survey of Natural Language Generation
A Survey of Natural Language Generation
Chenhe Dong
Hai-Tao Zheng
Haifan Gong
Mengzhao Chen
Junxin Li
Ying Shen
Min Yang
3DV
89
45
0
22 Dec 2021
How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial
  Robustness?
How Should Pre-Trained Language Models Be Fine-Tuned Towards Adversarial Robustness?
Xinhsuai Dong
Anh Tuan Luu
Min Lin
Shuicheng Yan
Hanwang Zhang
SILMAAML
71
62
0
22 Dec 2021
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media
  Knowledge Extraction and Grounding
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
Revanth Reddy Gangi Reddy
Xilin Rui
Manling Li
Xudong Lin
Haoyang Wen
...
Joey Tianyi Zhou
Avirup Sil
Shih-Fu Chang
Alex Schwing
Heng Ji
80
32
0
20 Dec 2021
Efficient Large Scale Language Modeling with Mixtures of Experts
Efficient Large Scale Language Modeling with Mixtures of Experts
Mikel Artetxe
Shruti Bhosale
Naman Goyal
Todor Mihaylov
Myle Ott
...
Jeff Wang
Luke Zettlemoyer
Mona T. Diab
Zornitsa Kozareva
Ves Stoyanov
MoE
245
201
0
20 Dec 2021
Between words and characters: A Brief History of Open-Vocabulary
  Modeling and Tokenization in NLP
Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Sabrina J. Mielke
Zaid Alyafeai
Elizabeth Salesky
Colin Raffel
Manan Dey
...
Arun Raja
Chenglei Si
Wilson Y. Lee
Benoît Sagot
Samson Tan
113
151
0
20 Dec 2021
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting
  Previously Fact-Checked Claims
Article Reranking by Memory-Enhanced Key Sentence Matching for Detecting Previously Fact-Checked Claims
Qiang Sheng
Juan Cao
Xueyao Zhang
Xirong Li
L. Zhong
KELM
78
28
0
20 Dec 2021
hybrid-Falcon: Hybrid Pattern Malware Detection and Categorization with
  Network Traffic and Program Code
hybrid-Falcon: Hybrid Pattern Malware Detection and Categorization with Network Traffic and Program Code
Peng Xu
Claudia Eckert
Apostolis Zarras
82
4
0
19 Dec 2021
Word Graph Guided Summarization for Radiology Findings
Word Graph Guided Summarization for Radiology Findings
Jinpeng Hu
Jianling Li
Zhihong Chen
Yaling Shen
Yan Song
Xiang Wan
Tsung-Hui Chang
79
38
0
18 Dec 2021
An Empirical Investigation of the Role of Pre-training in Lifelong
  Learning
An Empirical Investigation of the Role of Pre-training in Lifelong Learning
Sanket Vaibhav Mehta
Darshan Patil
Sarath Chandar
Emma Strubell
CLL
156
145
0
16 Dec 2021
An Unsupervised Way to Understand Artifact Generating Internal Units in
  Generative Neural Networks
An Unsupervised Way to Understand Artifact Generating Internal Units in Generative Neural Networks
Haedong Jeong
Jiyeon Han
Jaesik Choi
59
3
0
16 Dec 2021
Harnessing Cross-lingual Features to Improve Cognate Detection for
  Low-resource Languages
Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages
Diptesh Kanojia
Raj Dabre
Shubham Dewangan
P. Bhattacharyya
Gholamreza Haffari
Malhar A. Kulkarni
55
5
0
16 Dec 2021
Efficient Hierarchical Domain Adaptation for Pretrained Language Models
Efficient Hierarchical Domain Adaptation for Pretrained Language Models
Alexandra Chronopoulou
Matthew E. Peters
Jesse Dodge
92
44
0
16 Dec 2021
CLIN-X: pre-trained language models and a study on cross-task transfer
  for concept extraction in the clinical domain
CLIN-X: pre-trained language models and a study on cross-task transfer for concept extraction in the clinical domain
Lukas Lange
Heike Adel
Jannik Strötgen
Dietrich Klakow
AILawLM&MA
83
21
0
16 Dec 2021
Reconsidering the Past: Optimizing Hidden States in Language Models
Reconsidering the Past: Optimizing Hidden States in Language Models
Davis Yoshida
Kevin Gimpel
BDL
63
2
0
16 Dec 2021
Explainable Natural Language Processing with Matrix Product States
Explainable Natural Language Processing with Matrix Product States
J. Tangpanitanon
Chanatip Mangkang
P. Bhadola
Yuichiro Minato
D. Angelakis
Thiparat Chotibut
79
5
0
16 Dec 2021
Learning Rich Representation of Keyphrases from Text
Learning Rich Representation of Keyphrases from Text
Mayank Kulkarni
Debanjan Mahata
Ravneet Arora
Rajarshi Bhowmik
VLM
76
68
0
16 Dec 2021
Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing
  Results and Analysis
Penn-Helsinki Parsed Corpus of Early Modern English: First Parsing Results and Analysis
S. Kulick
Neville Ryant
Beatrice Santorini
23
3
0
15 Dec 2021
Previous
123...242526...899091
Next