ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05365
  4. Cited By
Deep contextualized word representations
v1v2 (latest)

Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI
ArXiv (abs)PDFHTML

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Title
Subject Independent Emotion Recognition using EEG Signals Employing
  Attention Driven Neural Networks
Subject Independent Emotion Recognition using EEG Signals Employing Attention Driven Neural Networks
Arjun
Aniket Singh Rajpoot
Mahesh Raveendranatha Panicker
50
91
0
07 Jun 2021
A Globally Normalized Neural Model for Semantic Parsing
A Globally Normalized Neural Model for Semantic Parsing
Chenyang Huang
Wei Yang
Yanshuai Cao
Osmar Zaïane
Lili Mou
79
3
0
07 Jun 2021
SelfDoc: Self-Supervised Document Representation Learning
SelfDoc: Self-Supervised Document Representation Learning
Peizhao Li
Jiuxiang Gu
Jason Kuen
Vlad I. Morariu
Handong Zhao
R. Jain
Varun Manjunatha
Hongfu Liu
ViTSSL
92
162
0
07 Jun 2021
Exploring the Limits of Out-of-Distribution Detection
Exploring the Limits of Out-of-Distribution Detection
Stanislav Fort
Jie Jessie Ren
Balaji Lakshminarayanan
115
342
0
06 Jun 2021
Meta-Learning with Variational Semantic Memory for Word Sense
  Disambiguation
Meta-Learning with Variational Semantic Memory for Word Sense Disambiguation
Yingjun Du
Nithin Holla
Xiantong Zhen
Cees G. M. Snoek
Ekaterina Shutova
82
9
0
05 Jun 2021
Denoising Word Embeddings by Averaging in a Shared Space
Denoising Word Embeddings by Averaging in a Shared Space
Avi Caciularu
Ido Dagan
Jacob Goldberger
FedMLMoMe
67
4
0
05 Jun 2021
Integrating Auxiliary Information in Self-supervised Learning
Integrating Auxiliary Information in Self-supervised Learning
Yao-Hung Hubert Tsai
Tianqi Li
Weixin Liu
Peiyuan Liao
Ruslan Salakhutdinov
Louis-Philippe Morency
SSL
55
7
0
05 Jun 2021
MergeDistill: Merging Pre-trained Language Models using Distillation
MergeDistill: Merging Pre-trained Language Models using Distillation
Simran Khanuja
Melvin Johnson
Partha P. Talukdar
84
16
0
05 Jun 2021
Exposing the Implicit Energy Networks behind Masked Language Models via
  Metropolis--Hastings
Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis--Hastings
Kartik Goyal
Chris Dyer
Taylor Berg-Kirkpatrick
178
51
0
04 Jun 2021
The Image Local Autoregressive Transformer
The Image Local Autoregressive Transformer
Chenjie Cao
Yue Hong
Xiang Li
Chengrong Wang
C. Xu
Xiangyang Xue
Yanwei Fu
82
13
0
04 Jun 2021
Annotation Curricula to Implicitly Train Non-Expert Annotators
Annotation Curricula to Implicitly Train Non-Expert Annotators
Ji-Ung Lee
Jan-Christoph Klie
Iryna Gurevych
78
11
0
04 Jun 2021
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
BERTTune: Fine-Tuning Neural Machine Translation with BERTScore
Inigo Jauregi Unanue
Jacob Parnell
Massimo Piccardi
58
34
0
04 Jun 2021
Enabling Lightweight Fine-tuning for Pre-trained Language Model
  Compression based on Matrix Product Operators
Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators
Peiyu Liu
Ze-Feng Gao
Wayne Xin Zhao
Z. Xie
Zhong-Yi Lu
Ji-Rong Wen
48
30
0
04 Jun 2021
Anticipative Video Transformer
Anticipative Video Transformer
Rohit Girdhar
Kristen Grauman
ViT
96
212
0
03 Jun 2021
Defending Democracy: Using Deep Learning to Identify and Prevent
  Misinformation
Defending Democracy: Using Deep Learning to Identify and Prevent Misinformation
Anusua Trivedi
Alyssa Suhm
Prathamesh Mahankal
Subhiksha Mukuntharaj
Meghana D. Parab
Malvika Mohan
Meredith Berger
Arathi Sethumadhavan
A. Jaiman
Rahul Dodhia
117
0
0
03 Jun 2021
The Case for Translation-Invariant Self-Attention in Transformer-Based
  Language Models
The Case for Translation-Invariant Self-Attention in Transformer-Based Language Models
Ulme Wennberg
G. Henter
MILM
95
22
0
03 Jun 2021
Representing Syntax and Composition with Geometric Transformations
Representing Syntax and Composition with Geometric Transformations
Lorenzo Bertolini
Julie Weeds
David J. Weir
Qiwei Peng
62
2
0
03 Jun 2021
Bilingual Alignment Pre-Training for Zero-Shot Cross-Lingual Transfer
Bilingual Alignment Pre-Training for Zero-Shot Cross-Lingual Transfer
Ziqing Yang
Wentao Ma
Yiming Cui
Jiani Ye
Wanxiang Che
Shijin Wang
59
11
0
03 Jun 2021
Can Generative Pre-trained Language Models Serve as Knowledge Bases for
  Closed-book QA?
Can Generative Pre-trained Language Models Serve as Knowledge Bases for Closed-book QA?
Cunxiang Wang
Pai Liu
Yue Zhang
RALM
106
84
0
03 Jun 2021
A Unified Generative Framework for Various NER Subtasks
A Unified Generative Framework for Various NER Subtasks
Hang Yan
Tao Gui
Junqi Dai
Qipeng Guo
Zheng Zhang
Xipeng Qiu
96
298
0
02 Jun 2021
RevCore: Review-augmented Conversational Recommendation
RevCore: Review-augmented Conversational Recommendation
Yu Lu
Junwei Bao
Yan Song
Zichen Ma
Shuguang Cui
Youzheng Wu
Xiaodong He
136
77
0
02 Jun 2021
MathBERT: A Pre-trained Language Model for General NLP Tasks in
  Mathematics Education
MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education
J. Shen
Michiharu Yamashita
Ethan Prihar
Neil T. Heffernan
Xintao Wu
Ben Graff
Dongwon Lee
84
61
0
02 Jun 2021
Exploiting Global Contextual Information for Document-level Named Entity
  Recognition
Exploiting Global Contextual Information for Document-level Named Entity Recognition
Zanbo Wang
Wei Wei
Xian-Ling Mao
Shanshan Feng
Pan Zhou
Zhiyong He
Sheng Jiang
158
6
0
02 Jun 2021
Conversational Question Answering: A Survey
Conversational Question Answering: A Survey
Munazza Zaib
Wei Emma Zhang
Quan Z. Sheng
A. Mahmood
Yang Zhang
89
91
0
02 Jun 2021
Implicit Representations of Meaning in Neural Language Models
Implicit Representations of Meaning in Neural Language Models
Belinda Z. Li
Maxwell Nye
Jacob Andreas
NAIMILM
91
177
0
01 Jun 2021
SpanNER: Named Entity Re-/Recognition as Span Prediction
SpanNER: Named Entity Re-/Recognition as Span Prediction
Jinlan Fu
Xuanjing Huang
Pengfei Liu
75
101
0
01 Jun 2021
NewsEmbed: Modeling News through Pre-trained Document Representations
NewsEmbed: Modeling News through Pre-trained Document Representations
Jialu Liu
Tianqi Liu
Cong Yu
VLM
223
12
0
01 Jun 2021
Sub-Character Tokenization for Chinese Pretrained Language Models
Sub-Character Tokenization for Chinese Pretrained Language Models
Chenglei Si
Zhengyan Zhang
Yingfa Chen
Fanchao Qi
Xiaozhi Wang
Zhiyuan Liu
Yasheng Wang
Qun Liu
Maosong Sun
75
12
0
01 Jun 2021
An In-depth Study on Internal Structure of Chinese Words
An In-depth Study on Internal Structure of Chinese Words
Chen Gong
Saihao Huang
Houquan Zhou
Zhenghua Li
Hao Fei
Zhefeng Wang
Baoxing Huai
N. Yuan
45
2
0
01 Jun 2021
Distribution Matching for Rationalization
Distribution Matching for Rationalization
Yongfeng Huang
Yujun Chen
Yulun Du
Zhilin Yang
OOD
67
18
0
01 Jun 2021
Preview, Attend and Review: Schema-Aware Curriculum Learning for
  Multi-Domain Dialog State Tracking
Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking
Yinpei Dai
Hangyu Li
Yongbin Li
Jian Sun
Fei Huang
Luo Si
Xiao-Dan Zhu
92
53
0
01 Jun 2021
Volta at SemEval-2021 Task 9: Statement Verification and Evidence
  Finding with Tables using TAPAS and Transfer Learning
Volta at SemEval-2021 Task 9: Statement Verification and Evidence Finding with Tables using TAPAS and Transfer Learning
Devansh Gautam
Kshitij Gupta
Manish Shrivastava
LMTD
56
6
0
01 Jun 2021
Corpus-Based Paraphrase Detection Experiments and Review
Corpus-Based Paraphrase Detection Experiments and Review
T. Vrbanec
A. Meštrović
132
31
0
31 May 2021
Training ELECTRA Augmented with Multi-word Selection
Training ELECTRA Augmented with Multi-word Selection
Jiaming Shen
Jialu Liu
Tianqi Liu
Cong Yu
Jiawei Han
97
9
0
31 May 2021
Generalized AdaGrad (G-AdaGrad) and Adam: A State-Space Perspective
Generalized AdaGrad (G-AdaGrad) and Adam: A State-Space Perspective
Kushal Chakrabarti
Nikhil Chopra
ODLAI4CE
83
9
0
31 May 2021
How transfer learning impacts linguistic knowledge in deep NLP models?
How transfer learning impacts linguistic knowledge in deep NLP models?
Nadir Durrani
Hassan Sajjad
Fahim Dalvi
47
51
0
31 May 2021
Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model
Analogous to Evolutionary Algorithm: Designing a Unified Sequence Model
Jiangning Zhang
Chao Xu
Jian Li
Wenzhou Chen
Yabiao Wang
Ying Tai
Shuo Chen
Chengjie Wang
Feiyue Huang
Yong Liu
108
22
0
31 May 2021
M6-T: Exploring Sparse Expert Models and Beyond
M6-T: Exploring Sparse Expert Models and Beyond
An Yang
Junyang Lin
Rui Men
Chang Zhou
Le Jiang
...
Dingyang Zhang
Wei Lin
Lin Qu
Jingren Zhou
Hongxia Yang
MoE
130
24
0
31 May 2021
An Exploratory Analysis of the Relation Between Offensive Language and
  Mental Health
An Exploratory Analysis of the Relation Between Offensive Language and Mental Health
Ana-Maria Bucur
Marcos Zampieri
Liviu P. Dinu
50
23
0
31 May 2021
A Multilingual Modeling Method for Span-Extraction Reading Comprehension
A Multilingual Modeling Method for Span-Extraction Reading Comprehension
Gaochen Wu
Bin Xu
Dejie Chang
Bangchang Liu
35
1
0
31 May 2021
Effective Batching for Recurrent Neural Network Grammars
Effective Batching for Recurrent Neural Network Grammars
Hiroshi Noji
Yohei Oseki
GNN
83
17
0
31 May 2021
HIT: A Hierarchically Fused Deep Attention Network for Robust Code-mixed
  Language Representation
HIT: A Hierarchically Fused Deep Attention Network for Robust Code-mixed Language Representation
Ayan Sengupta
S. Bhattacharjee
Tanmoy Chakraborty
Md. Shad Akhtar
55
14
0
30 May 2021
Drop Clause: Enhancing Performance, Interpretability and Robustness of
  the Tsetlin Machine
Drop Clause: Enhancing Performance, Interpretability and Robustness of the Tsetlin Machine
Jivitesh Sharma
Rohan Kumar Yadav
Ole-Christoffer Granmo
Lei Jiao
VLM
58
12
0
30 May 2021
Pre-training Universal Language Representation
Pre-training Universal Language Representation
Yian Li
Hai Zhao
SSL
67
8
0
30 May 2021
Sentiment analysis in tweets: an assessment study from classical to
  modern text representation models
Sentiment analysis in tweets: an assessment study from classical to modern text representation models
Sérgio Barreto
Ricardo Moura
Jonnathan Carvalho
A. Paes
A. Plastino
87
14
0
29 May 2021
CommitBERT: Commit Message Generation Using Pre-Trained Programming
  Language Model
CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model
Tae-Hwan Jung
VLM
61
31
0
29 May 2021
MixerGAN: An MLP-Based Architecture for Unpaired Image-to-Image
  Translation
MixerGAN: An MLP-Based Architecture for Unpaired Image-to-Image Translation
George Cazenavette
Manuel Ladron de Guevara
90
17
0
28 May 2021
Knowledge Inheritance for Pre-trained Language Models
Knowledge Inheritance for Pre-trained Language Models
Yujia Qin
Yankai Lin
Jing Yi
Jiajie Zhang
Xu Han
...
Yusheng Su
Zhiyuan Liu
Peng Li
Maosong Sun
Jie Zhou
VLM
85
50
0
28 May 2021
ByT5: Towards a token-free future with pre-trained byte-to-byte models
ByT5: Towards a token-free future with pre-trained byte-to-byte models
Linting Xue
Aditya Barua
Noah Constant
Rami Al-Rfou
Sharan Narang
Mihir Kale
Adam Roberts
Colin Raffel
187
509
0
28 May 2021
Alleviating the Knowledge-Language Inconsistency: A Study for Deep
  Commonsense Knowledge
Alleviating the Knowledge-Language Inconsistency: A Study for Deep Commonsense Knowledge
Yi Zhang
Lei Li
Yunfang Wu
Qi Su
Xu Sun
54
4
0
28 May 2021
Previous
123...333435...899091
Next