Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Title
From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering
Jiangtong Li
Li Niu
Liqing Zhang
67
53
0
30 May 2022
Self-supervised models of audio effectively explain human cortical responses to speech
Aditya R. Vaidya
Shailee Jain
Alexander G. Huth
86
50
0
27 May 2022
AANG: Automating Auxiliary Learning
Lucio Dery
Paul Michel
M. Khodak
Graham Neubig
Ameet Talwalkar
114
9
0
27 May 2022
Understanding Long Programming Languages with Structure-Aware Sparse Attention
Tingting Liu
Chengyu Wang
Cen Chen
Ming Gao
Aoying Zhou
65
3
0
27 May 2022
Federated Split BERT for Heterogeneous Text Classification
Zhengyang Li
Shijing Si
Jianzong Wang
Jing Xiao
FedML
91
21
0
26 May 2022
Matryoshka Representation Learning
Aditya Kusupati
Gantavya Bhatt
Aniket Rege
Matthew Wallingford
Aditya Sinha
...
William Howard-Snyder
Kaifeng Chen
Sham Kakade
Prateek Jain
Ali Farhadi
152
91
0
26 May 2022
Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling
Kaitao Song
Yichong Leng
Xu Tan
Yicheng Zou
Tao Qin
Dongsheng Li
111
11
0
25 May 2022
Large Language Models are Few-Shot Clinical Information Extractors
Monica Agrawal
S. Hegselmann
Hunter Lang
Yoon Kim
David Sontag
BDL, LM&MA
263
351
0
25 May 2022
RLPrompt: Optimizing Discrete Text Prompts with Reinforcement Learning
Mingkai Deng
Jianyu Wang
Cheng-Ping Hsieh
Yihan Wang
Han Guo
Tianmin Shu
Meng Song
Eric Xing
Zhiting Hu
97
345
0
25 May 2022
Toward Understanding Bias Correlations for Mitigation in NLP
Lu Cheng
Suyu Ge
Huan Liu
72
9
0
24 May 2022
Auxiliary Task Guided Interactive Attention Model for Question Difficulty Prediction
Venktesh V
Md. Shad Akhtar
Mukesh Mohania
Vikram Goyal
24
0
0
24 May 2022
GeoMLAMA: Geo-Diverse Commonsense Probing on Multilingual Pre-Trained Language Models
Da Yin
Hritik Bansal
Masoud Monajatipoor
Liunian Harold Li
Kai-Wei Chang
97
31
0
24 May 2022
Formulating Few-shot Fine-tuning Towards Language Model Pre-training: A Pilot Study on Named Entity Recognition
Zihan Wang
Kewen Zhao
Zilong Wang
Jingbo Shang
75
6
0
24 May 2022
Deep Learning Meets Software Engineering: A Survey on Pre-Trained Models of Source Code
Changan Niu
Chuanyi Li
Bin Luo
Vincent Ng
SyDa, VLM
107
50
0
24 May 2022
On the Role of Bidirectionality in Language Model Pre-Training
Mikel Artetxe
Jingfei Du
Naman Goyal
Luke Zettlemoyer
Ves Stoyanov
205
17
0
24 May 2022
Semi-Parametric Inducing Point Networks and Neural Processes
R. Rastogi
Yair Schiff
Alon Hacohen
Zhaozhi Li
I-Hsiang Lee
Yuntian Deng
M. Sabuncu
Volodymyr Kuleshov
3DPC
89
7
0
24 May 2022
Local Byte Fusion for Neural Machine Translation
Makesh Narsimhan Sreedhar
Xiangpeng Wan
Yu-Jie Cheng
Junjie Hu
106
4
0
23 May 2022
QASem Parsing: Text-to-text Modeling of QA-based Semantics
Ayal Klein
Eran Hirsch
Ron Eliav
Valentina Pyatkin
Avi Caciularu
Ido Dagan
97
13
0
23 May 2022
Many-Class Text Classification with Matching
Yi-Fan Song
Yuxian Gu
Minlie Huang
VLM
29
1
0
23 May 2022
DistilCamemBERT: a distillation of the French model CamemBERT
Cyrile Delestre
Abibatou Amar
73
5
0
23 May 2022
BanglaNLG and BanglaT5: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla
Abhik Bhattacharjee
Tahmid Hasan
Wasi Uddin Ahmad
Rifat Shahriyar
AIMat, LM&MA
109
32
0
23 May 2022
Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection
Peilin Zhou
Dading Chong
Helin Wang
Qingcheng Zeng
48
5
0
23 May 2022
Life after BERT: What do Other Muppets Understand about Language?
Vladislav Lialin
Kevin Zhao
Namrata Shivagunde
Anna Rumshisky
110
6
0
21 May 2022
Self-Supervised Speech Representation Learning: A Review
Abdel-rahman Mohamed
Hung-yi Lee
Lasse Borgholt
Jakob Drachmann Havtorn
Joakim Edin
...
Shang-Wen Li
Karen Livescu
Lars Maaløe
Tara N. Sainath
Shinji Watanabe
SSL, AI4TS
291
368
0
21 May 2022
DeepStruct: Pretraining of Language Models for Structure Prediction
Chenguang Wang
Xiao Liu
Zui Chen
Haoyun Hong
Jie Tang
Dawn Song
278
71
0
21 May 2022
Heterformer: Transformer-based Deep Node Representation Learning on Heterogeneous Text-Rich Networks
Bowen Jin
Yu Zhang
Qi Zhu
Jiawei Han
143
41
0
20 May 2022
How to keep text private? A systematic review of deep learning methods for privacy-preserving natural language processing
Samuel Sousa
Roman Kern
PILM, AILaw
79
46
0
20 May 2022
Transition-based Semantic Role Labeling with Pointer Networks
Daniel Fernández-González
54
6
0
20 May 2022
Can Foundation Models Wrangle Your Data?
A. Narayan
Ines Chami
Laurel J. Orr
Simran Arora
Christopher Ré
LMTD, AI4CE
239
231
0
20 May 2022
ArabGlossBERT: Fine-Tuning BERT on Context-Gloss Pairs for WSD
Moustafa Al-Hajj
Mustafa Jarrar
58
32
0
19 May 2022
Persian Natural Language Inference: A Meta-learning approach
Heydar Soudani
Mohammadreza Mojab
H. Beigy
99
1
0
18 May 2022
When to Use Multi-Task Learning vs Intermediate Fine-Tuning for Pre-Trained Encoder Transfer Learning
Orion Weller
Kevin Seppi
Matt Gardner
62
23
0
17 May 2022
Dimensionality Reduced Training by Pruning and Freezing Parts of a Deep Neural Network, a Survey
Paul Wimmer
Jens Mehnert
Alexandru Paul Condurache
DD
98
21
0
17 May 2022
What company do words keep? Revisiting the distributional semantics of J.R. Firth & Zellig Harris
Mikael Brunila
J. LaViolette
112
21
0
16 May 2022
TiBERT: Tibetan Pre-trained Language Model
Yuan Sun
Sisi Liu
Junjie Deng
Xiaobing Zhao
94
10
0
15 May 2022
Adaptive Prompt Learning-based Few-Shot Sentiment Analysis
Pengfei Zhang
Tingting Chai
Yongdong Xu
VLM
82
13
0
15 May 2022
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT
Bowen Shi
Abdel-rahman Mohamed
Wei-Ning Hsu
SSL
69
18
0
15 May 2022
PathologyBERT -- Pre-trained Vs. A New Transformer Language Model for Pathology Domain
Thiago Santos
Amara Tariq
Susmita Das
Kavyasree Vayalpati
Geoffrey H. Smith
Hari M. Trivedi
Imon Banerjee
LM&MA, MedIm
44
18
0
13 May 2022
A Comprehensive Survey of Few-shot Learning: Evolution, Applications, Challenges, and Opportunities
Yisheng Song
Ting-Yuan Wang
S. Mondal
J. P. Sahoo
SLR
128
391
0
13 May 2022
Arithmetic-Based Pretraining -- Improving Numeracy of Pretrained Language Models
Dominic Petrak
N. Moosavi
Iryna Gurevych
AIMat
40
2
0
13 May 2022
ViT5: Pretrained Text-to-Text Transformer for Vietnamese Language Generation
Long Phan
H. Tran
Hieu Duy Nguyen
Trieu H. Trinh
ViT
109
72
0
13 May 2022
FETA: A Benchmark for Few-Sample Task Transfer in Open-Domain Dialogue
Alon Albalak
Yi-Lin Tuan
Pegah Jandaghi
Connor Pryor
Luke Yoffe
Deepak Ramachandran
Lise Getoor
Jay Pujara
William Yang Wang
79
14
0
12 May 2022
Utilizing coarse-grained data in low-data settings for event extraction
Osman Mutlu
55
2
0
11 May 2022
Detecting Emerging Technologies and their Evolution using Deep Learning and Weak Signal Analysis
Ashkan Ebadi
Alain Auger
Yvan Gauthier
59
24
0
11 May 2022
Towards Unified Prompt Tuning for Few-shot Text Classification
Jiadong Wang
Chengyu Wang
Fuli Luo
Chuanqi Tan
Minghui Qiu
Fei Yang
Qiuhui Shi
Songfang Huang
Ming Gao
VLM
70
29
0
11 May 2022
UL2: Unifying Language Learning Paradigms
Yi Tay
Mostafa Dehghani
Vinh Q. Tran
Xavier Garcia
Jason W. Wei
...
Tal Schuster
H. Zheng
Denny Zhou
N. Houlsby
Donald Metzler
AI4CE
144
313
0
10 May 2022
Extracting Latent Steering Vectors from Pretrained Language Models
Nishant Subramani
Nivedita Suresh
Matthew E. Peters
LLMSV
87
101
0
10 May 2022
A Communication-Efficient Distributed Gradient Clipping Algorithm for Training Deep Neural Networks
Mingrui Liu
Zhenxun Zhuang
Yunwei Lei
Chunyang Liao
79
20
0
10 May 2022
BLINK with Elasticsearch for Efficient Entity Linking in Business Conversations
Md Tahmid Rahman Laskar
Cheng Chen
Aliaksandr Martsinovich
Jonathan Johnston
Xue-Yong Fu
TN ShashiBhushan
Simon Corston-Oliver
79
17
0
09 May 2022
EigenNoise: A Contrastive Prior to Warm-Start Representations
H. Heidenreich
Jake Williams
38
1
0
09 May 2022