ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05365
  4. Cited By
Deep contextualized word representations
v1v2 (latest)

Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI
ArXiv (abs)PDFHTML

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Title
Self-supervised speech representation learning for keyword-spotting with
  light-weight transformers
Self-supervised speech representation learning for keyword-spotting with light-weight transformers
Chenyang Gao
Yue Gu
Francesco Calivá
Yuzong Liu
OffRL
81
4
0
07 Mar 2023
A Comprehensive Survey of AI-Generated Content (AIGC): A History of
  Generative AI from GAN to ChatGPT
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
Yihan Cao
Siyu Li
Yixin Liu
Zhiling Yan
Yutong Dai
Philip S. Yu
Lichao Sun
105
555
0
07 Mar 2023
Calibrating Transformers via Sparse Gaussian Processes
Calibrating Transformers via Sparse Gaussian Processes
Wenlong Chen
Yingzhen Li
UQCV
119
12
0
04 Mar 2023
Cryptocurrency Price Prediction using Twitter Sentiment Analysis
Cryptocurrency Price Prediction using Twitter Sentiment Analysis
GB Haritha
Sahana N.B
35
8
0
03 Mar 2023
On the Provable Advantage of Unsupervised Pretraining
On the Provable Advantage of Unsupervised Pretraining
Jiawei Ge
Shange Tang
Jianqing Fan
Chi Jin
SSL
78
17
0
02 Mar 2023
BenchDirect: A Directed Language Model for Compiler Benchmarks
BenchDirect: A Directed Language Model for Compiler Benchmarks
Foivos Tsimpourlas
Pavlos Petoumenos
Min Xu
Chris Cummins
K. Hazelwood
A. Rajan
Hugh Leather
ELM
47
3
0
02 Mar 2023
Variance-reduced Clipping for Non-convex Optimization
Variance-reduced Clipping for Non-convex Optimization
Amirhossein Reisizadeh
Haochuan Li
Subhro Das
Ali Jadbabaie
96
29
0
02 Mar 2023
TimeMAE: Self-Supervised Representations of Time Series with Decoupled
  Masked Autoencoders
TimeMAE: Self-Supervised Representations of Time Series with Decoupled Masked Autoencoders
Mingyue Cheng
Qi Liu
Zhiding Liu
Haotong Zhang
Rujiao Zhang
Enhong Chen
AI4TS
138
49
0
01 Mar 2023
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language
  Understanding Tasks
How Robust is GPT-3.5 to Predecessors? A Comprehensive Study on Language Understanding Tasks
Xuanting Chen
Junjie Ye
Can Zu
Nuo Xu
Rui Zheng
Minlong Peng
Jie Zhou
Tao Gui
Qi Zhang
Xuanjing Huang
AI4MHELM
69
83
0
01 Mar 2023
Linear Spaces of Meanings: Compositional Structures in Vision-Language
  Models
Linear Spaces of Meanings: Compositional Structures in Vision-Language Models
Matthew Trager
Pramuditha Perera
Luca Zancato
Alessandro Achille
Parminder Bhatia
Stefano Soatto
CoGe
172
32
0
28 Feb 2023
Systematic Rectification of Language Models via Dead-end Analysis
Systematic Rectification of Language Models via Dead-end Analysis
Mengyao Cao
Mehdi Fatemi
Jackie C.K. Cheung
Samira Shabanian
KELM
73
16
0
27 Feb 2023
Argument Mining using BERT and Self-Attention based Embeddings
Argument Mining using BERT and Self-Attention based Embeddings
Pranjal Srivastava
P. Bhatnagar
Anurag Goel
34
11
0
27 Feb 2023
A low latency attention module for streaming self-supervised speech
  representation learning
A low latency attention module for streaming self-supervised speech representation learning
Jianbo Ma
Siqi Pan
Deepak Chandran
A. Fanelli
Richard Cartwright
59
0
0
27 Feb 2023
NoPPA: Non-Parametric Pairwise Attention Random Walk Model for Sentence
  Representation
NoPPA: Non-Parametric Pairwise Attention Random Walk Model for Sentence Representation
Xuansheng Wu
Zhiyi Zhao
Ninghao Liu
69
0
0
24 Feb 2023
KHAN: Knowledge-Aware Hierarchical Attention Networks for Accurate
  Political Stance Prediction
KHAN: Knowledge-Aware Hierarchical Attention Networks for Accurate Political Stance Prediction
Yunyong Ko
Seongeun Ryu
Soeun Han
Youngseung Jeon
Jaehoon Kim
Sohyun Park
Kyungsik Han
Hanghang Tong
Sang-Wook Kim
115
15
0
23 Feb 2023
Natural Language Processing in the Legal Domain
Natural Language Processing in the Legal Domain
Daniel Martin Katz
D. Hartung
Lauritz Gerlach
Abhik Jana
M. Bommarito
ELMAILaw
63
60
0
23 Feb 2023
Edgeformers: Graph-Empowered Transformers for Representation Learning on
  Textual-Edge Networks
Edgeformers: Graph-Empowered Transformers for Representation Learning on Textual-Edge Networks
Bowen Jin
Yu Zhang
Yu Meng
Jiawei Han
97
31
0
21 Feb 2023
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey
Tianlin Li
Guangyao Chen
Guangwu Qian
Pengcheng Gao
Xiaoyong Wei
Yaowei Wang
Yonghong Tian
Wen Gao
AI4CEVLM
157
216
0
20 Feb 2023
SanskritShala: A Neural Sanskrit NLP Toolkit with Web-Based Interface
  for Pedagogical and Annotation Purposes
SanskritShala: A Neural Sanskrit NLP Toolkit with Web-Based Interface for Pedagogical and Annotation Purposes
Jivnesh Sandhan
Anshul Agarwal
Laxmidhar Behera
Tushar Sandhan
Pawan Goyal
67
4
0
19 Feb 2023
Text Classification in the Wild: a Large-scale Long-tailed Name
  Normalization Dataset
Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset
Jiexing Qi
Shuhao Li
Zhixin Guo
Yusheng Huang
Cheng Zhou
Weinan Zhang
Xinbing Wang
Zhouhan Lin
VLM
40
0
0
19 Feb 2023
Learning Language Representations with Logical Inductive Bias
Learning Language Representations with Logical Inductive Bias
Jianshu Chen
NAIAI4CELRM
53
3
0
19 Feb 2023
New Insights for the Stability-Plasticity Dilemma in Online Continual
  Learning
New Insights for the Stability-Plasticity Dilemma in Online Continual Learning
Dahuin Jung
Dongjin Lee
Sunwon Hong
Hyemi Jang
Ho Bae
Sungroh Yoon
CLL
62
15
0
17 Feb 2023
LabelPrompt: Effective Prompt-based Learning for Relation Classification
LabelPrompt: Effective Prompt-based Learning for Relation Classification
Weinan Zhang
Xiaoning Song
Zhenhua Feng
Tianyang Xu
Xiaojun Wu
VLM
72
4
0
16 Feb 2023
InfoNCE Loss Provably Learns Cluster-Preserving Representations
InfoNCE Loss Provably Learns Cluster-Preserving Representations
Advait Parulekar
Liam Collins
Karthikeyan Shanmugam
Aryan Mokhtari
Sanjay Shakkottai
SSL
117
25
0
15 Feb 2023
On graph-based reentrancy-free semantic parsing
On graph-based reentrancy-free semantic parsing
Alban Petit
Caio Corro
GNN
58
3
0
15 Feb 2023
Performance Limits of a Deep Learning-Enabled Text Semantic
  Communication under Interference
Performance Limits of a Deep Learning-Enabled Text Semantic Communication under Interference
T. Getu
Walid Saad
Georges Kaddoum
M. Bennis
71
8
0
15 Feb 2023
EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections
  for Federated Learning with Heterogeneous Data
EPISODE: Episodic Gradient Clipping with Periodic Resampled Corrections for Federated Learning with Heterogeneous Data
M. Crawshaw
Yajie Bao
Mingrui Liu
FedML
87
8
0
14 Feb 2023
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained
  Language Models
AdapterSoup: Weight Averaging to Improve Generalization of Pretrained Language Models
Alexandra Chronopoulou
Matthew E. Peters
Alexander Fraser
Jesse Dodge
MoMe
91
72
0
14 Feb 2023
Generation of Highlights from Research Papers Using Pointer-Generator
  Networks and SciBERT Embeddings
Generation of Highlights from Research Papers Using Pointer-Generator Networks and SciBERT Embeddings
Tohida Rehman
Debarshi Kumar Sanyal
S. Chattopadhyay
Plaban Kumar Bhowmick
P. Das
77
11
0
14 Feb 2023
Exploring Category Structure with Contextual Language Models and Lexical
  Semantic Networks
Exploring Category Structure with Contextual Language Models and Lexical Semantic Networks
Joseph Renner
Pascal Denis
Rémi Gilleron
Angèle Brunellière
40
4
0
14 Feb 2023
Gradient-Based Automated Iterative Recovery for Parameter-Efficient
  Tuning
Gradient-Based Automated Iterative Recovery for Parameter-Efficient Tuning
Maximilian Mozes
Tolga Bolukbasi
Ann Yuan
Frederick Liu
Nithum Thain
Lucas Dixon
62
5
0
13 Feb 2023
Towards Agile Text Classifiers for Everyone
Towards Agile Text Classifiers for Everyone
Maximilian Mozes
Jessica Hoffmann
Katrin Tomanek
Muhamed Kouate
Nithum Thain
Ann Yuan
Tolga Bolukbasi
Lucas Dixon
103
13
0
13 Feb 2023
A Biomedical Knowledge Graph for Biomarker Discovery in Cancer
A Biomedical Knowledge Graph for Biomarker Discovery in Cancer
Md. Rezaul Karim
Lina Molinas Comet
Oya Beyan
Dietrich-Rebholz Schuhmann
Stefan Decker
89
2
0
09 Feb 2023
Zero-Shot Learning for Requirements Classification: An Exploratory Study
Zero-Shot Learning for Requirements Classification: An Exploratory Study
Waad Alhoshan
Alessio Ferrari
Liping Zhao
VLM
113
41
0
09 Feb 2023
Enhancing E-Commerce Recommendation using Pre-Trained Language Model and
  Fine-Tuning
Enhancing E-Commerce Recommendation using Pre-Trained Language Model and Fine-Tuning
Nuofan Xu
Chenhui Hu
23
2
0
09 Feb 2023
CRL+: A Novel Semi-Supervised Deep Active Contrastive Representation
  Learning-Based Text Classification Model for Insurance Data
CRL+: A Novel Semi-Supervised Deep Active Contrastive Representation Learning-Based Text Classification Model for Insurance Data
Amir Namavar Jahromi
Ebrahim Pourjafari
H. Karimipour
Amit Satpathy
Lovell Hodge
57
3
0
08 Feb 2023
EvoText: Enhancing Natural Language Generation Models via
  Self-Escalation Learning for Up-to-Date Knowledge and Improved Performance
EvoText: Enhancing Natural Language Generation Models via Self-Escalation Learning for Up-to-Date Knowledge and Improved Performance
Zheng Yuan
HU Xue
Chuxu Zhang
Yongming Liu
VLM
66
0
0
08 Feb 2023
What do Language Models know about word senses? Zero-Shot WSD with
  Language Models and Domain Inventories
What do Language Models know about word senses? Zero-Shot WSD with Language Models and Domain Inventories
Oscar Sainz
Oier López de Lacalle
Eneko Agirre
German Rigau
77
7
0
07 Feb 2023
AutoWS: Automated Weak Supervision Framework for Text Classification
AutoWS: Automated Weak Supervision Framework for Text Classification
Abhinav Bohra
Huy-Thanh Nguyen
Devashish Khatwani
NoLa
54
0
0
07 Feb 2023
Unleashing the True Potential of Sequence-to-Sequence Models for
  Sequence Tagging and Structure Parsing
Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing
Han He
Jinho Choi
78
4
0
05 Feb 2023
Improving Prediction Backward-Compatiblility in NLP Model Upgrade with
  Gated Fusion
Improving Prediction Backward-Compatiblility in NLP Model Upgrade with Gated Fusion
Yi-An Lai
Elman Mansimov
Yuqing Xie
Yan Zhang
63
4
0
04 Feb 2023
Witgenstein's influence on artificial intelligence
Witgenstein's influence on artificial intelligence
Piero Molino
Jacopo Tagliabue
53
0
0
03 Feb 2023
Curriculum-Guided Abstractive Summarization
Curriculum-Guided Abstractive Summarization
Sajad Sotudeh
Hanieh Deilamsalehy
Franck Dernoncourt
Nazli Goharian
86
2
0
02 Feb 2023
How to choose "Good" Samples for Text Data Augmentation
How to choose "Good" Samples for Text Data Augmentation
Xiaotian Lin
Nankai Lin
Yingwen Fu
Ziyu Yang
Shengyi Jiang
79
2
0
02 Feb 2023
The Flan Collection: Designing Data and Methods for Effective
  Instruction Tuning
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
Shayne Longpre
Le Hou
Tu Vu
Albert Webson
Hyung Won Chung
...
Denny Zhou
Quoc V. Le
Barret Zoph
Jason W. Wei
Adam Roberts
ALM
122
678
0
31 Jan 2023
Response-act Guided Reinforced Dialogue Generation for Mental Health
  Counseling
Response-act Guided Reinforced Dialogue Generation for Mental Health Counseling
Aseem Srivastava
Ishan Pandey
Md. Shad Akhtar
Tanmoy Chakraborty
OffRL
75
13
0
30 Jan 2023
Open Problems in Applied Deep Learning
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
115
2
0
26 Jan 2023
Improved Stock Price Movement Classification Using News Articles Based
  on Embeddings and Label Smoothing
Improved Stock Price Movement Classification Using News Articles Based on Embeddings and Label Smoothing
Luis Villamil
Ryan Bausback
Shaeke Salman
Ting Liu
Conrad Horn
Xiuwen Liu
AIFin
44
0
0
25 Jan 2023
A Stability Analysis of Fine-Tuning a Pre-Trained Model
A Stability Analysis of Fine-Tuning a Pre-Trained Model
Z. Fu
Anthony Man-Cho So
Nigel Collier
62
3
0
24 Jan 2023
Topic Ontologies for Arguments
Topic Ontologies for Arguments
Yamen Ajjour
Johannes Kiesel
Benno Stein
Martin Potthast
49
6
0
23 Jan 2023
Previous
123...121314...899091
Next