ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05365
  4. Cited By
Deep contextualized word representations
v1v2 (latest)

Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI
ArXiv (abs)PDFHTML

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Title
Logographic Information Aids Learning Better Representations for Natural
  Language Inference
Logographic Information Aids Learning Better Representations for Natural Language Inference
Zijian Jin
Duygu Ataman
58
1
0
03 Nov 2022
TOE: A Grid-Tagging Discontinuous NER Model Enhanced by Embedding
  Tag/Word Relations and More Fine-Grained Tags
TOE: A Grid-Tagging Discontinuous NER Model Enhanced by Embedding Tag/Word Relations and More Fine-Grained Tags
Jiang-Dong Liu
Donghong Ji
Jingye Li
Dongdong Xie
Chong Teng
Liang Zhao
Fei Li
73
15
0
01 Nov 2022
VarMAE: Pre-training of Variational Masked Autoencoder for
  Domain-adaptive Language Understanding
VarMAE: Pre-training of Variational Masked Autoencoder for Domain-adaptive Language Understanding
Dou Hu
Xiaolong Hou
Xiyang Du
Mengyuan Zhou
Lian-Xin Jiang
Yang Mo
Xiaofeng Shi
99
13
0
01 Nov 2022
The future is different: Large pre-trained language models fail in
  prediction tasks
The future is different: Large pre-trained language models fail in prediction tasks
K. Cvejoski
Ramses J. Sanchez
C. Ojeda
87
4
0
01 Nov 2022
Transfer Learning with Kernel Methods
Transfer Learning with Kernel Methods
Adityanarayanan Radhakrishnan
Max Ruiz Luyten
Neha Prasad
Caroline Uhler
52
25
0
01 Nov 2022
WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model
  for Financial Domain
WHEN FLUE MEETS FLANG: Benchmarks and Large Pre-trained Language Model for Financial Domain
Raj Sanjay Shah
Kunal Chawla
Dheeraj Eidnani
Agam Shah
Wendi Du
Sudheer Chava
Natraj Raman
Charese Smiley
Jiaao Chen
Diyi Yang
AIFin
95
112
0
31 Oct 2022
Leveraging Pre-trained Models for Failure Analysis Triplets Generation
Leveraging Pre-trained Models for Failure Analysis Triplets Generation
Kenneth Ezukwoke
Anis Hoayek
M. Batton-Hubert
Xavier Boucher
Pascal Gounet
Jerome Adrian
64
1
0
31 Oct 2022
Emergent Linguistic Structures in Neural Networks are Fragile
Emergent Linguistic Structures in Neural Networks are Fragile
Emanuele La Malfa
Matthew Wicker
Marta Kiatkowska
84
1
0
31 Oct 2022
Improving Cause-of-Death Classification from Verbal Autopsy Reports
Improving Cause-of-Death Classification from Verbal Autopsy Reports
Thokozile Manaka
Terence L van Zyl
D. Kar
59
1
0
31 Oct 2022
Do Charge Prediction Models Learn Legal Theory?
Do Charge Prediction Models Learn Legal Theory?
Zhenwei An
Quzhe Huang
Cong Jiang
Yansong Feng
Dongyan Zhao
ELMAILaw
68
6
0
31 Oct 2022
Using Context-to-Vector with Graph Retrofitting to Improve Word
  Embeddings
Using Context-to-Vector with Graph Retrofitting to Improve Word Embeddings
Jiangbin Zheng
Yile Wang
Ge Wang
Jun Xia
Yufei Huang
Guojiang Zhao
Yue Zhang
Stan Y. Li
68
26
0
30 Oct 2022
Probing for targeted syntactic knowledge through grammatical error
  detection
Probing for targeted syntactic knowledge through grammatical error detection
Christopher Davis
Christopher Bryant
Andrew Caines
Marek Rei
P. Buttery
47
4
0
28 Oct 2022
Modeling structure-building in the brain with CCG parsing and large
  language models
Modeling structure-building in the brain with CCG parsing and large language models
Miloš Stanojević
Jonathan Brennan
Donald Dunagan
Mark Steedman
John T. Hale
53
14
0
28 Oct 2022
MABEL: Attenuating Gender Bias using Textual Entailment Data
MABEL: Attenuating Gender Bias using Textual Entailment Data
Jacqueline He
Mengzhou Xia
C. Fellbaum
Danqi Chen
54
32
0
26 Oct 2022
A Robust Bias Mitigation Procedure Based on the Stereotype Content Model
A Robust Bias Mitigation Procedure Based on the Stereotype Content Model
Eddie L. Ungless
Amy Rafferty
Hrichika Nag
Bjorn Ross
72
30
0
26 Oct 2022
End-to-End Multimodal Representation Learning for Video Dialog
End-to-End Multimodal Representation Learning for Video Dialog
Huda AlAmri
Anthony Bilic
Michael Hu
Apoorva Beedu
Irfan Essa
84
7
0
26 Oct 2022
Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning
Inducer-tuning: Connecting Prefix-tuning and Adapter-tuning
Yifan Chen
Devamanyu Hazarika
Mahdi Namazifar
Yang Liu
Di Jin
Dilek Z. Hakkani-Tür
63
4
0
26 Oct 2022
Revision for Concision: A Constrained Paraphrase Generation Task
Revision for Concision: A Constrained Paraphrase Generation Task
Wenchuan Mu
Kwanin Lim
70
3
0
25 Oct 2022
Leveraging Open Data and Task Augmentation to Automated Behavioral
  Coding of Psychotherapy Conversations in Low-Resource Scenarios
Leveraging Open Data and Task Augmentation to Automated Behavioral Coding of Psychotherapy Conversations in Low-Resource Scenarios
Zhuohao Chen
Nikolaos Flemotomos
Zac E. Imel
David C. Atkins
Shrikanth Narayanan
66
4
0
25 Oct 2022
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal
  Language Models
Towards Better Few-Shot and Finetuning Performance with Forgetful Causal Language Models
Hao Liu
Xinyang Geng
Lisa Lee
Igor Mordatch
Sergey Levine
Sharan Narang
Pieter Abbeel
KELMCLL
89
2
0
24 Oct 2022
The Better Your Syntax, the Better Your Semantics? Probing Pretrained
  Language Models for the English Comparative Correlative
The Better Your Syntax, the Better Your Semantics? Probing Pretrained Language Models for the English Comparative Correlative
Leonie Weissweiler
Valentin Hofmann
Abdullatif Köksal
Hinrich Schütze
82
35
0
24 Oct 2022
Full-Text Argumentation Mining on Scientific Publications
Full-Text Argumentation Mining on Scientific Publications
Arne Binder
Bhuvanesh Verma
Leonhard Hennig
20
5
0
24 Oct 2022
Enhancing Label Consistency on Document-level Named Entity Recognition
Enhancing Label Consistency on Document-level Named Entity Recognition
Minbyul Jeong
Jaewoo Kang
48
6
0
24 Oct 2022
A Greek Parliament Proceedings Dataset for Computational Linguistics and
  Political Analysis
A Greek Parliament Proceedings Dataset for Computational Linguistics and Political Analysis
Konstantina Dritsa
Kaiti Thoma
John Pavlopoulos
Panos Louridas
AILaw
58
1
0
23 Oct 2022
Span-based joint entity and relation extraction augmented with sequence
  tagging mechanism
Span-based joint entity and relation extraction augmented with sequence tagging mechanism
Bing Ji
Shasha Li
Hao Xu
Jie Yu
Jun Ma
Bin Ji
Jing Yang
79
4
0
23 Oct 2022
A BERT-based Deep Learning Approach for Reputation Analysis in Social
  Media
A BERT-based Deep Learning Approach for Reputation Analysis in Social Media
Mohammad Wali Ur Rahman
Sicong Shao
Pratik Satam
Salim Hariri
Chris Padilla
Zoe Taylor
C. Nevarez
43
5
0
23 Oct 2022
Guided contrastive self-supervised pre-training for automatic speech
  recognition
Guided contrastive self-supervised pre-training for automatic speech recognition
Aparna Khare
Minhua Wu
Saurabhchand Bhati
J. Droppo
Roland Maas
SSL
67
0
0
22 Oct 2022
What do Large Language Models Learn beyond Language?
What do Large Language Models Learn beyond Language?
Avinash Madasu
Shashank Srivastava
LRMAI4CE
73
5
0
21 Oct 2022
SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity
  Representation
SpaBERT: A Pretrained Language Model from Geographic Data for Geo-Entity Representation
Zekun Li
Jina Kim
Yao-Yi Chiang
Muhao Chen
133
32
0
21 Oct 2022
Probing with Noise: Unpicking the Warp and Weft of Embeddings
Probing with Noise: Unpicking the Warp and Weft of Embeddings
Filip Klubicka
John D. Kelleher
70
4
0
21 Oct 2022
Shift-Reduce Task-Oriented Semantic Parsing with Stack-Transformers
Shift-Reduce Task-Oriented Semantic Parsing with Stack-Transformers
Daniel Fernández-González
77
0
0
21 Oct 2022
Augmentation with Projection: Towards an Effective and Efficient Data
  Augmentation Paradigm for Distillation
Augmentation with Projection: Towards an Effective and Efficient Data Augmentation Paradigm for Distillation
Ziqi Wang
Yuexin Wu
Frederick Liu
Daogao Liu
Le Hou
Hongkun Yu
Jing Li
Heng Ji
88
5
0
21 Oct 2022
Efficiently Tuned Parameters are Task Embeddings
Efficiently Tuned Parameters are Task Embeddings
Wangchunshu Zhou
Canwen Xu
Julian McAuley
58
8
0
21 Oct 2022
Composing Ensembles of Pre-trained Models via Iterative Consensus
Composing Ensembles of Pre-trained Models via Iterative Consensus
Shuang Li
Yilun Du
J. Tenenbaum
Antonio Torralba
Igor Mordatch
MoMe
73
25
0
20 Oct 2022
An Empirical Analysis of SMS Scam Detection Systems
An Empirical Analysis of SMS Scam Detection Systems
Muhammad Salman
Muhammad Ikram
M. Kâafar
98
8
0
19 Oct 2022
Language Model Decomposition: Quantifying the Dependency and Correlation
  of Language Models
Language Model Decomposition: Quantifying the Dependency and Correlation of Language Models
Hao Zhang
31
0
0
19 Oct 2022
Detecting and analyzing missing citations to published scientific
  entities
Detecting and analyzing missing citations to published scientific entities
Jialiang Lin
Yao Yu
Jia-Qi Song
X. Shi
52
4
0
18 Oct 2022
Hidden State Variability of Pretrained Language Models Can Guide
  Computation Reduction for Transfer Learning
Hidden State Variability of Pretrained Language Models Can Guide Computation Reduction for Transfer Learning
Shuo Xie
Jiahao Qiu
Ankita Pasad
Li Du
Qing Qu
Hongyuan Mei
87
16
0
18 Oct 2022
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment
  Analysis
Sentiment-Aware Word and Sentence Level Pre-training for Sentiment Analysis
Shuai Fan
Chen Lin
Haonan Li
Zheng-Wen Lin
Jinsong Su
Hang Zhang
Yeyun Gong
Jian Guo
Nan Duan
VLM
67
19
0
18 Oct 2022
Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models
Fine-mixing: Mitigating Backdoors in Fine-tuned Language Models
Zhiyuan Zhang
Lingjuan Lyu
Xingjun Ma
Chenguang Wang
Xu Sun
AAML
66
43
0
18 Oct 2022
Less is More: A Lightweight and Robust Neural Architecture for Discourse
  Parsing
Less is More: A Lightweight and Robust Neural Architecture for Discourse Parsing
Ming Li
Ruihong Huang
59
2
0
18 Oct 2022
Improving Low-Resource Cross-lingual Parsing with Expected Statistic
  Regularization
Improving Low-Resource Cross-lingual Parsing with Expected Statistic Regularization
Thomas Effland
Michael Collins
103
6
0
17 Oct 2022
Deep Bidirectional Language-Knowledge Graph Pretraining
Deep Bidirectional Language-Knowledge Graph Pretraining
Michihiro Yasunaga
Antoine Bosselut
Hongyu Ren
Xikun Zhang
Christopher D. Manning
Percy Liang
J. Leskovec
101
205
0
17 Oct 2022
Improving Semantic Matching through Dependency-Enhanced Pre-trained
  Model with Adaptive Fusion
Improving Semantic Matching through Dependency-Enhanced Pre-trained Model with Adaptive Fusion
Jian Song
Di Liang
Rumei Li
Yun Li
Sirui Wang
Minlong Peng
Wei Wu
Yongxin Yu
67
12
0
16 Oct 2022
PAR: Political Actor Representation Learning with Social Context and
  Expert Knowledge
PAR: Political Actor Representation Learning with Social Context and Expert Knowledge
Shangbin Feng
Zhaoxuan Tan
Zilong Chen
Ningnan Wang
Peisheng Yu
Qinghua Zheng
Xiao Chang
Minnan Luo
81
9
0
15 Oct 2022
Temporal Word Meaning Disambiguation using TimeLMs
Temporal Word Meaning Disambiguation using TimeLMs
Mihir Godbole
Parth Dandavate
Aditya Kane
74
2
0
15 Oct 2022
Extracting speaker and emotion information from self-supervised speech
  models via channel-wise correlations
Extracting speaker and emotion information from self-supervised speech models via channel-wise correlations
Themos Stafylakis
Ladislav Mošner
Sofoklis Kakouros
Oldrich Plchot
L. Burget
J. Černocký
SSL
60
10
0
15 Oct 2022
Language Generation Models Can Cause Harm: So What Can We Do About It?
  An Actionable Survey
Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey
Sachin Kumar
Vidhisha Balachandran
Lucille Njoo
Antonios Anastasopoulos
Yulia Tsvetkov
ELM
189
91
0
14 Oct 2022
Predicting Fine-Tuning Performance with Probing
Predicting Fine-Tuning Performance with Probing
Zining Zhu
Soroosh Shahtalebi
Frank Rudzicz
64
10
0
13 Oct 2022
Experiments on Turkish ASR with Self-Supervised Speech Representation
  Learning
Experiments on Turkish ASR with Self-Supervised Speech Representation Learning
Ali Safaya
E. Erzin
41
1
0
13 Oct 2022
Previous
123...151617...899091
Next