ResearchTrend.AI
Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Ensemble Transfer Learning for Multilingual Coreference Resolution
T. Lai
Heng Ji
46
1
0
22 Jan 2023
Interpretability in Activation Space Analysis of Transformers: A Focused Survey
Soniya Vijayakumar
AI4CE
66
4
0
22 Jan 2023
Differentially Private Natural Language Models: Recent Advances and Future Directions
Lijie Hu
Ivan Habernal
Lei Shen
Di Wang
AAML
98
19
0
22 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
95
11
0
17 Jan 2023
Multimodal Side-Tuning for Document Classification
S. P. Zingaro
G. Lisanti
M. Gabbrielli
53
6
0
16 Jan 2023
tieval: An Evaluation Framework for Temporal Information Extraction Systems
Hugo Sousa
A. Jorge
Ricardo Campos
128
6
0
11 Jan 2023
Few-shot Learning for Cross-Target Stance Detection by Aggregating Multimodal Embeddings
Parisa Jamadi Khiabani
A. Zubiaga
90
12
0
11 Jan 2023
GPT as Knowledge Worker: A Zero-Shot Evaluation of (AI)CPA Capabilities
Jillian Bommarito
M. Bommarito
Daniel Martin Katz
Jessica Katz
ELM
73
55
0
11 Jan 2023
Topics in Contextualised Attention Embeddings
Mozhgan Talebpour
A. G. S. D. Herrera
Shoaib Jameel
69
2
0
11 Jan 2023
Channel-aware Decoupling Network for Multi-turn Dialogue Comprehension
Zhuosheng Zhang
Hai Zhao
Longxiang Liu
BDL
93
2
0
10 Jan 2023
Universal Multimodal Representation for Language Understanding
Zhuosheng Zhang
Kehai Chen
Rui Wang
Masao Utiyama
Eiichiro Sumita
Z. Li
Hai Zhao
SSL
109
22
0
09 Jan 2023
MOTOR: A Time-To-Event Foundation Model For Structured Medical Records
E. Steinberg
Jason Alan Fries
Yizhe Xu
N. Shah
OOD AI4TS
159
19
0
09 Jan 2023
Traditional Readability Formulas Compared for English
Bruce W. Lee
J. Lee
AIMat
91
6
0
08 Jan 2023
RLAS-BIABC: A Reinforcement Learning-Based Answer Selection Using the BERT Model Boosted by an Improved ABC Algorithm
Hamid Gharagozlou
J. Mohammadzadeh
A. Bastanfard
S. S. Ghidary
48
35
0
07 Jan 2023
Facilitating Contrastive Learning of Discourse Relational Senses by Exploiting the Hierarchy of Sense Relations
Wanqiu Long
Bonnie Webber
106
34
0
06 Jan 2023
Can Large Language Models Change User Preference Adversarially?
Varshini Subhash
AAML
97
8
0
05 Jan 2023
GIVL: Improving Geographical Inclusivity of Vision-Language Models with Pre-Training Methods
Da Yin
Feng Gao
Govind Thattai
Michael F. Johnston
Kai-Wei Chang
VLM
94
15
0
05 Jan 2023
A comprehensive review of automatic text summarization techniques: method, data, evaluation and coding
D. Cajueiro
A. G. Nery
Igor Tavares
Maísa Kely de Melo
Silvia A. dos Reis
Weigang Li
V. R. R. Celestino
86
15
0
04 Jan 2023
Tsetlin Machine Embedding: Representing Words Using Logical Expressions
Bimal Bhattarai
Ole-Christoffer Granmo
Lei Jiao
Rohan Kumar Yadav
Jivitesh Sharma
NAI
56
13
0
02 Jan 2023
Integrating Semantic Information into Sketchy Reading Module of Retro-Reader for Vietnamese Machine Reading Comprehension
Hang Le
Viet-Duc Ho
Duc-Vu Nguyen
Ngan Luu-Thuy Nguyen
72
2
0
01 Jan 2023
GPT Takes the Bar Exam
M. Bommarito
Daniel Martin Katz
ELM
81
156
0
29 Dec 2022
Automatic Recognition and Classification of Future Work Sentences from Academic Articles in a Specific Domain
Chengzhi Zhang
Yi Xiang
Wenke Hao
Zhicheng Li
Yuchen Qian
Yuzhuo Wang
48
11
0
28 Dec 2022
A Survey on Knowledge-Enhanced Pre-trained Language Models
Chaoqi Zhen
Yanlei Shang
Xiangyu Liu
Yifei Li
Yong Chen
Dell Zhang
VLM KELM
92
3
0
27 Dec 2022
MicroBERT: Effective Training of Low-resource Monolingual BERTs through Parameter Reduction and Multitask Learning
Luke Gessler
Amir Zeldes
79
14
0
23 Dec 2022
SERENGETI: Massively Multilingual Language Models for Africa
Ife Adebara
AbdelRahim Elmadany
Muhammad Abdul-Mageed
Alcides Alcoba Inciarte
76
33
0
21 Dec 2022
Character-Aware Models Improve Visual Text Rendering
Rosanne Liu
Daniel H Garrette
Chitwan Saharia
William Chan
Adam Roberts
Sharan Narang
Irina Blok
R. Mical
Mohammad Norouzi
Noah Constant
VLM
117
74
0
20 Dec 2022
Pretraining Without Attention
Junxiong Wang
J. Yan
Albert Gu
Alexander M. Rush
96
49
0
20 Dec 2022
Trustworthy Social Bias Measurement
Rishi Bommasani
Percy Liang
76
11
0
20 Dec 2022
A Measure-Theoretic Characterization of Tight Language Models
Li Du
Lucas Torroba Hennigen
Tiago Pimentel
Clara Meister
Jason Eisner
Ryan Cotterell
100
32
0
20 Dec 2022
Is GPT-3 a Good Data Annotator?
Bosheng Ding
Chengwei Qin
Linlin Liu
Yew Ken Chia
Shafiq Joty
Boyang Albert Li
Lidong Bing
95
250
0
20 Dec 2022
EIT: Enhanced Interactive Transformer
Tong Zheng
Bei Li
Huiwen Bao
Tong Xiao
Jingbo Zhu
119
2
0
20 Dec 2022
PLUE: Language Understanding Evaluation Benchmark for Privacy Policies in English
Jianfeng Chi
Wasi Uddin Ahmad
Yuan Tian
Kai-Wei Chang
AILaw ELM
55
11
0
20 Dec 2022
Inducing Character-level Structure in Subword-based Language Models with Type-level Interchange Intervention Training
Jing-ling Huang
Zhengxuan Wu
Kyle Mahowald
Christopher Potts
87
14
0
19 Dec 2022
Exploring Hybrid and Ensemble Models for Multiclass Prediction of Mental Health Status on Social Media
S. Zanwar
Daniel Wiechmann
Yu Qiao
E. Kerz
AI4MH
61
5
0
19 Dec 2022
Do CoNLL-2003 Named Entity Taggers Still Work Well in 2023?
Shuheng Liu
Alan Ritter
AI4TS
87
13
0
19 Dec 2022
Enriching Relation Extraction with OpenIE
Alessandro Temperoni
M. Biryukov
Martin Theobald
51
1
0
19 Dec 2022
On Isotropy, Contextualization and Learning Dynamics of Contrastive-based Sentence Representation Learning
Chenghao Xiao
Yang Long
Noura Al Moubayed
80
10
0
18 Dec 2022
Recall, Expand and Multi-Candidate Cross-Encode: Fast and Accurate Ultra-Fine Entity Typing
Chengyue Jiang
Wenyang Hui
Yong Jiang
Xiaobin Wang
Pengjun Xie
Kewei Tu
86
4
0
18 Dec 2022
Rarely a problem? Language models exhibit inverse scaling in their predictions following few-type quantifiers
J. Michaelov
Benjamin Bergen
44
17
0
16 Dec 2022
Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation
Qian Yang
Qian Chen
Wen Wang
Baotian Hu
Min Zhang
103
27
0
16 Dec 2022
The KITMUS Test: Evaluating Knowledge Integration from Multiple Sources in Natural Language Understanding Systems
Akshatha Arodi
Martin Pömsl
Kaheer Suleman
Adam Trischler
Alexandra Olteanu
Jackie C.K. Cheung
ELM
75
5
0
15 Dec 2022
MANTa: Efficient Gradient-Based Tokenization for Robust End-to-End Language Modeling
Nathan Godey
Roman Castagné
Eric Villemonte de la Clergerie
Benoît Sagot
45
3
0
14 Dec 2022
VTCC-NLP at NL4Opt competition subtask 1: An Ensemble Pre-trained language models for Named Entity Recognition
Xuan-Dung Doan
73
6
0
14 Dec 2022
Explainability of Text Processing and Retrieval Methods: A Critical Survey
Sourav Saha
Debapriyo Majumdar
Mandar Mitra
96
5
0
14 Dec 2022
AsPOS: Assamese Part of Speech Tagger using Deep Learning Approach
Dhrubajyoti Pathak
Sukumar Nandi
Priyankoo Sarmah
62
7
0
14 Dec 2022
Distantly-Supervised Named Entity Recognition with Adaptive Teacher Learning and Fine-grained Student Ensemble
Xiaoye Qu
Jun Zeng
Daizong Liu
Zhefeng Wang
Baoxing Huai
Pan Zhou
76
22
0
13 Dec 2022
Quant 4.0: Engineering Quantitative Investment with Automated, Explainable and Knowledge-driven Artificial Intelligence
Jian Guo
Saizhuo Wang
L. Ni
H. Shum
AIFin
99
8
0
13 Dec 2022
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities
Zhe Zhao
Yudong Li
Cheng-An Hou
Jing-xin Zhao
Rong Tian
...
Xingwu Sun
Zhanhui Kang
Xiaoyong Du
Linlin Shen
Kimmo Yan
VLM
106
24
0
13 Dec 2022
Real-World Compositional Generalization with Disentangled Sequence-to-Sequence Learning
Hao Zheng
Mirella Lapata
OOD CoGe DRL
64
5
0
12 Dec 2022
Domain Adaptation of Transformer-Based Models using Unlabeled Data for Relevance and Polarity Classification of German Customer Feedback
Ahmad Idrissi-Yaghir
Henning Schafer
Nadja Bauer
Christoph M. Friedrich
80
6
0
12 Dec 2022