ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.05365
  4. Cited By
Deep contextualized word representations
v1v2 (latest)

Deep contextualized word representations

15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
    NAI
ArXiv (abs)PDFHTML

Papers citing "Deep contextualized word representations"

50 / 4,508 papers shown
Title
ALLaM: Large Language Models for Arabic and English
ALLaM: Large Language Models for Arabic and English
M Saiful Bari
Yazeed Alnumay
Norah A. Alzahrani
Nouf M. Alotaibi
H. A. Alyahya
...
Jeril Kuriakose
Abdalghani Abujabal
Nora Al-Twairesh
Areeb Alowisheq
Haidar Khan
73
17
0
22 Jul 2024
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text
  Generation: A State-of-the-Art Investigation
Impact of Model Size on Fine-tuned LLM Performance in Data-to-Text Generation: A State-of-the-Art Investigation
Joy Mahapatra
Utpal Garain
89
10
0
19 Jul 2024
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by
  Direct Preference Optimization
Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization
Md Sultan al Nahian
R. Kavuluru
MedImAI4CE
56
0
0
19 Jul 2024
Dynamic Sentiment Analysis with Local Large Language Models using
  Majority Voting: A Study on Factors Affecting Restaurant Evaluation
Dynamic Sentiment Analysis with Local Large Language Models using Majority Voting: A Study on Factors Affecting Restaurant Evaluation
Junichiro Niimi
84
4
0
18 Jul 2024
Word Embedding Dimension Reduction via Weakly-Supervised Feature
  Selection
Word Embedding Dimension Reduction via Weakly-Supervised Feature Selection
Jintang Xue
Yun Cheng Wang
Chengwei Wei
C.-C. Jay Kuo
78
0
0
17 Jul 2024
Lacuna Language Learning: Leveraging RNNs for Ranked Text Completion in
  Digitized Coptic Manuscripts
Lacuna Language Learning: Leveraging RNNs for Ranked Text Completion in Digitized Coptic Manuscripts
Lauren Levine
Cindy Tung Li
Lydia Bremer-McCollum
Nicholas Wagner
Amir Zeldes
RALM
69
1
0
17 Jul 2024
MaskMoE: Boosting Token-Level Learning via Routing Mask in
  Mixture-of-Experts
MaskMoE: Boosting Token-Level Learning via Routing Mask in Mixture-of-Experts
Zhenpeng Su
Zijia Lin
Xue Bai
Xing Wu
Yizhe Xiong
...
Guangyuan Ma
Hui Chen
Guiguang Ding
Wei Zhou
Songlin Hu
MoE
93
5
0
13 Jul 2024
A Survey on Symbolic Knowledge Distillation of Large Language Models
A Survey on Symbolic Knowledge Distillation of Large Language Models
Kamal Acharya
Alvaro Velasquez
Haoze Song
SyDa
74
7
0
12 Jul 2024
Training on the Test Task Confounds Evaluation and Emergence
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo
Florian E. Dorner
Moritz Hardt
ELM
154
9
1
10 Jul 2024
Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?
Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?
Leonidas Zotos
H. Rijn
Malvina Nissim
ELM
92
3
0
07 Jul 2024
Toucan: Many-to-Many Translation for 150 African Language Pairs
Toucan: Many-to-Many Translation for 150 African Language Pairs
AbdelRahim Elmadany
Ife Adebara
Muhammad Abdul-Mageed
68
3
0
05 Jul 2024
Survey on Knowledge Distillation for Large Language Models: Methods,
  Evaluation, and Application
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
Chuanpeng Yang
Wang Lu
Yao Zhu
Yidong Wang
Qian Chen
Chenlong Gao
Bingjie Yan
Yiqiang Chen
ALMKELM
101
32
0
02 Jul 2024
Cross-Lingual Transfer Learning for Speech Translation
Cross-Lingual Transfer Learning for Speech Translation
Rao Ma
Yassir Fathullah
Mengjie Qian
Siyuan Tang
Mark Gales
Kate Knill
174
4
0
01 Jul 2024
MALSIGHT: Exploring Malicious Source Code and Benign Pseudocode for Iterative Binary Malware Summarization
MALSIGHT: Exploring Malicious Source Code and Benign Pseudocode for Iterative Binary Malware Summarization
Haolang Lu
Hongrui Peng
Guoshun Nan
Jiaoyang Cui
Cheng Wang
Weifei Jin
Songtao Wang
Shengli Pan
Xiaofeng Tao
69
4
0
26 Jun 2024
MoE-CT: A Novel Approach For Large Language Models Training With
  Resistance To Catastrophic Forgetting
MoE-CT: A Novel Approach For Large Language Models Training With Resistance To Catastrophic Forgetting
Tianhao Li
Shangjie Li
Binbin Xie
Deyi Xiong
Baosong Yang
CLL
118
4
0
25 Jun 2024
Evaluation of Language Models in the Medical Context Under
  Resource-Constrained Settings
Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings
Andrea Posada
Daniel Rueckert
Felix Meissen
Philip Muller
LM&MAELM
58
0
0
24 Jun 2024
Evaluating the Effectiveness of the Foundational Models for Q&A
  Classification in Mental Health care
Evaluating the Effectiveness of the Foundational Models for Q&A Classification in Mental Health care
Hassan Alhuzali
Ashwag Alasmari
AI4MH
79
2
0
23 Jun 2024
The Fire Thief Is Also the Keeper: Balancing Usability and Privacy in
  Prompts
The Fire Thief Is Also the Keeper: Balancing Usability and Privacy in Prompts
Zhili Shen
Zihang Xi
Ying He
Wei Tong
Jingyu Hua
Sheng Zhong
SILM
86
8
0
20 Jun 2024
In Tree Structure Should Sentence Be Generated
In Tree Structure Should Sentence Be Generated
Yaguang Li
Xin Chen
40
0
0
20 Jun 2024
Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models
Healing Powers of BERT: How Task-Specific Fine-Tuning Recovers Corrupted Language Models
Shijie Han
Zhenyu Zhang
Andrei Arsene Simion
70
2
0
20 Jun 2024
Large Scale Transfer Learning for Tabular Data via Language Modeling
Large Scale Transfer Learning for Tabular Data via Language Modeling
Josh Gardner
Juan C. Perdomo
Ludwig Schmidt
LMTD
107
24
0
17 Jun 2024
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen
  Reference Content
RepLiQA: A Question-Answering Dataset for Benchmarking LLMs on Unseen Reference Content
Joao Monteiro
Pierre-Andre Noel
Étienne Marcotte
Sai Rajeswar
Valentina Zantedeschi
David Vazquez
Nicolas Chapados
Christopher Pal
Perouz Taslakian
65
7
0
17 Jun 2024
A Survey on Human Preference Learning for Large Language Models
A Survey on Human Preference Learning for Large Language Models
Ruili Jiang
Kehai Chen
Xuefeng Bai
Zhixuan He
Juntao Li
Muyun Yang
Tiejun Zhao
Liqiang Nie
Min Zhang
134
9
0
17 Jun 2024
To be Continuous, or to be Discrete, Those are Bits of Questions
To be Continuous, or to be Discrete, Those are Bits of Questions
Yiran Wang
Masao Utiyama
80
4
0
12 Jun 2024
Nonlinear time-series embedding by monotone variational inequality
Nonlinear time-series embedding by monotone variational inequality
Jonathan Y. Zhou
Yao Xie
AI4TS
122
0
0
11 Jun 2024
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
Decision Mamba: A Multi-Grained State Space Model with Self-Evolution Regularization for Offline RL
Qi Lv
Xiang Deng
Gongwei Chen
Michael Yu Wang
Liqiang Nie
198
8
0
08 Jun 2024
Randomized Geometric Algebra Methods for Convex Neural Networks
Randomized Geometric Algebra Methods for Convex Neural Networks
Yifei Wang
Sungyoon Kim
Paul Chu
Indu Subramaniam
Mert Pilanci
AAML
125
0
0
04 Jun 2024
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models
Jiexin Wang
Adam Jatowt
Yi Cai
AI4CE
96
1
0
04 Jun 2024
EduNLP: Towards a Unified and Modularized Library for Educational
  Resources
EduNLP: Towards a Unified and Modularized Library for Educational Resources
Zhenya Huang
Yuting Ning
Longhu Qin
Shiwei Tong
Shangzi Xue
...
Xin Lin
Jia-Yin Liu
Qi Liu
Enhong Chen
Shijin Wang
AI4Ed
75
1
0
03 Jun 2024
Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for
  Accurate Natural Language Task Modeling
Synergizing Unsupervised and Supervised Learning: A Hybrid Approach for Accurate Natural Language Task Modeling
Wrick Talukdar
Anjanava Biswas
52
5
0
03 Jun 2024
Identifiability of a statistical model with two latent vectors:
  Importance of the dimensionality relation and application to graph embedding
Identifiability of a statistical model with two latent vectors: Importance of the dimensionality relation and application to graph embedding
Hiroaki Sasaki
CML
54
0
0
30 May 2024
Recent advances in text embedding: A Comprehensive Review of
  Top-Performing Methods on the MTEB Benchmark
Recent advances in text embedding: A Comprehensive Review of Top-Performing Methods on the MTEB Benchmark
Hongliu Cao
AI4TS
104
15
0
27 May 2024
From Frege to chatGPT: Compositionality in language, cognition, and deep
  neural networks
From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks
Jacob Russin
Sam Whitman McGrath
Danielle J. Williams
Lotem Elber-Dorozko
AI4CE
195
4
0
24 May 2024
CEEBERT: Cross-Domain Inference in Early Exit BERT
CEEBERT: Cross-Domain Inference in Early Exit BERT
Divya J. Bajpai
M. Hanawal
LRM
84
5
0
23 May 2024
A Survey on Vision-Language-Action Models for Embodied AI
A Survey on Vision-Language-Action Models for Embodied AI
Yueen Ma
Zixing Song
Yuzheng Zhuang
Jianye Hao
Irwin King
LM&Ro
335
54
0
23 May 2024
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for
  Underrepresented Languages
Efficacy of ByT5 in Multilingual Translation of Biblical Texts for Underrepresented Languages
Corinne Aars
Lauren Adams
Xiaokan Tian
Zhaoyu Wang
Colton Wismer
Jason Wu
Pablo Rivas
Korn Sooksatra
Matthew Fendt
43
0
0
22 May 2024
CReMa: Crisis Response through Computational Identification and Matching
  of Cross-Lingual Requests and Offers Shared on Social Media
CReMa: Crisis Response through Computational Identification and Matching of Cross-Lingual Requests and Offers Shared on Social Media
Rabindra Lamsal
M. Read
S. Karunasekera
Muhammad Imran
62
3
0
20 May 2024
Large Language Models Lack Understanding of Character Composition of
  Words
Large Language Models Lack Understanding of Character Composition of Words
Andrew Shin
Kunitake Kaneko
104
11
0
18 May 2024
Multilingual Substitution-based Word Sense Induction
Multilingual Substitution-based Word Sense Induction
Denis Kokosinskii
Nikolay Arefyev
47
2
0
17 May 2024
Generative Artificial Intelligence: A Systematic Review and Applications
Generative Artificial Intelligence: A Systematic Review and Applications
S. S. Sengar
Affan Bin Hasan
Sanjay Kumar
Fiona Carroll
MedIm
76
74
0
17 May 2024
Multi-Evidence based Fact Verification via A Confidential Graph Neural
  Network
Multi-Evidence based Fact Verification via A Confidential Graph Neural Network
Yuqing Lan
Zhenghao Liu
Yu Gu
Xiaoyuan Yi
Xiaohua Li
Liner Yang
Ge Yu
86
1
0
17 May 2024
A Survey on Transformers in NLP with Focus on Efficiency
A Survey on Transformers in NLP with Focus on Efficiency
Wazib Ansar
Saptarsi Goswami
Amlan Chakrabarti
MedIm
93
2
0
15 May 2024
A Comprehensive Analysis of Static Word Embeddings for Turkish
A Comprehensive Analysis of Static Word Embeddings for Turkish
Karahan Sarıtaş
Cahid Arda Öz
Tunga Güngör
48
4
0
13 May 2024
LGDE: Local Graph-based Dictionary Expansion
LGDE: Local Graph-based Dictionary Expansion
Dominik J. Schindler
Sneha Jha
Xixuan Zhang
Kilian Buehling
Annett Heft
Mauricio Barahona
63
0
0
13 May 2024
Boosting House Price Estimations with Multi-Head Gated Attention
Boosting House Price Estimations with Multi-Head Gated Attention
A. Sellam
C. Distante
Abdelmalik Taleb-Ahmed
P. Mazzeo
47
2
0
13 May 2024
Automating Thematic Analysis: How LLMs Analyse Controversial Topics
Automating Thematic Analysis: How LLMs Analyse Controversial Topics
Awais Hameed Khan
H. Kegalle
Rhea D'Silva
Ned Watt
Daniel Whelan-Shamy
Lida Ghahremanlou
Liam Magee
91
7
0
11 May 2024
Word-specific tonal realizations in Mandarin
Word-specific tonal realizations in Mandarin
Yu-Ying Chuang
Melanie J. Bell
Yu-Hsiang Tseng
R. Baayen
145
5
0
11 May 2024
Open Challenges and Opportunities in Federated Foundation Models Towards
  Biomedical Healthcare
Open Challenges and Opportunities in Federated Foundation Models Towards Biomedical Healthcare
Xingyu Li
Lu Peng
Yuping Wang
Weihua Zhang
AI4CEMedImLM&MA
114
12
0
10 May 2024
Natural Language Processing RELIES on Linguistics
Natural Language Processing RELIES on Linguistics
Juri Opitz
Shira Wein
Nathan Schneider
AI4CE
165
8
0
09 May 2024
Revisiting character-level adversarial attacks
Revisiting character-level adversarial attacks
Elias Abad Rocamora
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
Volkan Cevher
AAML
96
4
0
07 May 2024
Previous
12345...899091
Next