1802.05365
Cited By
v1
v2 (latest)
Deep contextualized word representations
15 February 2018
Matthew E. Peters
Mark Neumann
Mohit Iyyer
Matt Gardner
Christopher Clark
Kenton Lee
Luke Zettlemoyer
NAI
Papers citing
"Deep contextualized word representations"
50 / 4,508 papers shown
Title
Self-Attention Mechanism in Multimodal Context for Banking Transaction Flow
Cyrile Delestre
Yoann Sola
34
0
0
10 Oct 2024
Siamese networks for Poincaré embeddings and the reconstruction of evolutionary trees
Ciro Carvallo
Hernán Bocaccio
G. Mindlin
Pablo Groisman
57
0
0
09 Oct 2024
Collapsed Language Models Promote Fairness
Jingxuan Xu
Wuyang Chen
Linyi Li
Yao Zhao
Yunchao Wei
120
0
0
06 Oct 2024
Inner-Probe: Discovering Copyright-related Data Generation in LLM Architecture
Qichao Ma
Rui-Jie Zhu
Peiye Liu
Renye Yan
Fahong Zhang
...
Meng Li
Zhaofei Yu
Zongwei Wang
Yimao Cai
Tiejun Huang
80
1
0
06 Oct 2024
Deep Transfer Learning Based Peer Review Aggregation and Meta-review Generation for Scientific Articles
Md. Tarek Hasan
Mohammad Nazmush Shamael
H. M. Mutasim Billah
Arifa Akter
M. Hossain
Sumayra Islam
Salekul Islam
Swakkhar Shatabda
62
0
0
05 Oct 2024
A Multi-task Learning Framework for Evaluating Machine Translation of Emotion-loaded User-generated Content
Shenbin Qian
Constantin Orasan
Diptesh Kanojia
Félix do Carmo
98
0
0
04 Oct 2024
Financial Sentiment Analysis on News and Reports Using Large Language Models and FinBERT
Yanxin Shen
Pulin Kirin Zhang
AIFin
61
11
0
02 Oct 2024
Reasoning Elicitation in Language Models via Counterfactual Feedback
Alihan Hüyük
Xinnuo Xu
Jacqueline R. M. A. Maasch
Aditya V. Nori
Javier González
ReLM
LRM
449
3
0
02 Oct 2024
Investigating the Impact of Model Complexity in Large Language Models
Jing Luo
Huiyuan Wang
Weiran Huang
69
0
0
01 Oct 2024
The Construction of Instruction-tuned LLMs for Finance without Instruction Data Using Continual Pretraining and Model Merging
Masanori Hirano
Kentaro Imajo
MoMe
55
1
0
30 Sep 2024
Do We Need Domain-Specific Embedding Models? An Empirical Investigation
Yixuan Tang
Yi Yang
AIFin
195
6
0
27 Sep 2024
"Oh LLM, I'm Asking Thee, Please Give Me a Decision Tree": Zero-Shot Decision Tree Induction and Embedding with Large Language Models
Ricardo Knauer
Mario Koddenbrock
Raphael Wallsberger
Nicholas M. Brisson
Georg N. Duda
Deborah Falla
David W. Evans
Erik Rodner
203
0
0
27 Sep 2024
Joint Source-Channel Coding: Fundamentals and Recent Progress in Practical Designs
Deniz Gündüz
Michèle A. Wigger
Tze-Yang Tung
Ping Zhang
Yong Xiao
98
18
0
26 Sep 2024
SWE2: SubWord Enriched and Significant Word Emphasized Framework for Hate Speech Detection
Guanyi Mou
Pengyi Ye
Kyumin Lee
96
18
0
25 Sep 2024
ViBERTgrid BiLSTM-CRF: Multimodal Key Information Extraction from Unstructured Financial Documents
Furkan Pala
Mehmet Yasin Akpınar
Onur Deniz
Gülşen Eryiğit
51
0
0
23 Sep 2024
Can Language Model Understand Word Semantics as A Chatbot? An Empirical Study of Language Model Internal External Mismatch
Jinman Zhao
Xueyan Zhang
Xingyu Yue
Weizhe Chen
Zifan Qian
Ruiyu Wang
LRM
68
0
0
21 Sep 2024
One Model is All You Need: ByT5-Sanskrit, a Unified Model for Sanskrit NLP Tasks
Sebastian Nehrdich
Oliver Hellwig
Kurt Keutzer
60
5
0
20 Sep 2024
Trajectory Anomaly Detection with Language Models
Jonathan Mbuya
Dieter Pfoser
Antonios Anastasopoulos
37
2
0
18 Sep 2024
Norm of Mean Contextualized Embeddings Determines their Variance
Hiroaki Yamagiwa
Hidetoshi Shimodaira
56
0
0
17 Sep 2024
Surveying the MLLM Landscape: A Meta-Review of Current Surveys
Ming Li
Keyu Chen
Ziqian Bi
Ming Liu
Benji Peng
...
Jinlang Wang
Sen Zhang
X. Pan
Jiawei Xu
Pohsun Feng
OffRL
118
2
0
17 Sep 2024
Attention-Seeker: Dynamic Self-Attention Scoring for Unsupervised Keyphrase Extraction
Erwin D. López Z.
Cheng Tang
Atsushi Shimada
48
1
0
17 Sep 2024
AlpaPICO: Extraction of PICO Frames from Clinical Trial Documents Using LLMs
Madhusudan Ghosh
Shrimon Mukherjee
Asmit Ganguly
Partha Basuchowdhuri
S. Naskar
Debasis Ganguly
99
8
0
15 Sep 2024
Automatic Scene Generation: State-of-the-Art Techniques, Models, Datasets, Challenges, and Future Prospects
Awal Ahmed Fime
Saifuddin Mahmud
Arpita Das
Md. Sunzidul Islam
Hong-Hoon Kim
VGen
3DV
44
1
0
14 Sep 2024
Distilling Monolingual and Crosslingual Word-in-Context Representations
Yuki Arase
Tomoyuki Kajiwara
68
0
0
13 Sep 2024
A BERT-Based Summarization approach for depression detection
Hossein Salahshoor Gavalan
Mohmmad Naim Rastgoo
Bahareh Nakisa
63
2
0
13 Sep 2024
LLM-Enhanced Software Patch Localization
Jinhong Yu
Yi Chen
Di Tang
Xiaozhong Liu
Wenyuan Xu
Chen Wu
Haixu Tang
AAML
64
2
0
10 Sep 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
248
32
0
10 Sep 2024
Interactive Machine Teaching by Labeling Rules and Instances
Giannis Karamanolakis
Daniel J. Hsu
Luis Gravano
73
1
0
08 Sep 2024
Do We Trust What They Say or What They Do? A Multimodal User Embedding Provides Personalized Explanations
Zhicheng Ren
Zhiping Xiao
Yizhou Sun
99
0
0
04 Sep 2024
Dreaming is All You Need
Mingze Ni
Wei Liu
53
0
0
03 Sep 2024
Pre-Trained Language Models for Keyphrase Prediction: A Review
Muhammad Umair
Tangina Sultana
Young-Koo Lee
80
4
0
02 Sep 2024
From Latent to Engine Manifolds: Analyzing ImageBind's Multimodal Embedding Space
Andrew Hamara
Pablo Rivas
59
1
0
30 Aug 2024
Great Memory, Shallow Reasoning: Limits of kNN-LMs
Shangyi Geng
Wenting Zhao
Alexander M. Rush
RALM
ReLM
LRM
93
2
0
21 Aug 2024
Recurrent Neural Networks Learn to Store and Generate Sequences using Non-Linear Representations
Róbert Csordás
Christopher Potts
Christopher D. Manning
Atticus Geiger
GAN
82
21
0
20 Aug 2024
Acquiring Bidirectionality via Large and Small Language Models
Takumi Goto
Hiroyoshi Nagao
Yuta Koreeda
56
0
0
19 Aug 2024
Paired Completion: Flexible Quantification of Issue-framing at Scale with LLMs
Simon D Angus
Lachlan O’Neill
31
0
0
19 Aug 2024
PhysBERT: A Text Embedding Model for Physics Scientific Literature
Thorsten Hellert
Joao Montenegro
Andrea Pollastro
PINN
AI4CE
71
4
0
18 Aug 2024
Exploring Retrieval Augmented Generation in Arabic
S. El-Beltagy
Mohamed A. Abdallah
RALM
85
4
0
14 Aug 2024
DataVisT5: A Pre-trained Language Model for Jointly Understanding Text and Data Visualization
Zhuoyue Wan
Yuanfeng Song
Shuaimin Li
Chen Jason Zhang
Raymond Chi-Wing Wong
VLM
74
1
0
14 Aug 2024
BERT's Conceptual Cartography: Mapping the Landscapes of Meaning
Nina Haket
Ryan Daniels
57
0
0
13 Aug 2024
Knowledge Probing for Graph Representation Learning
Mingyu Zhao
Xingyu Huang
Ziyu Lyu
Yanlin Wang
Lixin Cui
Lu Bai
26
0
0
07 Aug 2024
Why transformers are obviously good models of language
Felix Hill
58
1
0
07 Aug 2024
Local Topology Measures of Contextual Language Model Latent Spaces With Applications to Dialogue Term Extraction
Benjamin Matthias Ruppik
Michael Heck
Carel van Niekerk
Renato Vukovic
Hsien-chin Lin
Shutong Feng
Marcus Zibrowius
Milica Gašić
87
3
0
07 Aug 2024
Disentangling Dense Embeddings with Sparse Autoencoders
Charles O'Neill
Christine Ye
K. Iyer
John F. Wu
58
7
0
01 Aug 2024
Maverick: Efficient and Accurate Coreference Resolution Defying Recent Trends
Giuliano Martinelli
Martin Larsson
Johannes Wiesel
88
10
0
31 Jul 2024
Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review
Tongyue Shi
Jun Ma
Zihan Yu
Haowei Xu
Minqi Xiong
Meirong Xiao
Yilin Li
Huiying Zhao
Guilan Kong
74
2
0
27 Jul 2024
Fairness Definitions in Language Models Explained
Thang Viet Doan
Zhibo Chu
Zichong Wang
Wenbin Zhang
ALM
108
10
0
26 Jul 2024
Unified Lexical Representation for Interpretable Visual-Language Alignment
Yifan Li
Yikai Wang
Yanwei Fu
Dongyu Ru
Zheng Zhang
Tong He
VLM
59
4
0
25 Jul 2024
NarrationDep: Narratives on Social Media For Automatic Depression Detection
Hamad Zogan
Imran Razzak
Shoaib Jameel
Guandong Xu
39
0
0
24 Jul 2024
Analyzing Polysemy Evolution Using Semantic Cells
Yukio Ohsawa
Dingming Xue
Kaira Sekiguchi
42
0
0
23 Jul 2024