ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.00512
  4. Cited By
How Contextual are Contextualized Word Representations? Comparing the
  Geometry of BERT, ELMo, and GPT-2 Embeddings

How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings

2 September 2019
Kawin Ethayarajh
ArXivPDFHTML

Papers citing "How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings"

21 / 21 papers shown
Title
Origin Tracer: A Method for Detecting LoRA Fine-Tuning Origins in LLMs
Origin Tracer: A Method for Detecting LoRA Fine-Tuning Origins in LLMs
Hongyu Liang
Yuting Zheng
Yihan Li
Yiran Zhang
Shiyu Liang
29
0
0
26 May 2025
Handling Symbolic Language in Student Texts: A Comparative Study of NLP Embedding Models
Handling Symbolic Language in Student Texts: A Comparative Study of NLP Embedding Models
Tom Bleckmann
Paul Tschisgale
112
0
0
23 May 2025
Do Language Models Use Their Depth Efficiently?
Do Language Models Use Their Depth Efficiently?
Róbert Csordás
Christopher D. Manning
Christopher Potts
84
0
0
20 May 2025
Outlier dimensions favor frequent tokens in language models
Outlier dimensions favor frequent tokens in language models
Iuri Macocco
Nora Graichen
Gemma Boleda
Marco Baroni
68
0
0
27 Mar 2025
Sentiment Analysis in SemEval: A Review of Sentiment Identification Approaches
Bousselham EL HADDAOUI
R. Chiheb
R. Faizi
A. E. Afia
75
0
0
13 Mar 2025
Evaluating Discourse Cohesion in Pre-trained Language Models
Jie He
Wanqiu Long
Deyi Xiong
ELM
123
2
0
08 Mar 2025
DReSD: Dense Retrieval for Speculative Decoding
DReSD: Dense Retrieval for Speculative Decoding
Milan Gritta
Huiyin Xue
Gerasimos Lampouras
RALM
133
0
0
21 Feb 2025
Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations
Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations
Yize Zhao
Tina Behnia
V. Vakilian
Christos Thrampoulidis
124
10
0
20 Feb 2025
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis
Chengyan Wu
Bolei Ma
Yang Liu
Zheyu Zhang
Ningyuan Deng
Yongqian Li
Baolan Chen
Yi Zhang
Yun Xue
Yun Xue
82
1
0
17 Feb 2025
Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation
Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation
Duc Hau Nguyen
Cyrielle Mallart
Guillaume Gravier
Pascale Sébillot
90
0
0
22 Jan 2025
The Geometry of Tokens in Internal Representations of Large Language Models
The Geometry of Tokens in Internal Representations of Large Language Models
Karthik Viswanathan
Yuri Gardinazzi
Giada Panerai
Alberto Cazzaniga
Matteo Biagetti
AIFin
115
6
0
17 Jan 2025
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search
Matan Ben-Tov
Mahmood Sharif
RALM
94
1
0
31 Dec 2024
Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction
Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction
Houjing Wei
Hakaze Cho
Yuting Shi
MLLM
74
0
0
01 Nov 2024
CAST: Corpus-Aware Self-similarity Enhanced Topic modelling
CAST: Corpus-Aware Self-similarity Enhanced Topic modelling
Yanan Ma
Chenghao Xiao
Chenhan Yuan
Sabine N van der Veer
Lamiece Hassan
Chenghua Lin
Goran Nenadic
62
0
0
19 Oct 2024
Token-based Decision Criteria Are Suboptimal in In-context Learning
Token-based Decision Criteria Are Suboptimal in In-context Learning
Hakaze Cho
Yoshihiro Sakai
Mariko Kato
Kenshiro Tanaka
Akira Ishii
Naoya Inoue
78
5
0
24 Jun 2024
Dissecting Query-Key Interaction in Vision Transformers
Dissecting Query-Key Interaction in Vision Transformers
Xu Pan
Aaron Philip
Ziqian Xie
Odelia Schwartz
69
1
0
04 Apr 2024
The Unreasonable Ineffectiveness of the Deeper Layers
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
76
93
0
26 Mar 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods
Hanlei Jin
Yang Zhang
Dan Meng
Jun Wang
Jinghua Tan
146
87
0
05 Mar 2024
PART: Pre-trained Authorship Representation Transformer
PART: Pre-trained Authorship Representation Transformer
Javier Huertas-Tato
Álvaro Huertas-García
Alejandro Martín
83
8
0
30 Sep 2022
What do you learn from context? Probing for sentence structure in
  contextualized word representations
What do you learn from context? Probing for sentence structure in contextualized word representations
Ian Tenney
Patrick Xia
Berlin Chen
Alex Jinpeng Wang
Adam Poliak
...
Najoung Kim
Benjamin Van Durme
Samuel R. Bowman
Dipanjan Das
Ellie Pavlick
159
853
0
15 May 2019
How transferable are features in deep neural networks?
How transferable are features in deep neural networks?
J. Yosinski
Jeff Clune
Yoshua Bengio
Hod Lipson
OOD
145
8,309
0
06 Nov 2014
1