How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings

2 September 2019

Papers citing "How Contextual are Contextualized Word Representations? Comparing the Geometry of BERT, ELMo, and GPT-2 Embeddings"

Origin Tracer: A Method for Detecting LoRA Fine-Tuning Origins in LLMs Hongyu Liang Yuting Zheng Yihan Li Yiran Zhang Shiyu Liang 29 0 0 26 May 2025
Handling Symbolic Language in Student Texts: A Comparative Study of NLP Embedding Models Tom Bleckmann Paul Tschisgale 112 0 0 23 May 2025
Do Language Models Use Their Depth Efficiently? Róbert Csordás Christopher D. Manning Christopher Potts 84 0 0 20 May 2025
Outlier dimensions favor frequent tokens in language models Iuri Macocco Nora Graichen Gemma Boleda Marco Baroni 68 0 0 27 Mar 2025
Sentiment Analysis in SemEval: A Review of Sentiment Identification Approaches Bousselham EL HADDAOUI R. Chiheb R. Faizi A. E. Afia 75 0 0 13 Mar 2025
Evaluating Discourse Cohesion in Pre-trained Language Models Jie He Wanqiu Long Deyi Xiong ELM 123 2 0 08 Mar 2025
DReSD: Dense Retrieval for Speculative Decoding Milan Gritta Huiyin Xue Gerasimos Lampouras RALM 133 0 0 21 Feb 2025
Implicit Geometry of Next-token Prediction: From Language Sparsity Patterns to Model Representations Yize Zhao Tina Behnia V. Vakilian Christos Thrampoulidis 124 10 0 20 Feb 2025
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis Chengyan Wu Bolei Ma Yang Liu Zheyu Zhang Ningyuan Deng Yongqian Li Baolan Chen Yi Zhang Yun Xue 82 1 0 17 Feb 2025
Regularization, Semi-supervision, and Supervision for a Plausible Attention-Based Explanation Duc Hau Nguyen Cyrielle Mallart Guillaume Gravier Pascale Sébillot 90 0 0 22 Jan 2025
The Geometry of Tokens in Internal Representations of Large Language Models Karthik Viswanathan Yuri Gardinazzi Giada Panerai Alberto Cazzaniga Matteo Biagetti AIFin 115 6 0 17 Jan 2025
GASLITEing the Retrieval: Exploring Vulnerabilities in Dense Embedding-based Search Matan Ben-Tov Mahmood Sharif RALM 94 1 0 31 Dec 2024
Phase Diagram of Vision Large Language Models Inference: A Perspective from Interaction across Image and Instruction Houjing Wei Hakaze Cho Yuting Shi MLLM 74 0 0 01 Nov 2024
CAST: Corpus-Aware Self-similarity Enhanced Topic modelling Yanan Ma Chenghao Xiao Chenhan Yuan Sabine N van der Veer Lamiece Hassan Chenghua Lin Goran Nenadic 62 0 0 19 Oct 2024
Token-based Decision Criteria Are Suboptimal in In-context Learning Hakaze Cho Yoshihiro Sakai Mariko Kato Kenshiro Tanaka Akira Ishii Naoya Inoue 78 5 0 24 Jun 2024
Dissecting Query-Key Interaction in Vision Transformers Xu Pan Aaron Philip Ziqian Xie Odelia Schwartz 69 1 0 04 Apr 2024
The Unreasonable Ineffectiveness of the Deeper Layers Andrey Gromov Kushal Tirumala Hassan Shapourian Paolo Glorioso Daniel A. Roberts 76 93 0 26 Mar 2024
A Comprehensive Survey on Process-Oriented Automatic Text Summarization with Exploration of LLM-Based Methods Hanlei Jin Yang Zhang Dan Meng Jun Wang Jinghua Tan 146 87 0 05 Mar 2024
PART: Pre-trained Authorship Representation Transformer Javier Huertas-Tato Álvaro Huertas-García Alejandro Martín 83 8 0 30 Sep 2022
What do you learn from context? Probing for sentence structure in contextualized word representations Ian Tenney Patrick Xia Berlin Chen Alex Jinpeng Wang Adam Poliak ... Najoung Kim Benjamin Van Durme Samuel R. Bowman Dipanjan Das Ellie Pavlick 159 853 0 15 May 2019
How transferable are features in deep neural networks? J. Yosinski Jeff Clune Yoshua Bengio Hod Lipson OOD 145 8,309 0 06 Nov 2014