Analyzing Encoded Concepts in Transformer Language Models

Analyzing Encoded Concepts in Transformer Language Models

27 June 2022

Firoj Alam

Papers citing "Analyzing Encoded Concepts in Transformer Language Models"

11 / 11 papers shown

Title
I Predict Therefore I Am: Is Next Token Prediction Enough to Learn Human-Interpretable Concepts from Data? Yuhang Liu Dong Gong Erdun Gao Zhen Zhang Zhen Zhang Biwei Huang Anton van den Hengel Anton van den Hengel Javen Qinfeng Shi 157 0 0 12 Mar 2025
From Tokens to Words: On the Inner Lexicon of LLMs Guy Kaplan Matanel Oren Yuval Reif Roy Schwartz 48 12 0 08 Oct 2024
Adversarial Attacks on Parts of Speech: An Empirical Study in Text-to-Image Generation G M Shahariar Jia Chen Jiachen Li Yue Dong 29 0 0 21 Sep 2024
On Behalf of the Stakeholders: Trends in NLP Model Interpretability in the Era of LLMs Nitay Calderon Roi Reichart 40 10 0 27 Jul 2024
Towards a Path Dependent Account of Category Fluency David Heineman Reba Koenen Sashank Varma 29 0 0 09 May 2024
Towards Concept-Aware Large Language Models Chen Shani Jilles Vreeken Dafna Shahaf LRM 22 6 0 03 Nov 2023
ConceptX: A Framework for Latent Concept Analysis Firoj Alam Fahim Dalvi Nadir Durrani Hassan Sajjad A. Khan Jia Xu 22 5 0 12 Nov 2022
On the Transformation of Latent Space in Fine-Tuned NLP Models Nadir Durrani Hassan Sajjad Fahim Dalvi Firoj Alam 32 18 0 23 Oct 2022
Neuron-level Interpretation of Deep NLP Models: A Survey Hassan Sajjad Nadir Durrani Fahim Dalvi MILM AI4CE 35 80 0 30 Aug 2021
Similarity Analysis of Contextual Word Representation Models John M. Wu Yonatan Belinkov Hassan Sajjad Nadir Durrani Fahim Dalvi James R. Glass 51 73 0 03 May 2020
What you can cram into a single vector: Probing sentence embeddings for linguistic properties Alexis Conneau Germán Kruszewski Guillaume Lample Loïc Barrault Marco Baroni 201 882 0 03 May 2018