Large Language Models Struggle to Learn Long-Tail Knowledge

15 November 2022
Nikhil Kandpal
H. Deng
Adam Roberts
Eric Wallace
Colin Raffel
    RALM KELM

Papers citing "Large Language Models Struggle to Learn Long-Tail Knowledge"

50 / 260 papers shown
ComparisonQA: Evaluating Factuality Robustness of LLMs Through Knowledge Frequency Control and Uncertainty
Qing Zong
Zhaoxiang Wang
Tianshi Zheng
Xiyu Ren
Yangqiu Song
155
3
0
28 Dec 2024
How Do Artificial Intelligences Think? The Three Mathematico-Cognitive Factors of Categorical Segmentation Operated by Synthetic Neurons
Michael Pichat
William Pogrund
Armanush Gasparian
Paloma Pichat
Samuel Demarchi
Michael Veillet-Guillem
120
3
0
26 Dec 2024
COBRA: COmBinatorial Retrieval Augmentation for Few-Shot Adaptation
Arnav M. Das
Gantavya Bhatt
Lilly Kumari
Sahil Verma
J. Bilmes
94
0
0
23 Dec 2024
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
Tianyi Li
Erenay Dayanik
Shubhi Tyagi
Andrea Pierleoni
HILM
122
0
0
10 Dec 2024
RAG-based Question Answering over Heterogeneous Data and Text
Philipp Christmann
Gerhard Weikum
LMTD RALM
155
5
0
10 Dec 2024
Information Anxiety in Large Language Models
Prasoon Bajpai
Sarah Masud
Tanmoy Chakraborty
69
0
0
16 Nov 2024
Probing LLM Hallucination from Within: Perturbation-Driven Approach via Internal Knowledge
Seongmin Lee
Hsiang Hsu
Chun-Fu Chen
Duen Horng Chau
LRM
101
2
0
14 Nov 2024
Continual Memorization of Factoids in Language Models
Howard Chen
Jiayi Geng
Adithya Bhaskar
Dan Friedman
Danqi Chen
KELM
129
1
0
11 Nov 2024
Fine-Grained Guidance for Retrievers: Leveraging LLMs' Feedback in Retrieval-Augmented Generation
Yuhang Liu
Xueyu Hu
Shengyu Zhang
Jingyuan Chen
Fan Wu
Leilei Gan
RALM
40
0
0
06 Nov 2024
Latent Paraphrasing: Perturbation on Layers Improves Knowledge Injection in Language Models
Minki Kang
Sung Ju Hwang
Gibbeum Lee
Jaewoong Cho
KELM
95
0
0
01 Nov 2024
Human-inspired Perspectives: A Survey on AI Long-term Memory
Zihong He
Weizhe Lin
Hao Zheng
Fan Zhang
Matt Jones
Laurence Aitchison
X. Xu
Miao Liu
Per Ola Kristensson
Junxiao Shen
246
3
0
01 Nov 2024
CurateGPT: A flexible language-model assisted biocuration tool
Harry Caufield
Carlo Kroll
Shawn T O’Neil
Justin T Reese
Marcin P. Joachimiak
...
James A McLaughlin
Damian Smedley
M. Haendel
Peter N. Robinson
Christopher J. Mungall
SyDa
111
4
0
29 Oct 2024
A Novel Psychometrics-Based Approach to Developing Professional Competency Benchmark for Large Language Models
Elena Kardanova
Alina Ivanova
Ksenia Tarasova
Taras Pashchenko
Aleksei Tikhoniuk
Elen Yusupova
Anatoly Kasprzhak
Yaroslav Kuzminov
Ekaterina Kruchinskaia
Irina Brun
125
1
0
29 Oct 2024
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications
Monica Riedler
Stefan Langer
VLM
85
18
0
29 Oct 2024
Exploring Local Memorization in Diffusion Models via Bright Ending Attention
Chong Chen
Daochang Liu
M. Shah
Chang Xu
156
4
0
29 Oct 2024
Belief in the Machine: Investigating Epistemological Blind Spots of Language Models
Mirac Suzgun
Tayfun Gur
Federico Bianchi
Daniel E. Ho
Thomas Icard
Dan Jurafsky
James Zou
97
4
0
28 Oct 2024
Mask-based Membership Inference Attacks for Retrieval-Augmented Generation
Mingrui Liu
Sixiao Zhang
Cheng Long
AAML
158
4
0
26 Oct 2024
DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations
Aryo Pradipta Gema
Chen Jin
Ahmed Abdulaal
Tom Diethe
Philip Teare
Beatrice Alex
Pasquale Minervini
Amrutha Saseendran
92
6
0
24 Oct 2024
LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering
Qingfei Zhao
Ruobing Wang
Yukuo Cen
Daren Zha
Shicheng Tan
Yuxiao Dong
Jie Tang
RALM
72
14
0
23 Oct 2024
Scalable Influence and Fact Tracing for Large Language Model Pretraining
Tyler A. Chang
Dheeraj Rajagopal
Tolga Bolukbasi
Lucas Dixon
Ian Tenney
TDI
94
5
0
22 Oct 2024
Fact Recall, Heuristics or Pure Guesswork? Precise Interpretations of Language Models for Fact Completion
Denitsa Saynova
Lovisa Hagström
Moa Johansson
Richard Johansson
Marco Kuhlmann
HILM
123
1
0
18 Oct 2024
RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards
Xinze Li
Sen Mei
Zhenghao Liu
Yukun Yan
Shuo Wang
...
Haotian Chen
Ge Yu
Zhiyuan Liu
Maosong Sun
Chenyan Xiong
108
12
0
17 Oct 2024
Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models
Jiatao Li
Xinyu Hu
Xunjian Yin
Xiaojun Wan
RALM
133
0
0
17 Oct 2024
Telco-DPR: A Hybrid Dataset for Evaluating Retrieval Models of 3GPP Technical Specifications
Thaina Saraiva
Marco Sousa
Pedro Vieira
António Rodrigues
97
1
0
15 Oct 2024
A Multi-LLM Orchestration Engine for Personalized, Context-Rich Assistance
Sumedh Rasal
61
0
0
13 Oct 2024
NoVo: Norm Voting off Hallucinations with Attention Heads in Large Language Models
Zheng Yi Ho
Siyuan Liang
Sen Zhang
Yibing Zhan
Dacheng Tao
69
2
0
11 Oct 2024
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning
Tingchen Fu
Mrinank Sharma
Philip Torr
Shay B. Cohen
David M. Krueger
Fazl Barez
AAML
129
8
0
11 Oct 2024
Large Language Models in Qualitative Research: Can We Do the Data Justice?
Hope Schroeder
Marianne Aubin Le Quéré
Casey Randazzo
David Mimno
Sarita Schoenebeck
41
4
0
09 Oct 2024
Deciphering the Interplay of Parametric and Non-parametric Memory in Retrieval-augmented Language Models
M. Farahani
Richard Johansson
RALM
100
2
0
07 Oct 2024
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
Yibo Yan
Shen Wang
Jiahao Huo
Hang Li
Yangqiu Song
...
Kun Wang
Hui Xiong
Philip S. Yu
Xuming Hu
Qingsong Wen
LRM
108
19
0
06 Oct 2024
Upsample or Upweight? Balanced Training on Heavily Imbalanced Datasets
Tianjian Li
Haoran Xu
Weiting Tan
Kenton Murray
Daniel Khashabi
155
1
0
06 Oct 2024
Reward-RAG: Enhancing RAG with Reward Driven Supervision
Thang Nguyen
Peter Chin
Yu-Wing Tai
RALM
130
5
0
03 Oct 2024
Mitigating Memorization In Language Models
Mansi Sakarvadia
Aswathy Ajith
Arham Khan
Nathaniel Hudson
Caleb Geniesse
Kyle Chard
Yaoqing Yang
Ian Foster
Michael W. Mahoney
KELM MU
130
2
0
03 Oct 2024
Quantifying Generalization Complexity for Large Language Models
Zhenting Qi
Hongyin Luo
Xuliang Huang
Zhuokai Zhao
Yibo Jiang
Xiangjun Fan
Himabindu Lakkaraju
James Glass
LRM ELM
89
7
0
02 Oct 2024
Knowledge-Driven Feature Selection and Engineering for Genotype Data with Large Language Models
Joseph Lee
Shu Yang
Jae Young Baik
Xiaoxi Liu
Zhen Tan
...
Zixuan Wen
Bojian Hou
D. Duong-Tran
Tianlong Chen
Li Shen
149
2
0
02 Oct 2024
Reasoning Elicitation in Language Models via Counterfactual Feedback
Alihan Hüyük
Xinnuo Xu
Jacqueline R. M. A. Maasch
Aditya V. Nori
Javier González
ReLM LRM
449
3
0
02 Oct 2024
Decoding Large-Language Models: A Systematic Overview of Socio-Technical Impacts, Constraints, and Emerging Questions
Zeyneb N. Kaya
Souvick Ghosh
55
0
0
25 Sep 2024
Controlling Risk of Retrieval-augmented Generation: A Counterfactual Prompting Framework
Lu Chen
Ruqing Zhang
Jiafeng Guo
Yixing Fan
Xueqi Cheng
56
5
0
24 Sep 2024
RACOON: An LLM-based Framework for Retrieval-Augmented Column Type Annotation with a Knowledge Graph
Linxi Wei
Guorui Xiao
Magdalena Balazinska
86
1
0
22 Sep 2024
Synthetic continued pretraining
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL SyDa
100
16
0
11 Sep 2024
Retrieval Augmented Correction of Named Entity Speech Recognition Errors
Ernest Pusateri
Anmol Walia
Anirudh Kashi
Bortik Bandyopadhyay
Nadia Hyder
Sayantan Mahinder
R. Anantha
Daben Liu
Sashank Gondala
RALM 3DV
125
5
0
09 Sep 2024
Pairing Analogy-Augmented Generation with Procedural Memory for Procedural Q&A
K Roth
Rushil Gupta
Simon Halle
Bang Liu
RALM
66
0
0
02 Sep 2024
Pandora's Box or Aladdin's Lamp: A Comprehensive Analysis Revealing the Role of RAG Noise in Large Language Models
Jinyang Wu
Feihu Che
Chuyuan Zhang
Mingkuan Feng
Shuai Zhang
Pengpeng Shao
Jianhua Tao
150
6
0
24 Aug 2024
Ancient Wisdom, Modern Tools: Exploring Retrieval-Augmented LLMs for Ancient Indian Philosophy
Priyanka Mandikal
RALM VLM
47
0
0
21 Aug 2024
Multilingual Needle in a Haystack: Investigating Long-Context Behavior of Multilingual Large Language Models
Amey Hengle
Prasoon Bajpai
Soham Dan
Tanmoy Chakraborty
LRM
63
4
0
19 Aug 2024
Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability
Jiri Hron
Laura J. Culp
Gamaleldin F. Elsayed
Rosanne Liu
Ben Adlam
...
T. Warkentin
Lechao Xiao
Kelvin Xu
Jasper Snoek
Simon Kornblith
58
1
0
14 Aug 2024
A Survey on Model MoErging: Recycling and Routing Among Specialized Experts for Collaborative Learning
Prateek Yadav
Colin Raffel
Mohammed Muqeeth
Lucas Caccia
Haokun Liu
Tianlong Chen
Joey Tianyi Zhou
Leshem Choshen
Alessandro Sordoni
MoMe
118
25
0
13 Aug 2024
KnowPO: Knowledge-aware Preference Optimization for Controllable Knowledge Selection in Retrieval-Augmented Language Models
Ruizhe Zhang
Yongxin Xu
Yuzhen Xiao
Runchuan Zhu
Xinke Jiang
Xu Chu
Junfeng Zhao
Yasha Wang
80
4
0
06 Aug 2024
Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons
Yifei Wang
Yuheng Chen
Wanting Wen
Yu Sheng
Linjing Li
D. Zeng
KELM
103
9
0
06 Aug 2024
Knowledge Prompting: How Knowledge Engineers Use Large Language Models
Elisavet Koutsiana
Johanna Walker
Michelle Nwachukwu
Albert Meroño-Peñuela
Elena Simperl
73
1
0
02 Aug 2024