2104.08773
Cited By
Cross-Task Generalization via Natural Language Crowdsourcing Instructions
18 April 2021
Swaroop Mishra
Daniel Khashabi
Chitta Baral
Hannaneh Hajishirzi
LRM
Papers citing
"Cross-Task Generalization via Natural Language Crowdsourcing Instructions"
50 / 562 papers shown
Title
Mechanistic Fine-tuning for In-context Learning
Hakaze Cho
Peng Luo
Mariko Kato
Rin Kaenbyou
Naoya Inoue
9
0
0
20 May 2025
What Prompts Don't Say: Understanding and Managing Underspecification in LLM Prompts
Chenyang Yang
Y. Shi
Qianou Ma
Michael Xieyang Liu
Christian Kastner
Tongshuang Wu
9
0
0
19 May 2025
Prompt Stability Matters: Evaluating and Optimizing Auto-Generated Prompt in General-Purpose Systems
Ke Chen
Yufei Zhou
Xitong Zhang
Haohan Wang
4
0
0
19 May 2025
Fast RoPE Attention: Combining the Polynomial Method and Fast Fourier Transform
Josh Alman
Zhao Song
15
0
0
17 May 2025
Do different prompting methods yield a common task representation in language models?
Guy Davidson
Todd M. Gureckis
Brenden M. Lake
Adina Williams
2
0
0
17 May 2025
WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models
Abdullah Mushtaq
Imran Taj
Rafay Naeem
Ibrahim Ghaznavi
Junaid Qadir
26
0
0
14 May 2025
REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback
Aniruddha Roy
Pretam Ray
Abhilash Nandy
Somak Aditya
Pawan Goyal
ALM
34
0
0
10 May 2025
Evaluating Vision Language Model Adaptations for Radiology Report Generation in Low-Resource Languages
Marco Salmè
R. Sicilia
Paolo Soda
V. Guarrasi
186
0
0
02 May 2025
Beyond the Last Answer: Your Reasoning Trace Uncovers More than You Think
Hasan Hammoud
Hani Itani
Guohao Li
ReLM
LRM
80
1
0
29 Apr 2025
Toward Generalizable Evaluation in the LLM Era: A Survey Beyond Benchmarks
Yixin Cao
Shibo Hong
Xuzhao Li
Jiahao Ying
Yubo Ma
...
Juanzi Li
Aixin Sun
Xuanjing Huang
Tat-Seng Chua
Tianwei Zhang
ALM
ELM
91
2
0
26 Apr 2025
A Post-trainer's Guide to Multilingual Training Data: Uncovering Cross-lingual Transfer Dynamics
Luísa Shimabucoro
Ahmet Üstün
Marzieh Fadaee
Sebastian Ruder
27
0
0
23 Apr 2025
Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction
Yuxin Jiang
Yufei Wang
Chuhan Wu
Xinyi Dai
Yan Xu
...
Yucheng Wang
Xin Jiang
Lifeng Shang
R. Tang
Wei Wang
36
0
0
22 Apr 2025
MAIN: Mutual Alignment Is Necessary for instruction tuning
Fanyi Yang
Jianfeng Liu
Xinsong Zhang
Haoyu Liu
Xixin Cao
Yuefeng Zhan
H. Sun
Weiwei Deng
Feng Sun
Qi Zhang
ALM
27
0
0
17 Apr 2025
Detecting Instruction Fine-tuning Attack on Language Models with Influence Function
Jiawei Li
TDI
AAML
47
0
0
12 Apr 2025
LSR-MCTS: Alleviating Long Range Dependency in Code Generation
Tingwei Lu
Yangning Li
Liyuan Wang
Binghuai Lin
Jiwei Tang
...
Wanshi Xu
Hai-Tao Zheng
Yinghui Li
Xin Su
Zifei Shan
LLMAG
75
0
0
10 Apr 2025
Towards LLMs Robustness to Changes in Prompt Format Styles
Lilian Ngweta
Kiran Kate
Jason Tsay
Sadhana Kumaravel
AAML
VLM
37
0
0
09 Apr 2025
Generative Linguistics, Large Language Models, and the Social Nature of Scientific Success
Sophie Hao
ELM
AI4CE
56
0
0
25 Mar 2025
Theoretical Foundation of Flow-Based Time Series Generation: Provable Approximation, Generalization, and Efficiency
Jiangxuan Long
Zhao Song
Chiwun Yang
AI4TS
204
0
0
18 Mar 2025
Through the Magnifying Glass: Adaptive Perception Magnification for Hallucination-Free VLM Decoding
Shunqi Mao
Chaoyi Zhang
Weidong Cai
MLLM
190
0
0
13 Mar 2025
Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows
Chengyue Gong
Xiaoyu Li
Yingyu Liang
Jiangxuan Long
Zhenmei Shi
Zhao Song
Yu Tian
56
3
0
12 Mar 2025
Position: Model Collapse Does Not Mean What You Think
Rylan Schaeffer
Joshua Kazdan
Alvan Caleb Arulandu
Sanmi Koyejo
73
0
0
05 Mar 2025
Effective LLM Knowledge Learning via Model Generalization
Mingkang Zhu
Xi Chen
Zihan Wang
Bei Yu
Hengshuang Zhao
Jiaya Jia
65
0
0
05 Mar 2025
LLM as GNN: Graph Vocabulary Learning for Text-Attributed Graph Foundation Models
Xi Zhu
Haochen Xue
Ziwei Zhao
Wujiang Xu
Jingyuan Huang
Minghao Guo
Qifan Wang
Kaixiong Zhou
Yongfeng Zhang
67
2
0
05 Mar 2025
Do GFlowNets Transfer? Case Study on the Game of 24/42
Adesh Gupta
Abhinav Kumar
Mansi Gupta
Paras Chopra
105
0
0
03 Mar 2025
Control Illusion: The Failure of Instruction Hierarchies in Large Language Models
Yilin Geng
Yiming Li
Honglin Mu
Xudong Han
Timothy Baldwin
Omri Abend
Eduard H. Hovy
Lea Frermann
41
2
0
21 Feb 2025
Synthesizing Post-Training Data for LLMs through Multi-Agent Simulation
Shuo Tang
Xianghe Pang
Zexi Liu
Bohan Tang
Guangyi Liu
Xiaowen Dong
Yanjie Wang
Yanfeng Wang
Tian Jin
SyDa
LLMAG
135
4
0
21 Feb 2025
Blessing of Multilinguality: A Systematic Analysis of Multilingual In-Context Learning
Yilei Tu
Andrew Xue
Freda Shi
49
0
0
17 Feb 2025
QExplorer: Large Language Model Based Query Extraction for Toxic Content Exploration
Shaola Ren
Li Ke
Longtao Huang
Dehong Gao
Hui Xue
41
0
0
06 Feb 2025
Evaluation of Large Language Models via Coupled Token Generation
N. C. Benz
Stratis Tsirtsis
Eleni Straitouri
Ivi Chatzi
Ander Artola Velasco
Suhas Thejaswi
Manuel Gomez Rodriguez
51
0
0
03 Feb 2025
Memory-Efficient Fine-Tuning of Transformers via Token Selection
Antoine Simoulin
Namyong Park
Xiaoyi Liu
Grey Yang
115
0
0
31 Jan 2025
FlanEC: Exploring Flan-T5 for Post-ASR Error Correction
Moreno La Quatra
Valerio Mario Salerno
Yu Tsao
Sabato Marco Siniscalchi
99
0
0
22 Jan 2025
Exploring Iterative Enhancement for Improving Learnersourced Multiple-Choice Question Explanations with Large Language Models
Qiming Bao
Juho Leinonen
A. Peng
Wanjun Zhong
Gaël Gendron
Tim Pistotti
Alice Huang
Paul Denny
Michael Witbrock
Jing Liu
AI4Ed
LRM
181
1
0
20 Jan 2025
Assessing and Enhancing the Robustness of Large Language Models with Task Structure Variations for Logical Reasoning
Qiming Bao
Gaël Gendron
A. Peng
Wanjun Zhong
N. Tan
Yang Chen
Michael Witbrock
Qingbin Liu
LRM
ELM
73
2
0
20 Jan 2025
LogLM: From Task-based to Instruction-based Automated Log Analysis
Yilun Liu
Yuhe Ji
Shimin Tao
Minggui He
Weibin Meng
Shenglin Zhang
Yongqian Sun
Yuming Xie
Boxing Chen
Hao Yang
47
2
0
10 Jan 2025
Using Instruction-Tuned Large Language Models to Identify Indicators of Vulnerability in Police Incident Narratives
Sam Relins
Daniel Birks
Charlie Lloyd
74
0
0
16 Dec 2024
Reinforcement Learning Enhanced LLMs: A Survey
Shuhe Wang
Shengyu Zhang
Jingyang Zhang
Runyi Hu
Xiaoya Li
Tianwei Zhang
Jiwei Li
Fei Wu
G. Wang
Eduard H. Hovy
OffRL
134
7
0
05 Dec 2024
SentiXRL: An advanced large language Model Framework for Multilingual Fine-Grained Emotion Classification in Complex Text Environment
Jie Wang
Yichen Wang
Zhilin Zhang
Jianhao Zeng
Kaidi Wang
Zhiyang Chen
72
0
0
27 Nov 2024
DELIFT: Data Efficient Language model Instruction Fine Tuning
Ishika Agarwal
Krishnateja Killamsetty
Lucian Popa
Marina Danilevsky
ALM
VLM
58
3
0
07 Nov 2024
A Bayesian Approach to Data Point Selection
Xinnuo Xu
Minyoung Kim
Royson Lee
Brais Martínez
Timothy M. Hospedales
35
0
0
06 Nov 2024
TODO: Enhancing LLM Alignment with Ternary Preferences
Yuxiang Guo
Lu Yin
Bo Jiang
Jiaqi Zhang
38
1
0
02 Nov 2024
Focus On This, Not That! Steering LLMs With Adaptive Feature Specification
Tom A. Lamb
Adam Davies
Alasdair Paren
Philip Torr
Francesco Pinto
52
0
0
30 Oct 2024
Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models
Zheng Zhao
Yftah Ziser
Shay B. Cohen
33
0
0
25 Oct 2024
Parameter-Efficient Fine-Tuning in Large Models: A Survey of Methodologies
Liwen Wang
Sheng Chen
Linnan Jiang
Shu Pan
Runze Cai
Sen Yang
Fei Yang
49
3
0
24 Oct 2024
MotionGlot: A Multi-Embodied Motion Generation Model
Sudarshan Harithas
Srinath Sridhar
82
1
0
22 Oct 2024
Compute-Constrained Data Selection
Junjie Oscar Yin
Alexander M. Rush
39
0
0
21 Oct 2024
Improving Instruction-Following in Language Models through Activation Steering
Alessandro Stolfo
Vidhisha Balachandran
Safoora Yousefi
Eric Horvitz
Besmira Nushi
LLMSV
64
17
0
15 Oct 2024
Bypassing the Exponential Dependency: Looped Transformers Efficiently Learn In-context by Multi-step Gradient Descent
Bo Chen
Xiaoyu Li
Yingyu Liang
Zhenmei Shi
Zhao Song
96
20
0
15 Oct 2024
WILT: A Multi-Turn, Memorization-Robust Inductive Logic Benchmark for LLMs
Eryk Banatt
Jonathan Cheng
Skanda Vaidyanath
Tiffany Hwu
LRM
36
0
0
14 Oct 2024
VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment
Lei Li
Zhihui Xie
Mukai Li
Shunian Chen
Peiyi Wang
L. Chen
Yazheng Yang
Benyou Wang
Lingpeng Kong
Qiang Liu
VLM
ALM
36
17
0
12 Oct 2024
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning
Tingchen Fu
Mrinank Sharma
Philip Torr
Shay B. Cohen
David M. Krueger
Fazl Barez
AAML
50
7
0
11 Oct 2024