2212.10560
Cited By
v1
v2 (latest)
Self-Instruct: Aligning Language Models with Self-Generated Instructions
20 December 2022
Yizhong Wang
Yeganeh Kordi
Swaroop Mishra
Alisa Liu
Noah A. Smith
Daniel Khashabi
Hannaneh Hajishirzi
ALM
SyDa
LRM
Github (4380★)
Papers citing
"Self-Instruct: Aligning Language Models with Self-Generated Instructions"
50 / 475 papers shown
Title
Better Language Model Inversion by Compactly Representing Next-Token Distributions
Murtaza Nazir
Matthew Finlayson
John X. Morris
Xiang Ren
Swabha Swayamdipta
20
0
0
20 Jun 2025
SGIC: A Self-Guided Iterative Calibration Framework for RAG
Guanhua Chen
Yutong Yao
Lidia S. Chao
Xuebo Liu
Derek F. Wong
25
0
0
19 Jun 2025
Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights
Zhiyuan Liang
Dongwen Tang
Yuhao Zhou
Xuanlei Zhao
Mingjia Shi
...
Damian Borth
Michael M. Bronstein
Yang You
Zhangyang Wang
Kai Wang
OffRL
23
0
0
19 Jun 2025
Revisiting Compositional Generalization Capability of Large Language Models Considering Instruction Following Ability
Yusuke Sakai
Hidetaka Kamigaito
Taro Watanabe
LRM
31
0
0
18 Jun 2025
Semantically-Aware Rewards for Open-Ended R1 Training in Free-Form Generation
Zongxia Li
Yapei Chang
Yuhang Zhou
Xiyang Wu
Zichao Liang
Yoo Yeon Sung
Jordan L. Boyd-Graber
22
0
0
18 Jun 2025
Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization
Guanghui Song
Dongping Liao
Yiren Zhao
Kejiang Ye
Cheng-zhong Xu
X. Gao
MoE
19
0
0
16 Jun 2025
Enhancing Goal-oriented Proactive Dialogue Systems via Consistency Reflection and Correction
Didi Zhang
Yaxin Fan
Peifeng Li
Qiaoming Zhu
11
0
0
16 Jun 2025
CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following
Yinghao Ma
Siyou Li
Juntao Yu
Emmanouil Benetos
Akira Maezawa
AuLLM
VLM
29
0
0
14 Jun 2025
Feedback Friction: LLMs Struggle to Fully Incorporate External Feedback
Dongwei Jiang
Alvin Zhang
Andrew Wang
Nicholas Andrews
Daniel Khashabi
LRM
27
0
0
13 Jun 2025
Self-Adapting Language Models
Adam Zweiger
Jyothish Pari
Han Guo
Ekin Akyürek
Yoon Kim
Pulkit Agrawal
KELM
LRM
127
0
0
12 Jun 2025
Chat-of-Thought: Collaborative Multi-Agent System for Generating Domain Specific Information
Christodoulos Constantinides
Shuxin Lin
Nianjun Zhou
Dhaval Patel
LLMAG
AI4CE
55
0
0
11 Jun 2025
Med-REFL: Medical Reasoning Enhancement via Self-Corrected Fine-grained Reflection
Zongxian Yang
Jiayu Qian
Zegao Peng
Haoyu Zhang
Z. Huang
LRM
16
0
0
11 Jun 2025
Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search
Linhao Yu
Xinguang Ji
Yahui Liu
Fanheng Kong
Chenxi Sun
Jingyuan Zhang
Hongzhi Zhang
Victoria A. Webster-Wood
Fuzheng Zhang
Deyi Xiong
23
0
0
11 Jun 2025
TaskCraft: Automated Generation of Agentic Tasks
Dingfeng Shi
Jingyi Cao
Qianben Chen
W. Sun
W. Li
...
Jiaheng Liu
Changwang Zhang
Jun Wang
Yuchen Eleanor Jiang
Wangchunshu Zhou
57
0
0
11 Jun 2025
TableDreamer: Progressive and Weakness-guided Data Synthesis from Scratch for Table Instruction Tuning
Mingyu Zheng
Zhifan Feng
Jia Wang
Lanrui Wang
Zheng Lin
Yang Hao
Weiping Wang
LMTD
53
0
0
10 Jun 2025
EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models
Tao Zou
Xinghua Zhang
Haiyang Yu
Minzheng Wang
Fei Huang
Yongbin Li
25
0
0
10 Jun 2025
LLM-as-a-qualitative-judge: automating error analysis in natural language generation
Nadezhda Chirkova
Tunde Oluwaseyi Ajayi
Seth Aycock
Zain Muhammad Mujahid
Vladana Perlić
Ekaterina Borisova
Markarit Vartampetian
ELM
30
0
0
10 Jun 2025
SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner
Lei Zhang
J. Yang
Min Yang
Jian Yang
Mouxiang Chen
Jiajun Zhang
Zeyu Cui
Binyuan Hui
Junyang Lin
47
0
0
10 Jun 2025
ScIRGen: Synthesize Realistic and Large-Scale RAG Dataset for Scientific Research
Junyong Lin
Lu Dai
Ruiqian Han
Yijie Sui
Ruilin Wang
Xingliang Sun
Qinglin Wu
Min Feng
Hao Liu
Hui Xiong
22
0
0
09 Jun 2025
PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems
Yi Huang
Wajih Ul Hassan
Yao Guo
Xiangqun Chen
Ding Li
61
0
0
06 Jun 2025
Being Strong Progressively! Enhancing Knowledge Distillation of Large Language Models through a Curriculum Learning Framework
Lingyuan Liu
Mengxiang Zhang
43
0
0
06 Jun 2025
RewardAnything: Generalizable Principle-Following Reward Models
Zhuohao Yu
Jiali Zeng
Weizheng Gu
Yidong Wang
Jindong Wang
Fandong Meng
Jie Zhou
Yue Zhang
Shikun Zhang
Wei Ye
LRM
109
1
0
04 Jun 2025
Bohdi: Heterogeneous LLM Fusion with Automatic Data Exploration
Junqi Gao
Zhichang Guo
Dazhi Zhang
Dong Li
Runze Liu
Pengfei Li
Kai Tian
Biqing Qi
26
0
0
04 Jun 2025
ConsistentChat: Building Skeleton-Guided Consistent Dialogues for Large Language Models from Scratch
Jiawei Chen
Xinyan Guan
Qianhao Yuan
Guozhao Mo
Weixiang Zhou
Yaojie Lu
Hongyu Lin
Ben He
Le Sun
Xianpei Han
ALM
LRM
81
0
0
04 Jun 2025
FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models
Yan Gao
Massimo Roberto Scamarcia
Javier Fernandez-Marques
Mohammad Naseri
Chong Shen Ng
...
Junyan Wang
Zheyuan Liu
Daniel J. Beutel
Lingjuan Lyu
Nicholas D. Lane
ALM
54
1
0
03 Jun 2025
T-SHIRT: Token-Selective Hierarchical Data Selection for Instruction Tuning
Yanjun Fu
Faisal Hamman
Sanghamitra Dutta
ALM
71
0
0
02 Jun 2025
Data Swarms: Optimizable Generation of Synthetic Evaluation Data
Shangbin Feng
Yike Wang
Weijia Shi
Yulia Tsvetkov
57
0
0
31 May 2025
When Large Multimodal Models Confront Evolving Knowledge: Challenges and Pathways
Kailin Jiang
Yuntao Du
Yukai Ding
Yuchen Ren
Ning Jiang
Zhi Gao
Zilong Zheng
Lei Liu
Bin Li
Qing Li
KELM
51
0
0
30 May 2025
Reason-SVG: Hybrid Reward RL for Aha-Moments in Vector Graphics Generation
Ximing Xing
Yandong Guan
Jing Zhang
Dong Xu
Qian Yu
LRM
75
0
0
30 May 2025
Tag-Evol: Achieving Efficient Instruction Evolving via Tag Injection
Yixuan Wang
Shiqi Zhou
Chuanzhe Guo
Qingfu Zhu
3DV
34
0
0
30 May 2025
The Hallucination Dilemma: Factuality-Aware Reinforcement Learning for Large Reasoning Models
Junyi Li
Hwee Tou Ng
OffRL
HILM
LRM
57
0
0
30 May 2025
Infinite-Instruct: Synthesizing Scaling Code instruction Data with Bidirectional Synthesis and Static Verification
Wenjing Xing
Wenke Lu
Yeheng Duan
Bing Zhao
Zhenghui Kang
Yaolong Wang
Kai Gao
Lei Qiao
SyDa
56
0
0
29 May 2025
Generating Diverse Training Samples for Relation Extraction with Large Language Models
Zexuan Li
Hongliang Dai
Piji Li
SyDa
27
0
0
29 May 2025
VisualSphinx: Large-Scale Synthetic Vision Logic Puzzles for RL
Yichen Feng
Zhangchen Xu
Fengqing Jiang
Yuetai Li
Bhaskar Ramasubramanian
Luyao Niu
Bill Yuchen Lin
Radha Poovendran
ReLM
LRM
12
0
0
29 May 2025
ConsRec: Denoising Sequential Recommendation through User-Consistent Preference Modeling
Haidong Xin
Qiushi Xiong
Zhenghao Liu
Sen Mei
Yukun Yan
Shi Yu
Shuo Wang
Yu Gu
Ge Yu
Chenyan Xiong
HAI
55
0
0
28 May 2025
What Has Been Lost with Synthetic Evaluation?
Alexander Gill
Abhilasha Ravichander
Ana Marasović
ELM
31
0
0
28 May 2025
A Tool for Generating Exceptional Behavior Tests With Large Language Models
Linghan Zhong
Samuel Yuan
Jiyang Zhang
Yu Liu
Pengyu Nie
Junyi Jessy Li
Miloš Gligorić
VLM
19
0
0
28 May 2025
Knowledge Base Construction for Knowledge-Augmented Text-to-SQL
Jinheon Baek
Horst Samulowitz
Oktie Hassanzadeh
D. Subramanian
Sola S. Shirai
A. Gliozzo
D. Bhattacharjya
37
0
0
28 May 2025
ArgInstruct: Specialized Instruction Fine-Tuning for Computational Argumentation
Maja Stahl
Timon Ziegenbein
Joonsuk Park
Henning Wachsmuth
ALM
LRM
36
0
0
28 May 2025
Advancing Expert Specialization for Better MoE
Hongcan Guo
Haolang Lu
Guoshun Nan
Bolun Chu
Jialin Zhuang
Yuan Yang
Wenhao Che
Sicong Leng
Qimei Cui
Xudong Jiang
MoE
MoMe
97
0
0
28 May 2025
STEER-BENCH: A Benchmark for Evaluating the Steerability of Large Language Models
Kai Chen
Zihao He
Taiwei Shi
Kristina Lerman
ALM
LLMSV
102
0
0
27 May 2025
Can Large Reasoning Models Self-Train?
Sheikh Shafayat
Fahim Tajwar
Ruslan Salakhutdinov
J. Schneider
Andrea Zanette
ReLM
OffRL
LRM
81
2
0
27 May 2025
Distilling Closed-Source LLM's Knowledge for Locally Stable and Economic Biomedical Entity Linking
Yihao Ai
Zhiyuan Ning
Weiwei Dai
P. Wang
Yi Du
Wenjuan Cui
Kunpeng Liu
Yuanchun Zhou
46
1
0
26 May 2025
Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models
Yi Liu
Dianqing Liu
Mingye Zhu
Junbo Guo
Yongdong Zhang
Zhendong Mao
102
0
0
26 May 2025
RECAST: Strengthening LLMs' Complex Instruction Following with Constraint-Verifiable Data
Wenhao Liu
Zhengkang Guo
Mingchen Xie
Jingwen Xu
Zisu Huang
...
Changze Lv
He-Da Wang
Hu Yao
Xiaoqing Zheng
Xuanjing Huang
181
0
0
25 May 2025
The Quest for Efficient Reasoning: A Data-Centric Benchmark to CoT Distillation
Ruichen Zhang
Rana Muhammad Shahroz Khan
Zhen Tan
Dawei Li
Song Wang
Tianlong Chen
LRM
53
0
0
24 May 2025
ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models
Hao Chen
Haoze Li
Zhiqing Xiao
Lirong Gao
Qi Zhang
Xiaomeng Hu
Ningtao Wang
Xing Fu
Junbo Zhao
206
0
0
24 May 2025
TAG-INSTRUCT: Controlled Instruction Complexity Enhancement through Structure-based Augmentation
He Zhu
Zhiwen Ruan
Junyou Su
Xingwei He
Wenjia Zhang
Yun-Nung Chen
Guanhua Chen
62
0
0
24 May 2025
QwenLong-CPRS: Towards ∞-LLMs with Dynamic Context Optimization
Weizhou Shen
Chenliang Li
Fanqi Wan
Shengyi Liao
Shaopeng Lai
...
Bin Yang
Ji Zhang
Fei Huang
Jingren Zhou
Ming Yan
49
1
0
23 May 2025
Dynamic Risk Assessments for Offensive Cybersecurity Agents
Boyi Wei
Benedikt Stroebl
Jiacen Xu
Joie Zhang
Zhou Li
Peter Henderson
84
0
0
23 May 2025