Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.17744
Cited By
Following Length Constraints in Instructions
25 June 2024
Weizhe Yuan
Ilia Kulikov
Ping Yu
Kyunghyun Cho
Sainbayar Sukhbaatar
Jason Weston
Jing Xu
FaML
ALM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Following Length Constraints in Instructions"
15 / 15 papers shown
Title
CIE: Controlling Language Model Text Generations Using Continuous Signals
Vinay Samuel
Harshita Diddee
Yiming Zhang
Daphne Ippolito
14
0
0
19 May 2025
Making Small Language Models Efficient Reasoners: Intervention, Supervision, Reinforcement
Xuechen Zhang
Zijian Huang
Chenshun Ni
Ziyang Xiong
Jiacheng Chen
Samet Oymak
ReLM
LRM
45
0
0
12 May 2025
Scalable Chain of Thoughts via Elastic Reasoning
Yuhui Xu
Hanze Dong
Lei Wang
Doyen Sahoo
Junnan Li
Caiming Xiong
OffRL
LRM
59
3
0
08 May 2025
Reasoning Beyond Limits: Advances and Open Problems for LLMs
M. Ferrag
Norbert Tihanyi
Merouane Debbah
ELM
OffRL
LRM
AI4CE
214
3
0
26 Mar 2025
AMPO: Active Multi-Preference Optimization
Taneesh Gupta
Rahul Madhavan
Xuchao Zhang
Chetan Bansal
Saravan Rajmohan
55
0
0
25 Feb 2025
Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following
Jie Zeng
Qianyu He
Qingyu Ren
Jiaqing Liang
Yanghua Xiao
Weikang Zhou
Zeye Sun
Fei Yu
86
1
0
24 Feb 2025
Zero-Shot Strategies for Length-Controllable Summarization
Fabian Retkowski
A. Waibel
57
3
0
31 Dec 2024
Prompting and Fine-Tuning of Small LLMs for Length-Controllable Telephone Call Summarization
David Thulke
Yingbo Gao
Rricha Jalota
Christian Dugast
Hermann Ney
29
3
0
24 Oct 2024
Thinking LLMs: General Instruction Following with Thought Generation
Tianhao Wu
Janice Lan
Weizhe Yuan
Jiantao Jiao
Jason Weston
Sainbayar Sukhbaatar
LRM
30
16
0
14 Oct 2024
Self-controller: Controlling LLMs with Multi-round Step-by-step Self-awareness
Xiao Peng
Xufan Geng
LLMAG
29
0
0
01 Oct 2024
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Yushi Bai
Jiajie Zhang
Xin Lv
Linzhi Zheng
Siqi Zhu
Lei Hou
Yuxiao Dong
Jie Tang
Juanzi Li
VGen
LLMAG
ALM
42
41
0
13 Aug 2024
Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge
Tianhao Wu
Weizhe Yuan
O. Yu. Golovneva
Jing Xu
Yuandong Tian
Jiantao Jiao
Jason Weston
Sainbayar Sukhbaatar
ALM
KELM
LRM
64
74
0
28 Jul 2024
ODIN: Disentangled Reward Mitigates Hacking in RLHF
Lichang Chen
Chen Zhu
Davit Soselia
Jiuhai Chen
Dinesh Manocha
Tom Goldstein
Heng-Chiao Huang
M. Shoeybi
Bryan Catanzaro
AAML
50
54
0
11 Feb 2024
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
Hao Zhao
Maksym Andriushchenko
Francesco Croce
Nicolas Flammarion
ALM
97
44
0
07 Feb 2024
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
384
12,081
0
04 Mar 2022
1