2104.08773
Cited By
Cross-Task Generalization via Natural Language Crowdsourcing Instructions
18 April 2021
Swaroop Mishra
Daniel Khashabi
Chitta Baral
Hannaneh Hajishirzi
LRM
Papers citing
"Cross-Task Generalization via Natural Language Crowdsourcing Instructions"
50 / 562 papers shown
Title
Simultaneous Reward Distillation and Preference Learning: Get You a Language Model Who Can Do Both
Abhijnan Nath
Changsoo Jung
Ethan Seefried
Nikhil Krishnaswamy
176
1
0
11 Oct 2024
Controllable Safety Alignment: Inference-Time Adaptation to Diverse Safety Requirements
Jingyu Zhang
Ahmed Elgohary
Ahmed Magooda
Daniel Khashabi
Benjamin Van Durme
174
2
0
11 Oct 2024
Packing Analysis: Packing Is More Appropriate for Large Models or Datasets in Supervised Fine-tuning
Shuhe Wang
Guoyin Wang
Yucheng Wang
Jiwei Li
Eduard H. Hovy
Chen Guo
37
4
0
10 Oct 2024
StablePrompt: Automatic Prompt Tuning using Reinforcement Learning for Large Language Models
Minchan Kwon
Gaeun Kim
Jongsuk Kim
Haeil Lee
Junmo Kim
OffRL
LRM
LLMAG
26
2
0
10 Oct 2024
Uncovering Factor Level Preferences to Improve Human-Model Alignment
Juhyun Oh
Eunsu Kim
Jiseon Kim
Wenda Xu
Inha Cha
William Yang Wang
Alice Oh
34
0
0
09 Oct 2024
Weak-eval-Strong: Evaluating and Eliciting Lateral Thinking of LLMs with Situation Puzzles
Qi Chen
Bowen Zhang
Gang Wang
Qi Wu
ReLM
LRM
42
3
0
09 Oct 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
Thomas Palmeira Ferraz
Kartik Mehta
Yu-Hsiang Lin
Haw-Shiuan Chang
Shereen Oraby
Sijia Liu
Vivek Subramanian
Tagyoung Chung
Mohit Bansal
Nanyun Peng
56
8
0
09 Oct 2024
On Instruction-Finetuning Neural Machine Translation Models
Vikas Raunak
Roman Grundkiewicz
Marcin Junczys-Dowmunt
33
1
0
07 Oct 2024
Cookbook: A framework for improving LLM generative abilities via programmatic data generating templates
A. Narayan
Mayee F. Chen
Kush S. Bhatia
Christopher Ré
SyDa
41
3
0
07 Oct 2024
Compression via Pre-trained Transformers: A Study on Byte-Level Multimodal Data
David Heurtel-Depeiges
Anian Ruoss
Joel Veness
Tim Genewein
43
1
0
07 Oct 2024
TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation
Jonathan Cook
Tim Rocktäschel
Jakob Foerster
Dennis Aumiller
Alex Wang
ALM
37
10
0
04 Oct 2024
ProcBench: Benchmark for Multi-Step Reasoning and Following Procedure
Ippei Fujisawa
Sensho Nobe
Hiroki Seto
Rina Onda
Yoshiaki Uchida
Hiroki Ikoma
Pei-Chun Chien
Ryota Kanai
LRM
44
3
0
04 Oct 2024
Mitigating Backdoor Threats to Large Language Models: Advancement and Challenges
Qin Liu
Wenjie Mo
Terry Tong
Lyne Tchapmi
Fei Wang
Chaowei Xiao
Muhao Chen
AAML
39
4
0
30 Sep 2024
Show and Guide: Instructional-Plan Grounded Vision and Language Model
Diogo Glória-Silva
David Semedo
João Magalhães
26
0
0
27 Sep 2024
From Linguistic Giants to Sensory Maestros: A Survey on Cross-Modal Reasoning with Large Language Models
Shengsheng Qian
Zuyi Zhou
Dizhan Xue
Bing Wang
Changsheng Xu
LRM
39
1
0
19 Sep 2024
Multi-Document Grounded Multi-Turn Synthetic Dialog Generation
Young-Suk Lee
Chulaka Gunasekara
Danish Contractor
Ramón Fernandez Astudillo
Radu Florian
32
1
0
17 Sep 2024
LLM-as-a-Judge & Reward Model: What They Can and Cannot Do
Guijin Son
Hyunwoo Ko
Hoyoung Lee
Yewon Kim
Seunghyeok Hong
ALM
ELM
54
7
0
17 Sep 2024
RNR: Teaching Large Language Models to Follow Roles and Rules
Kuan-Chieh Jackson Wang
Alexander Bukharin
Haoming Jiang
Qingyu Yin
Zhengyang Wang
...
Chao Zhang
Bing Yin
Xian Li
Jianshu Chen
Shiyang Li
ALM
26
1
0
10 Sep 2024
Selective Self-Rehearsal: A Fine-Tuning Approach to Improve Generalization in Large Language Models
Sonam Gupta
Yatin Nandwani
Asaf Yehudai
Mayank Mishra
Gaurav Pandey
Dinesh Raghu
Sachindra Joshi
LRM
25
1
0
07 Sep 2024
End User Authoring of Personalized Content Classifiers: Comparing Example Labeling, Rule Writing, and LLM Prompting
Leijie Wang
Kathryn Yurechko
Pranati Dani
Quan Ze Chen
Amy X. Zhang
50
3
0
05 Sep 2024
Quality or Quantity? On Data Scale and Diversity in Adapting Large Language Models for Low-Resource Translation
Vivek Iyer
Bhavitvya Malik
Pavel Stepachev
Pinzhen Chen
Barry Haddow
Alexandra Birch
ALM
34
3
0
23 Aug 2024
EvalYaks: Instruction Tuning Datasets and LoRA Fine-tuned Models for Automated Scoring of CEFR B2 Speaking Assessment Transcripts
Nicy Scaria
Silvester John Joseph Kennedy
Thomas Latinovich
Deepak N. Subramani
29
0
0
22 Aug 2024
REInstruct: Building Instruction Data from Unlabeled Corpus
Shu Chen
Xinyan Guan
Yaojie Lu
Hongyu Lin
Xianpei Han
Le Sun
ALM
SyDa
27
2
0
20 Aug 2024
CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs
Yassine Ouali
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
VLM
MLLM
40
18
0
19 Aug 2024
Creating Arabic LLM Prompts at Scale
Abdelrahman El-Sheikh
Ahmed Elmogtaba
Kareem Darwish
Muhammad Morsy Elmallah
Ashraf Elneima
Hassan Sawaf
LRM
19
0
0
12 Aug 2024
P3: A Policy-Driven, Pace-Adaptive, and Diversity-Promoted Framework for Optimizing LLM Training
Yingxuan Yang
Huayi Wang
Muning Wen
Weinan Zhang
49
0
0
10 Aug 2024
Better Alignment with Instruction Back-and-Forth Translation
Thao Nguyen
Jeffrey Li
Sewoong Oh
Ludwig Schmidt
Jason Weston
Luke Zettlemoyer
Xian Li
SyDa
38
6
0
08 Aug 2024
Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models
Zhi Rui Tam
Cheng-Kuang Wu
Yi-Lin Tsai
Chieh-Yen Lin
Hung-yi Lee
Yun-Nung Chen
32
24
0
05 Aug 2024
FANNO: Augmenting High-Quality Instruction Data with Open-Sourced LLMs Only
He Zhu
Junyou Su
Tianle Lun
Yicheng Tao
Wenjia Zhang
Zipei Fan
Guanhua Chen
ALM
37
2
0
02 Aug 2024
PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized Language Prompting
Liam Hebert
Krishna Sayana
Ambarish Jash
Alexandros Karatzoglou
Geordie Williamson
Sumanth Doddapaneni
Yanli Cai
Dima Kuzmin
36
3
0
02 Aug 2024
CFBench: A Comprehensive Constraints-Following Benchmark for LLMs
Leo Micklem
Yan-Bin Shen
Wenjing Luo
Yan Zhang
Hao Liang
...
Weipeng Chen
Bin Cui
Blair Thornton
Wentao Zhang
Zenan Zhou
ELM
84
16
0
02 Aug 2024
Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs
Shiping Liu
Kecheng Zheng
Wei Chen
MLLM
52
34
0
31 Jul 2024
Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models
Zhengxuan Wu
Yuhao Zhang
Linquan Wei
Yumo Xu
Rujun Han
Yi Liu
Jifan Chen
Bonan Min
Zhiheng Huang
33
0
0
31 Jul 2024
CollectiveSFT: Scaling Large Language Models for Chinese Medical Benchmark with Collective Instructions in Healthcare
Jingwei Zhu
Minghuan Tan
Min Yang
Ruixue Li
Hamid Alinejad-Rokny
ALM
LM&MA
38
0
0
29 Jul 2024
The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs
Aleix Sant
Carlos Escolano
Audrey Mash
Francesca de Luca Fornaciari
Maite Melero
36
4
0
26 Jul 2024
Do Large Language Models Have Compositional Ability? An Investigation into Limitations and Scalability
Zhuoyan Xu
Zhenmei Shi
Yingyu Liang
CoGe
LRM
40
27
0
22 Jul 2024
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL
Yunseon Choi
Sangmin Bae
Seonghyun Ban
Minchan Jeong
Chuheng Zhang
Lei Song
Li Zhao
Jiang Bian
Kee-Eung Kim
VLM
AAML
36
3
0
20 Jul 2024
Learning-From-Mistakes Prompting for Indigenous Language Translation
You-Cheng Liao
Chen-Jui Yu
Chi-Yi Lin
He-Feng Yun
Yen-Hsiang Wang
Hsiao-Min Li
Yao-Chung Fan
44
1
0
18 Jul 2024
Retrieval-Enhanced Machine Learning: Synthesis and Opportunities
To Eun Kim
Alireza Salemi
Andrew Drozdov
Fernando Diaz
Hamed Zamani
56
7
0
17 Jul 2024
Analyzing the Generalization and Reliability of Steering Vectors
Daniel Tan
David Chanin
Aengus Lynch
Dimitrios Kanoulas
Brooks Paige
Adrià Garriga-Alonso
Robert Kirk
LLMSV
84
17
0
17 Jul 2024
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning
Chenyang Zhao
Xueying Jia
Vijay Viswanathan
Tongshuang Wu
Graham Neubig
SyDa
ALM
53
25
0
16 Jul 2024
Mix-CPT: A Domain Adaptation Framework via Decoupling Knowledge Learning and Format Alignment
Jinhao Jiang
Junyi Li
Wayne Xin Zhao
Yang Song
Tao Zhang
Ji-Rong Wen
CLL
41
3
0
15 Jul 2024
sPhinX: Sample Efficient Multilingual Instruction Fine-Tuning Through N-shot Guided Prompting
Sanchit Ahuja
Kumar Tanmay
Hardik Hansrajbhai Chauhan
Barun Patra
Kriti Aggarwal
...
Tejas I. Dhamecha
Ahmed Awadallah
Monojit Choudhary
Vishrav Chaudhary
Sunayana Sitaram
32
3
0
13 Jul 2024
Language-Augmented Symbolic Planner for Open-World Task Planning
Guanqi Chen
Lei Yang
Ruixing Jia
Zhe Hu
Yizhou Chen
Wei Zhang
Wenping Wang
Jia Pan
LM&Ro
LLMAG
48
8
0
13 Jul 2024
Beyond Instruction Following: Evaluating Inferential Rule Following of Large Language Models
Wangtao Sun
Chenxiang Zhang
Xueyou Zhang
Ziyang Huang
Haotian Xu
Pei Chen
Shizhu He
Jun Zhao
Kang Liu
ELM
LRM
38
5
0
11 Jul 2024
LIONs: An Empirically Optimized Approach to Align Language Models
Xiao Yu
Qingyang Wu
Yu Li
Zhou Yu
ALM
40
3
0
09 Jul 2024
AI Safety in Generative AI Large Language Models: A Survey
Jaymari Chua
Yun Yvonna Li
Shiyi Yang
Chen Wang
Lina Yao
LM&MA
42
12
0
06 Jul 2024
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild
Ahmed Masry
Megh Thakkar
Aayush Bajaj
Aaryaman Kartha
Enamul Hoque
Chenyu You
VLM
44
26
0
04 Jul 2024
MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
Yuyan Chen
Zhihao Wen
Ge Fan
Zhengyu Chen
Wei Wu
Dayiheng Liu
Zhixu Li
Bang Liu
Yanghua Xiao
41
18
0
04 Jul 2024
GemmAr: Enhancing LLMs Through Arabic Instruction-Tuning
Hasna Chouikhi
Manel Aloui
Cyrine Ben Hammou
Ghaith Chaabane
Haithem Kchaou
Chehir Dhaouadi
44
0
0
02 Jul 2024