Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2402.01694
Cited By
ARGS: Alignment as Reward-Guided Search
23 January 2024
Maxim Khanov
Jirayu Burapacheep
Yixuan Li
Re-assign community
ArXiv (abs)
PDF
HTML
Github (40★)
Papers citing
"ARGS: Alignment as Reward-Guided Search"
50 / 113 papers shown
Title
Mitigating Memorization of Noisy Labels by Clipping the Model Prediction
Hongxin Wei
Huiping Zhuang
Renchunzi Xie
Lei Feng
Gang Niu
Bo An
Yixuan Li
VLM
NoLa
79
30
0
08 Dec 2022
Delving into Out-of-Distribution Detection with Vision-Language Representations
Yifei Ming
Ziyan Cai
Jiuxiang Gu
Yiyou Sun
W. Li
Yixuan Li
VLM
OODD
106
170
0
24 Nov 2022
Contrastive Decoding: Open-ended Text Generation as Optimization
Xiang Lisa Li
Ari Holtzman
Daniel Fried
Percy Liang
Jason Eisner
Tatsunori Hashimoto
Luke Zettlemoyer
M. Lewis
110
360
0
27 Oct 2022
Is Out-of-Distribution Detection Learnable?
Zhen Fang
Yixuan Li
Jie Lu
Jiahua Dong
Bo Han
Feng Liu
OODD
118
129
0
26 Oct 2022
Improving alignment of dialogue agents via targeted human judgements
Amelia Glaese
Nat McAleese
Maja Trkebacz
John Aslanides
Vlad Firoiu
...
John F. J. Mellor
Demis Hassabis
Koray Kavukcuoglu
Lisa Anne Hendricks
G. Irving
ALM
AAML
304
528
0
28 Sep 2022
SoLar: Sinkhorn Label Refinery for Imbalanced Partial-Label Learning
Haobo Wang
Mingxuan Xia
Yixuan Li
Yuren Mao
Lei Feng
Gang Chen
Jiaqi Zhao
92
40
0
21 Sep 2022
The Alignment Problem from a Deep Learning Perspective
Richard Ngo
Lawrence Chan
Sören Mindermann
107
192
0
30 Aug 2022
Out-of-distribution Detection via Frequency-regularized Generative Models
Mu Cai
Yixuan Li
OODD
48
32
0
18 Aug 2022
Task Agnostic and Post-hoc Unseen Distribution Detection
Radhika Dua
Seong-sil Yang
Yixuan Li
Edward Choi
OODD
49
11
0
26 Jul 2022
POEM: Out-of-Distribution Detection with Posterior Sampling
Yifei Ming
Ying Fan
Yixuan Li
OODD
78
117
0
28 Jun 2022
Emergent Abilities of Large Language Models
Jason W. Wei
Yi Tay
Rishi Bommasani
Colin Raffel
Barret Zoph
...
Tatsunori Hashimoto
Oriol Vinyals
Percy Liang
J. Dean
W. Fedus
ELM
ReLM
LRM
279
2,480
0
15 Jun 2022
Making Large Language Models Better Reasoners with Step-Aware Verifier
Yifei Li
Zeqi Lin
Shizhuo Zhang
Qiang Fu
B. Chen
Jian-Guang Lou
Weizhu Chen
ReLM
LRM
84
223
0
06 Jun 2022
Offline RL for Natural Language Generation with Implicit Language Q Learning
Charles Burton Snell
Ilya Kostrikov
Yi Su
Mengjiao Yang
Sergey Levine
OffRL
197
110
0
05 Jun 2022
NaturalProver: Grounded Mathematical Proof Generation with Language Models
Sean Welleck
Jiacheng Liu
Ximing Lu
Hannaneh Hajishirzi
Yejin Choi
AIMat
LRM
73
73
0
25 May 2022
RankGen: Improving Text Generation with Large Ranking Models
Kalpesh Krishna
Yapei Chang
John Wieting
Mohit Iyyer
AIMat
66
69
0
19 May 2022
Mitigating Neural Network Overconfidence with Logit Normalization
Hongxin Wei
Renchunzi Xie
Hao-Ran Cheng
Lei Feng
Bo An
Yixuan Li
OODD
220
285
0
19 May 2022
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
336
3,667
0
02 May 2022
Out-of-Distribution Detection with Deep Nearest Neighbors
Yiyou Sun
Yifei Ming
Xiaojin Zhu
Yixuan Li
OODD
200
520
0
13 Apr 2022
Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback
Yuntao Bai
Andy Jones
Kamal Ndousse
Amanda Askell
Anna Chen
...
Jack Clark
Sam McCandlish
C. Olah
Benjamin Mann
Jared Kaplan
254
2,561
0
12 Apr 2022
Are Vision Transformers Robust to Spurious Correlations?
Soumya Suvra Ghosal
Yifei Ming
Yixuan Li
ViT
61
34
0
17 Mar 2022
How to Exploit Hyperspherical Embeddings for Out-of-Distribution Detection?
Yifei Ming
Yiyou Sun
Ousmane Amadou Dia
Yixuan Li
OODD
97
103
0
08 Mar 2022
Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild
Xuefeng Du
Xin Eric Wang
Gabriel Gozum
Yixuan Li
OODD
98
92
0
08 Mar 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
880
12,973
0
04 Mar 2022
Training OOD Detectors in their Natural Habitats
Julian Katz-Samuels
Julia B. Nakhleh
Robert D. Nowak
Yixuan Li
OODD
47
92
0
07 Feb 2022
PiCO+: Contrastive Label Disambiguation for Robust Partial Label Learning
Haobo Wang
Rui Xiao
Yixuan Li
Lei Feng
Gang Niu
Gang Chen
Jiaqi Zhao
VLM
91
31
0
22 Jan 2022
WebGPT: Browser-assisted question-answering with human feedback
Reiichiro Nakano
Jacob Hilton
S. Balaji
Jeff Wu
Ouyang Long
...
Gretchen Krueger
Kevin Button
Matthew Knight
B. Chess
John Schulman
ALM
RALM
187
1,275
0
17 Dec 2021
Ethical and social risks of harm from Language Models
Laura Weidinger
John F. J. Mellor
Maribeth Rauh
Conor Griffin
J. Uesato
...
Lisa Anne Hendricks
William S. Isaac
Sean Legassick
G. Irving
Iason Gabriel
PILM
114
1,041
0
08 Dec 2021
A General Language Assistant as a Laboratory for Alignment
Amanda Askell
Yuntao Bai
Anna Chen
Dawn Drain
Deep Ganguli
...
Tom B. Brown
Jack Clark
Sam McCandlish
C. Olah
Jared Kaplan
ALM
118
779
0
01 Dec 2021
Provable Guarantees for Understanding Out-of-distribution Detection
Peyman Morteza
Yixuan Li
OODD
102
92
0
01 Dec 2021
ReAct: Out-of-distribution Detection With Rectified Activations
Yiyou Sun
Chuan Guo
Yixuan Li
OODD
113
478
0
24 Nov 2021
A Unified Survey on Anomaly, Novelty, Open-Set, and Out-of-Distribution Detection: Solutions and Future Challenges
Mohammadreza Salehi
Hossein Mirzaei
Dan Hendrycks
Yixuan Li
M. Rohban
Mohammad Sabokrou
OOD
87
198
0
26 Oct 2021
Generalized Out-of-Distribution Detection: A Survey
Jingkang Yang
Kaiyang Zhou
Yixuan Li
Ziwei Liu
284
927
0
21 Oct 2021
On the Importance of Gradients for Detecting Distributional Shifts in the Wild
Rui Huang
Andrew Geng
Yixuan Li
276
350
0
01 Oct 2021
Can multi-label classification networks know what they don't know?
Haoran Wang
Weitang Liu
Alex E. Bocchieri
Yixuan Li
OODD
136
127
0
29 Sep 2021
PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided MCTS Decoding
Antoine Chaffin
Vincent Claveau
Ewa Kijak
53
38
0
28 Sep 2021
On the Impact of Spurious Correlation for Out-of-distribution Detection
Yifei Ming
Hang Yin
Yixuan Li
OODD
182
75
0
12 Sep 2021
Noise-robust Graph Learning by Estimating and Leveraging Pairwise Interactions
Xuefeng Du
Tian Bian
Yu Rong
Bo Han
Tongliang Liu
Tingyang Xu
Wenbing Huang
Yixuan Li
Junzhou Huang
NoLa
82
13
0
14 Jun 2021
PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training
Kimin Lee
Laura M. Smith
Pieter Abbeel
OffRL
63
287
0
09 Jun 2021
DExperts: Decoding-Time Controlled Text Generation with Experts and Anti-Experts
Alisa Liu
Maarten Sap
Ximing Lu
Swabha Swayamdipta
Chandra Bhagavatula
Noah A. Smith
Yejin Choi
MU
107
374
0
07 May 2021
MOS: Towards Scaling Out-of-distribution Detection for Large Semantic Space
Rui Huang
Yixuan Li
OODD
82
244
0
05 May 2021
FUDGE: Controlled Text Generation With Future Discriminators
Kevin Kaichuang Yang
Dan Klein
103
333
0
12 Apr 2021
NeuroLogic Decoding: (Un)supervised Neural Text Generation with Predicate Logic Constraints
Ximing Lu
Peter West
Rowan Zellers
Ronan Le Bras
Chandra Bhagavatula
Yejin Choi
NAI
83
147
0
24 Oct 2020
Energy-based Out-of-distribution Detection
Weitang Liu
Xiaoyun Wang
John Douglas Owens
Yixuan Li
OODD
271
1,356
0
08 Oct 2020
RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models
Samuel Gehman
Suchin Gururangan
Maarten Sap
Yejin Choi
Noah A. Smith
158
1,209
0
24 Sep 2020
GeDi: Generative Discriminator Guided Sequence Generation
Ben Krause
Akhilesh Deepak Gotmare
Bryan McCann
N. Keskar
Shafiq Joty
R. Socher
Nazneen Rajani
128
407
0
14 Sep 2020
Learning to summarize from human feedback
Nisan Stiennon
Long Ouyang
Jeff Wu
Daniel M. Ziegler
Ryan J. Lowe
Chelsea Voss
Alec Radford
Dario Amodei
Paul Christiano
ALM
240
2,147
0
02 Sep 2020
Model Patching: Closing the Subgroup Performance Gap with Data Augmentation
Karan Goel
Albert Gu
Yixuan Li
Christopher Ré
84
121
0
15 Aug 2020
Robust Out-of-distribution Detection for Neural Networks
Jiefeng Chen
Yixuan Li
Xi Wu
Yingyu Liang
S. Jha
OODD
192
87
0
21 Mar 2020
Plug and Play Language Models: A Simple Approach to Controlled Text Generation
Sumanth Dathathri
Andrea Madotto
Janice Lan
Jane Hung
Eric Frank
Piero Molino
J. Yosinski
Rosanne Liu
KELM
141
976
0
04 Dec 2019
Comparison of Diverse Decoding Methods from Conditional Language Models
Daphne Ippolito
Reno Kriz
M. Kustikova
João Sedoc
Chris Callison-Burch
AI4CE
71
114
0
14 Jun 2019
Previous
1
2
3
Next