Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.18341
Cited By
IterAlign: Iterative Constitutional Alignment of Large Language Models
27 March 2024
Xiusi Chen
Hongzhi Wen
Sreyashi Nag
Chen Luo
Qingyu Yin
Ruirui Li
Zheng Li
Wei Wang
AILaw
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"IterAlign: Iterative Constitutional Alignment of Large Language Models"
7 / 7 papers shown
Title
Generating and Evaluating Tests for K-12 Students with Language Model Simulations: A Case Study on Sentence Reading Efficiency
E. Zelikman
Wanjing Anya Ma
Jasmine E. Tran
Diyi Yang
Jason D. Yeatman
Nick Haber
AI4Ed
52
11
0
10 Oct 2023
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
Zhiqing Sun
Songlin Yang
Qinhong Zhou
Hongxin Zhang
Zhenfang Chen
David D. Cox
Yiming Yang
Chuang Gan
SyDa
ALM
99
337
0
04 May 2023
On Second Thought, Let's Not Think Step by Step! Bias and Toxicity in Zero-Shot Reasoning
Omar Shaikh
Hongxin Zhang
William B. Held
Michael S. Bernstein
Diyi Yang
ReLM
LRM
137
200
0
15 Dec 2022
Constitutional AI: Harmlessness from AI Feedback
Yuntao Bai
Saurav Kadavath
Sandipan Kundu
Amanda Askell
John Kernion
...
Dario Amodei
Nicholas Joseph
Sam McCandlish
Tom B. Brown
Jared Kaplan
SyDa
MoMe
209
1,640
0
15 Dec 2022
Self-critiquing models for assisting human evaluators
William Saunders
Catherine Yeh
Jeff Wu
Steven Bills
Ouyang Long
Jonathan Ward
Jan Leike
ALM
ELM
107
305
0
12 Jun 2022
A General Language Assistant as a Laboratory for Alignment
Amanda Askell
Yuntao Bai
Anna Chen
Dawn Drain
Deep Ganguli
...
Tom B. Brown
Jack Clark
Sam McCandlish
C. Olah
Jared Kaplan
ALM
118
789
0
01 Dec 2021
SciBERT: A Pretrained Language Model for Scientific Text
Iz Beltagy
Kyle Lo
Arman Cohan
163
2,983
0
26 Mar 2019
1