Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2306.05426
Cited By
SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking
8 June 2023
Chris Cundy
Stefano Ermon
Re-assign community
ArXiv
PDF
HTML
Papers citing
"SequenceMatch: Imitation Learning for Autoregressive Sequence Modelling with Backtracking"
7 / 7 papers shown
Title
From Demonstrations to Rewards: Alignment Without Explicit Human Preferences
Siliang Zeng
Yao Liu
Huzefa Rangwala
George Karypis
Mingyi Hong
Rasool Fakoor
57
2
0
15 Mar 2025
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning
Gokul Swamy
Sanjiban Choudhury
Wen Sun
Zhiwei Steven Wu
J. Andrew Bagnell
OffRL
55
8
0
03 Mar 2025
Backtracking Improves Generation Safety
Yiming Zhang
Jianfeng Chi
Hailey Nguyen
Kartikeya Upasani
Daniel M. Bikel
Jason Weston
Eric Michael Smith
SILM
54
7
0
22 Sep 2024
Imitating Language via Scalable Inverse Reinforcement Learning
Markus Wulfmeier
Michael Bloesch
Nino Vieillard
Arun Ahuja
Jorg Bornschein
...
Jost Tobias Springenberg
Nikola Momchev
Olivier Bachem
Matthieu Geist
Martin Riedmiller
47
9
0
02 Sep 2024
Hybrid Inverse Reinforcement Learning
Juntao Ren
Gokul Swamy
Zhiwei Steven Wu
J. Andrew Bagnell
Sanjiban Choudhury
41
18
0
13 Feb 2024
Mapping the Challenges of HCI: An Application and Evaluation of ChatGPT and GPT-4 for Mining Insights at Scale
Jonas Oppenlaender
Joonas Hamalainen
41
6
0
08 Jun 2023
A Theoretical Analysis of the Repetition Problem in Text Generation
Z. Fu
Wai Lam
Anthony Man-Cho So
Bei Shi
82
90
0
29 Dec 2020
1