REFINER: Reasoning Feedback on Intermediate Representations

4 April 2023

Boi Faltings

Papers cited by "REFINER: Reasoning Feedback on Intermediate Representations"

4 / 54 papers shown

Title
Proximal Policy Optimization Algorithms | John Schulman, Filip Wolski, Prafulla Dhariwal, Alec Radford, Oleg Klimov | OffRL | 448 | 19,006 | 0 | 20 Jul 2017
Deep reinforcement learning from human preferences | Paul Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, Dario Amodei | | 134 | 3,288 | 0 | 12 Jun 2017
Program Induction by Rationale Generation: Learning to Solve and Explain Algebraic Word Problems | Wang Ling, Dani Yogatama, Chris Dyer, Phil Blunsom | AIMat | 76 | 727 | 0 | 11 May 2017
Dialog-based Language Learning | Jason Weston | LLMAG | 50 | 109 | 0 | 20 Apr 2016