ResearchTrend.AI
ARGS: Alignment as Reward-Guided Search

23 January 2024
Maxim Khanov
Jirayu Burapacheep
Yixuan Li
ArXiv (abs) | PDF | HTML | Github (40★)

Papers citing "ARGS: Alignment as Reward-Guided Search"

13 / 113 papers shown
The Curious Case of Neural Text Degeneration
Ari Holtzman
Jan Buys
Li Du
Maxwell Forbes
Yejin Choi
187
3,184
0
22 Apr 2019
Defense Against Adversarial Images using Web-Scale Nearest-Neighbor Search
Abhimanyu Dubey
Laurens van der Maaten
Zeki Yalniz
Yixuan Li
D. Mahajan
AAML
102
66
0
05 Mar 2019
Hierarchical Neural Story Generation
Angela Fan
M. Lewis
Yann N. Dauphin
DiffM
181
1,623
0
13 May 2018
Exploring the Limits of Weakly Supervised Pretraining
D. Mahajan
Ross B. Girshick
Vignesh Ramanathan
Kaiming He
Manohar Paluri
Yixuan Li
Ashwin R. Bharambe
Laurens van der Maaten
VLM
188
1,368
0
02 May 2018
Understanding the Loss Surface of Neural Networks for Binary Classification
Shiyu Liang
Ruoyu Sun
Yixuan Li
R. Srikant
85
88
0
19 Feb 2018
Deep Reinforcement Learning that Matters
Peter Henderson
Riashat Islam
Philip Bachman
Joelle Pineau
Doina Precup
David Meger
OffRL
118
1,954
0
19 Sep 2017
Attention Is All You Need
Ashish Vaswani
Noam M. Shazeer
Niki Parmar
Jakob Uszkoreit
Llion Jones
Aidan Gomez
Lukasz Kaiser
Illia Polosukhin
3DV
713
131,652
0
12 Jun 2017
Deep reinforcement learning from human preferences
Paul Christiano
Jan Leike
Tom B. Brown
Miljan Martic
Shane Legg
Dario Amodei
190
3,318
0
12 Jun 2017
Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks
Shiyu Liang
Yixuan Li
R. Srikant
UQCV, OODD
168
2,072
0
08 Jun 2017
Snapshot Ensembles: Train 1, get M for free
Gao Huang
Yixuan Li
Geoff Pleiss
Zhuang Liu
John E. Hopcroft
Kilian Q. Weinberger
OOD, FedML, UQCV
134
951
0
01 Apr 2017
Self-critical Sequence Training for Image Captioning
Steven J. Rennie
E. Marcheret
Youssef Mroueh
Jerret Ross
Vaibhava Goel
107
1,887
0
02 Dec 2016
Convergent Learning: Do different neural networks learn the same representations?
Yixuan Li
J. Yosinski
Jeff Clune
Hod Lipson
John E. Hopcroft
SSL
86
370
0
24 Nov 2015
Deep Manifold Traversal: Changing Labels with Convolutional Features
Jacob R. Gardner
P. Upchurch
Matt J. Kusner
Yixuan Li
Kilian Q. Weinberger
Kavita Bala
John E. Hopcroft
62
67
0
19 Nov 2015