ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2211.11890
  4. Cited By
TEMPERA: Test-Time Prompting via Reinforcement Learning

TEMPERA: Test-Time Prompting via Reinforcement Learning

21 November 2022
Tianjun Zhang
Xuezhi Wang
Denny Zhou
Dale Schuurmans
Joseph E. Gonzalez
    VLM
ArXivPDFHTML

Papers citing "TEMPERA: Test-Time Prompting via Reinforcement Learning"

39 / 39 papers shown
Title
TAPO: Task-Referenced Adaptation for Prompt Optimization
TAPO: Task-Referenced Adaptation for Prompt Optimization
Wenxin Luo
Luu Anh Tuan
Xiaopeng Li
Weibo Zhou
Pengyue Jia
Xiangyu Zhao
47
0
0
12 Jan 2025
TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing
  Prompts
TP-Eval: Tap Multimodal LLMs' Potential in Evaluation by Customizing Prompts
Yuxuan Xie
Tianhua Li
Wenqi Shao
Kaipeng Zhang
28
0
0
23 Oct 2024
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey
  on How to Make your LLMs use External Data More Wisely
Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely
Siyun Zhao
Yuqing Yang
Zilong Wang
Zhiyuan He
Luna Qiu
Lili Qiu
SyDa
RALM
3DV
44
35
0
23 Sep 2024
Large Language Models are Interpretable Learners
Large Language Models are Interpretable Learners
Ruochen Wang
Si Si
Felix X. Yu
Dorothea Wiesmann
Cho-Jui Hsieh
Inderjit Dhillon
24
3
0
25 Jun 2024
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization
  for Language Models
Concentrate Attention: Towards Domain-Generalizable Prompt Optimization for Language Models
Chengzhengxu Li
Xiaoming Liu
Zhaohan Zhang
Yichen Wang
Chen Liu
Y. Lan
Chao Shen
60
2
0
15 Jun 2024
Task Facet Learning: A Structured Approach to Prompt Optimization
Task Facet Learning: A Structured Approach to Prompt Optimization
Gurusha Juneja
Nagarajan Natarajan
Hua Li
Hua Li
Jian Jiao
Amit Sharma
53
7
0
15 Jun 2024
CodeCloak: A Method for Evaluating and Mitigating Code Leakage by LLM
  Code Assistants
CodeCloak: A Method for Evaluating and Mitigating Code Leakage by LLM Code Assistants
Amit Finkman
Eden Bar-Kochva
Avishag Shapira
D. Mimran
Yuval Elovici
A. Shabtai
ELM
38
3
0
13 Apr 2024
Efficient Prompting Methods for Large Language Models: A Survey
Efficient Prompting Methods for Large Language Models: A Survey
Kaiyan Chang
Songcheng Xu
Chenglong Wang
Yingfeng Luo
Tong Xiao
Jingbo Zhu
LRM
45
32
0
01 Apr 2024
Intent-based Prompt Calibration: Enhancing prompt optimization with
  synthetic boundary cases
Intent-based Prompt Calibration: Enhancing prompt optimization with synthetic boundary cases
Elad Levi
Eli Brosh
Matan Friedmann
24
8
0
05 Feb 2024
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement
  Learning and Large Language Models
The RL/LLM Taxonomy Tree: Reviewing Synergies Between Reinforcement Learning and Large Language Models
M. Pternea
Prerna Singh
Abir Chakraborty
Y. Oruganti
M. Milletarí
Sayli Bapat
Kebei Jiang
OffRL
24
7
0
02 Feb 2024
A Bayesian approach for prompt optimization in pre-trained language
  models
A Bayesian approach for prompt optimization in pre-trained language models
Antonio Sabbatella
Andrea Ponti
Antonio Candelieri
I. Giordani
Francesco Archetti
34
1
0
01 Dec 2023
PromptAgent: Strategic Planning with Language Models Enables
  Expert-level Prompt Optimization
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization
Xinyuan Wang
Chenxi Li
Zhen Wang
Fan Bai
Haotian Luo
Jiayou Zhang
Nebojsa Jojic
Eric P. Xing
Zhiting Hu
28
102
0
25 Oct 2023
What's the Magic Word? A Control Theory of LLM Prompting
What's the Magic Word? A Control Theory of LLM Prompting
Aman Bhargava
Cameron Witkowski
Manav Shah
Matt W. Thomson
LLMAG
61
30
0
02 Oct 2023
SPELL: Semantic Prompt Evolution based on a LLM
SPELL: Semantic Prompt Evolution based on a LLM
Yujian Betterest Li
Kai Wu
48
10
0
02 Oct 2023
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse
  RL
Query-Dependent Prompt Evaluation and Optimization with Offline Inverse RL
Hao Sun
Alihan Huyuk
M. Schaar
OffRL
LRM
23
28
0
13 Sep 2023
Matching Table Metadata with Business Glossaries Using Large Language
  Models
Matching Table Metadata with Business Glossaries Using Large Language Models
Elita Lobo
Oktie Hassanzadeh
Nhan Pham
Nandana Mihindukulasooriya
D. Subramanian
Horst Samulowitz
16
3
0
08 Sep 2023
Reinforcement Learning for Generative AI: A Survey
Reinforcement Learning for Generative AI: A Survey
Yuanjiang Cao
Quan.Z Sheng
Julian McAuley
Lina Yao
SyDa
46
10
0
28 Aug 2023
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt
  Tuning
Diverse Data Augmentation with Diffusions for Effective Test-time Prompt Tuning
Chun-Mei Feng
Kai Yu
Yong Liu
Salman Khan
W. Zuo
VLM
22
78
0
11 Aug 2023
Pre-Trained Large Language Models for Industrial Control
Pre-Trained Large Language Models for Industrial Control
Lei Song
Chuheng Zhang
Li Zhao
Jiang Bian
LM&Ro
AI4CE
32
12
0
06 Aug 2023
Selective Perception: Optimizing State Descriptions with Reinforcement
  Learning for Language Model Actors
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
Kolby Nottingham
Yasaman Razeghi
Kyungmin Kim
JB Lanier
Pierre Baldi
Roy Fox
Sameer Singh
30
9
0
21 Jul 2023
AutoHint: Automatic Prompt Optimization with Hint Generation
AutoHint: Automatic Prompt Optimization with Hint Generation
Hong Sun
Xue Li
Yi Xu
Youkow Homma
Qinhao Cao
Min-man Wu
Jian Jiao
Denis Xavier Charles
34
23
0
13 Jul 2023
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Preference-grounded Token-level Guidance for Language Model Fine-tuning
Shentao Yang
Shujian Zhang
Congying Xia
Yihao Feng
Caiming Xiong
Mi Zhou
29
23
0
01 Jun 2023
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in
  Vision-Language Models
Test-Time Adaptation with CLIP Reward for Zero-Shot Generalization in Vision-Language Models
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
VLM
31
22
0
29 May 2023
Universal Self-Adaptive Prompting
Universal Self-Adaptive Prompting
Xingchen Wan
Ruoxi Sun
Hootan Nakhost
H. Dai
Julian Martin Eisenschlos
Sercan Ö. Arik
Tomas Pfister
LRM
38
9
0
24 May 2023
Query Rewriting for Retrieval-Augmented Large Language Models
Query Rewriting for Retrieval-Augmented Large Language Models
Xinbei Ma
Yeyun Gong
Pengcheng He
Hai Zhao
Nan Duan
KELM
LRM
36
103
0
23 May 2023
Robust Prompt Optimization for Large Language Models Against
  Distribution Shifts
Robust Prompt Optimization for Large Language Models Against Distribution Shifts
Moxin Li
Wenjie Wang
Fuli Feng
Yixin Cao
Jizhi Zhang
Tat-Seng Chua
OffRL
42
15
0
23 May 2023
Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency
Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency
Lingfeng Shen
Weiting Tan
Boyuan Zheng
Daniel Khashabi
VLM
39
6
0
18 May 2023
Automatic Prompt Optimization with "Gradient Descent" and Beam Search
Automatic Prompt Optimization with "Gradient Descent" and Beam Search
Reid Pryzant
Dan Iter
Jerry Li
Y. Lee
Chenguang Zhu
Michael Zeng
18
303
0
04 May 2023
Guiding Large Language Models via Directional Stimulus Prompting
Guiding Large Language Models via Directional Stimulus Prompting
Zekun Li
Baolin Peng
Pengcheng He
Michel Galley
Jianfeng Gao
Xi Yan
LLMAG
LRM
LM&Ro
40
95
0
22 Feb 2023
In-context Example Selection with Influences
In-context Example Selection with Influences
Nguyen Tai
Eric Wong
13
48
0
21 Feb 2023
Explanation Selection Using Unlabeled Data for Chain-of-Thought
  Prompting
Explanation Selection Using Unlabeled Data for Chain-of-Thought Prompting
Xi Ye
Greg Durrett
LRM
ReLM
32
12
0
09 Feb 2023
PromptSource: An Integrated Development Environment and Repository for
  Natural Language Prompts
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts
Stephen H. Bach
Victor Sanh
Zheng-Xin Yong
Albert Webson
Colin Raffel
...
Khalid Almubarak
Xiangru Tang
Dragomir R. Radev
Mike Tian-Jian Jiang
Alexander M. Rush
VLM
225
339
0
02 Feb 2022
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally
  Across Scales and Tasks
P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks
Xiao Liu
Kaixuan Ji
Yicheng Fu
Weng Lam Tam
Zhengxiao Du
Zhilin Yang
Jie Tang
VLM
238
808
0
14 Oct 2021
Fantastically Ordered Prompts and Where to Find Them: Overcoming
  Few-Shot Prompt Order Sensitivity
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity
Yao Lu
Max Bartolo
Alastair Moore
Sebastian Riedel
Pontus Stenetorp
AILaw
LRM
279
1,124
0
18 Apr 2021
The Power of Scale for Parameter-Efficient Prompt Tuning
The Power of Scale for Parameter-Efficient Prompt Tuning
Brian Lester
Rami Al-Rfou
Noah Constant
VPVLM
280
3,858
0
18 Apr 2021
What Makes Good In-Context Examples for GPT-$3$?
What Makes Good In-Context Examples for GPT-333?
Jiachang Liu
Dinghan Shen
Yizhe Zhang
Bill Dolan
Lawrence Carin
Weizhu Chen
AAML
RALM
275
1,312
0
17 Jan 2021
Making Pre-trained Language Models Better Few-shot Learners
Making Pre-trained Language Models Better Few-shot Learners
Tianyu Gao
Adam Fisch
Danqi Chen
243
1,924
0
31 Dec 2020
Exploiting Cloze Questions for Few Shot Text Classification and Natural
  Language Inference
Exploiting Cloze Questions for Few Shot Text Classification and Natural Language Inference
Timo Schick
Hinrich Schütze
258
1,589
0
21 Jan 2020
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
1