ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.06985
  4. Cited By
PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and
  Tuning Methods

PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods

9 July 2024
Yiying Wang
Xiaojing Li
Binzhu Wang
Yueyang Zhou
Yingru Lin
Han Ji
Hong Chen
Jinshi Zhang
Fei Yu
Zewei Zhao
Song Jin
Renji Gong
Wanqing Xu
ArXivPDFHTML

Papers citing "PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods"

22 / 22 papers shown
Title
RAFT: Adapting Language Model to Domain Specific RAG
RAFT: Adapting Language Model to Domain Specific RAG
Tianjun Zhang
Shishir G. Patil
Naman Jain
Sheng Shen
Matei A. Zaharia
Ion Stoica
Joseph E. Gonzalez
RALM
70
192
0
15 Mar 2024
Trial and Error: Exploration-Based Trajectory Optimization for LLM
  Agents
Trial and Error: Exploration-Based Trajectory Optimization for LLM Agents
Yifan Song
Da Yin
Xiang Yue
Jie Huang
Sujian Li
Bill Yuchen Lin
56
74
0
04 Mar 2024
Direct Language Model Alignment from Online AI Feedback
Direct Language Model Alignment from Online AI Feedback
Shangmin Guo
Biao Zhang
Tianlin Liu
Tianqi Liu
Misha Khalman
...
Thomas Mesnard
Yao-Min Zhao
Bilal Piot
Johan Ferret
Mathieu Blondel
ALM
54
146
0
07 Feb 2024
KTO: Model Alignment as Prospect Theoretic Optimization
KTO: Model Alignment as Prospect Theoretic Optimization
Kawin Ethayarajh
Winnie Xu
Niklas Muennighoff
Dan Jurafsky
Douwe Kiela
199
510
0
02 Feb 2024
Iterative Preference Learning from Human Feedback: Bridging Theory and
  Practice for RLHF under KL-Constraint
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint
Wei Xiong
Hanze Dong
Chen Ye
Ziqi Wang
Han Zhong
Heng Ji
Nan Jiang
Tong Zhang
OffRL
62
179
0
18 Dec 2023
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case
  Study in Medicine
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine
Harsha Nori
Yin Tat Lee
Sheng Zhang
Dean Carignan
Richard Edgar
...
Hoifung Poon
Tao Qin
Naoto Usuyama
Chris White
Eric Horvitz
LM&MA
AI4MH
MedIm
ELM
68
303
0
28 Nov 2023
PromptAgent: Strategic Planning with Language Models Enables
  Expert-level Prompt Optimization
PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization
Xinyuan Wang
Chenxi Li
Zhen Wang
Fan Bai
Haotian Luo
Jiayou Zhang
Nebojsa Jojic
Eric P. Xing
Zhiting Hu
90
112
0
25 Oct 2023
A General Theoretical Paradigm to Understand Learning from Human
  Preferences
A General Theoretical Paradigm to Understand Learning from Human Preferences
M. G. Azar
Mark Rowland
Bilal Piot
Daniel Guo
Daniele Calandriello
Michal Valko
Rémi Munos
112
597
0
18 Oct 2023
Prompt-augmented Temporal Point Process for Streaming Event Sequence
Prompt-augmented Temporal Point Process for Streaming Event Sequence
Siqiao Xue
Yan Wang
Zhixuan Chu
Xiaoming Shi
Caigao Jiang
Hongyan Hao
Gangwei Jiang
Xiaoyun Feng
James Y. Zhang
Junqing Zhou
AI4TS
46
24
0
08 Oct 2023
CoCA: Fusing Position Embedding with Collinear Constrained Attention in
  Transformers for Long Context Window Extending
CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending
Shiyi Zhu
Jingting Ye
Wei Jiang
Siqiao Xue
Qi Zhang
Yifan Wu
Jianguo Li
34
4
0
15 Sep 2023
The Rise and Potential of Large Language Model Based Agents: A Survey
The Rise and Potential of Large Language Model Based Agents: A Survey
Zhiheng Xi
Wenxiang Chen
Xin Guo
Wei He
Yiwen Ding
...
Wenjuan Qin
Yongyan Zheng
Xipeng Qiu
Xuanjing Huan
Tao Gui
LM&MA
LM&Ro
3DV
AI4CE
75
907
0
14 Sep 2023
A Survey on Large Language Model based Autonomous Agents
A Survey on Large Language Model based Autonomous Agents
Lei Wang
Chengbang Ma
Xueyang Feng
Zeyu Zhang
Hao-ran Yang
...
Xu Chen
Yankai Lin
Wayne Xin Zhao
Zhewei Wei
Ji-Rong Wen
LLMAG
AI4CE
LM&Ro
66
1,225
0
22 Aug 2023
ExpeL: LLM Agents Are Experiential Learners
ExpeL: LLM Agents Are Experiential Learners
Andrew Zhao
Daniel Huang
Quentin Xu
Matthieu Lin
Yang Liu
Gao Huang
LLMAG
74
208
0
20 Aug 2023
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation
Qingyun Wu
Gagan Bansal
Jieyu Zhang
Yiran Wu
Beibin Li
...
Jiale Liu
Ahmed Hassan Awadallah
Ryen W. White
Doug Burger
Chi Wang
LLMAG
AI4CE
53
315
0
16 Aug 2023
WeaverBird: Empowering Financial Decision-Making with Large Language
  Model, Knowledge Base, and Search Engine
WeaverBird: Empowering Financial Decision-Making with Large Language Model, Knowledge Base, and Search Engine
Siqiao Xue
Fan Zhou
Y. Xu
Ming Jin
Qingsong Wen
...
Jun Zhou
Shuo Xie
D. Xiu
James Y. Zhang
Hongyuan Mei
RALM
AIFin
41
15
0
10 Aug 2023
Llama 2: Open Foundation and Fine-Tuned Chat Models
Llama 2: Open Foundation and Fine-Tuned Chat Models
Hugo Touvron
Louis Martin
Kevin R. Stone
Peter Albert
Amjad Almahairi
...
Sharan Narang
Aurelien Rodriguez
Robert Stojnic
Sergey Edunov
Thomas Scialom
AI4MH
ALM
206
11,636
0
18 Jul 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
244
4,186
0
09 Jun 2023
Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM
  Agents
Multi-Agent Collaboration: Harnessing the Power of Intelligent LLM Agents
Yashar Talebirad
Amirhossein Nadiri
LLMAG
85
209
0
05 Jun 2023
Domain Specialization as the Key to Make Large Language Models
  Disruptive: A Comprehensive Survey
Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey
Chen Ling
Xujiang Zhao
Jiaying Lu
Chengyuan Deng
Can Zheng
...
Chris White
Quanquan Gu
Jian Pei
Carl Yang
Liang Zhao
ALM
77
133
0
30 May 2023
Direct Preference Optimization: Your Language Model is Secretly a Reward
  Model
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Rafael Rafailov
Archit Sharma
E. Mitchell
Stefano Ermon
Christopher D. Manning
Chelsea Finn
ALM
288
3,712
0
29 May 2023
HYPRO: A Hybridly Normalized Probabilistic Model for Long-Horizon
  Prediction of Event Sequences
HYPRO: A Hybridly Normalized Probabilistic Model for Long-Horizon Prediction of Event Sequences
Siqiao Xue
Xiaoming Shi
James Y. Zhang
Hongyuan Mei
AI4TS
36
35
0
04 Oct 2022
A Meta Reinforcement Learning Approach for Predictive Autoscaling in the
  Cloud
A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud
Siqiao Xue
Chao Qu
Xiaoming Shi
Cong Liao
Shiyi Zhu
...
Yun Hu
Lei Lei
Yang Zheng
Jianguo Li
James Y. Zhang
83
39
0
31 May 2022
1