ResearchTrend.AI
DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models

2 April 2025
S. Jung
Donghun Lee
Shinbok Lee
Gaeun Seo
Daniel Lee
Byeongil Ko
Junrae Cho
Kihyun Kim
EungGyun Kim
M. Shin

Papers citing "DiaTool-DPO: Multi-Turn Direct Preference Optimization for Tool-Augmented Large Language Models"

2 / 2 papers shown
Title
ToolSandbox: A Stateful, Conversational, Interactive Evaluation Benchmark for LLM Tool Use Capabilities
Jiarui Lu
Thomas Holleis
Yizhe Zhang
Bernhard Aumayer
Feng Nan
...
Shen Ma
Mengyu Li
Guoli Yin
Zirui Wang
Ruoming Pang
LLMAG, ELM
110
39
0
08 Aug 2024
Can Tool-augmented Large Language Models be Aware of Incomplete Conditions?
Seungbin Yang
Yujin Baek
Taehee Kim
Jaegul Choo
82
2
0
18 Jun 2024