ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2407.11006
  4. Cited By
Evaluating the Efficacy of Foundational Models: Advancing Benchmarking
  Practices to Enhance Fine-Tuning Decision-Making

Evaluating the Efficacy of Foundational Models: Advancing Benchmarking Practices to Enhance Fine-Tuning Decision-Making

25 June 2024
O. Amujo
S. Yang
ArXivPDFHTML

Papers citing "Evaluating the Efficacy of Foundational Models: Advancing Benchmarking Practices to Enhance Fine-Tuning Decision-Making"

7 / 7 papers shown
Title
Joint Detection of Fraud and Concept Drift inOnline Conversations with LLM-Assisted Judgment
Joint Detection of Fraud and Concept Drift inOnline Conversations with LLM-Assisted Judgment
Ali Şenol
Garima Agrawal
Huan Liu
24
0
0
07 May 2025
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
From Cool Demos to Production-Ready FMware: Core Challenges and a Technology Roadmap
Gopi Krishnan Rajbahadur
G. Oliva
Dayi Lin
Ahmed E. Hassan
46
1
0
28 Jan 2025
Gemma: Open Models Based on Gemini Research and Technology
Gemma: Open Models Based on Gemini Research and Technology
Gemma Team
Gemma Team Thomas Mesnard
Cassidy Hardin
Robert Dadashi
Surya Bhupatiraju
...
Armand Joulin
Noah Fiedel
Evan Senter
Alek Andreev
Kathleen Kenealy
VLM
LLMAG
131
431
0
13 Mar 2024
On a Foundation Model for Operating Systems
On a Foundation Model for Operating Systems
Divyanshu Saxena
Nihal Sharma
Donghyun Kim
Rohit Dwivedula
Jiayi Chen
...
Alex Dimakis
P. B. Godfrey
Daehyeok Kim
Chris Rossbach
Gang Wang
47
2
0
13 Dec 2023
Multimodal Foundation Models: From Specialists to General-Purpose
  Assistants
Multimodal Foundation Models: From Specialists to General-Purpose Assistants
Chunyuan Li
Zhe Gan
Zhengyuan Yang
Jianwei Yang
Linjie Li
Lijuan Wang
Jianfeng Gao
MLLM
115
228
0
18 Sep 2023
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the
  Question Answering Performance of the GPT LLM Family
Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family
Yiming Tan
Dehai Min
Y. Li
Wenbo Li
Nan Hu
Yongrui Chen
Guilin Qi
AI4MH
ELM
49
95
0
14 Mar 2023
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
297
6,956
0
20 Apr 2018
1