ResearchTrend.AI
Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence

3 April 2025
Anita Rau
Mark Endo
Josiah Aklilu
Jaewoo Heo
Khaled Saab
Alberto Paderno
Jeffrey Jopling
F. C. Holsinger
Serena Yeung-Levy
Papers citing "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"

14 / 14 papers shown
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
DeepSeek-AI
Daya Guo
Dejian Yang
Haowei Zhang
Junxiao Song
...
Shiyu Wang
S. Yu
Shunfeng Zhou
Shuting Pan
S.S. Li
ReLM, VLM, OffRL, AI4TS, LRM
380
2,013
0
22 Jan 2025
TOMATO: Assessing Visual Temporal Reasoning Capabilities in Multimodal Foundation Models
Ziyao Shangguan
Chuhan Li
Yuxuan Ding
Yanan Zheng
Yilun Zhao
Tesca Fitzgerald
Arman Cohan
68
16
0
30 Oct 2024
Self-Supervised Learning for Endoscopic Video Analysis
Roy Hirsch
Mathilde Caron
Regev Cohen
Amir Livne
Ron Shapiro
Tomer Golany
Roman Goldenberg
Daniel Freedman
Ehud Rivlin
SSL
74
20
0
23 Aug 2023
Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures
Kun Yuan
V. Srivastav
Tong Yu
Joël L. Lavanchy
J. Marescaux
Pietro Mascagni
Nassir Navab
N. Padoy
158
23
0
27 Jul 2023
GPT-4 Technical Report
OpenAI
Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG, MLLM
1.5K
14,761
0
15 Mar 2023
Latent Graph Representations for Critical View of Safety Assessment
Aditya Murali
Deepak Alapatt
Pietro Mascagni
Armine Vardazaryan
Alain Garcia
Nariaki Okamoto
Didier Mutter
N. Padoy
MedIm
104
24
0
08 Dec 2022
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy
Ziyi Wang
Bo Lu
Yonghao Long
Fangxun Zhong
T. Cheung
Qi Dou
Yunhui Liu
68
63
0
03 Aug 2022
Dissecting Self-Supervised Learning Methods for Surgical Computer Vision
Sanat Ramesh
V. Srivastav
Deepak Alapatt
Tong Yu
Aditya Murali
...
Saurav Sharma
A. Fleurentin
Georgios Exarchakis
Alexandros Karargyris
N. Padoy
115
46
0
01 Jul 2022
Flamingo: a Visual Language Model for Few-Shot Learning
Jean-Baptiste Alayrac
Jeff Donahue
Pauline Luc
Antoine Miech
Iain Barr
...
Mikolaj Binkowski
Ricardo Barreira
Oriol Vinyals
Andrew Zisserman
Karen Simonyan
MLLM, VLM
418
3,610
0
29 Apr 2022
Comparative Validation of Machine Learning Algorithms for Surgical Workflow and Skill Analysis with the HeiChole Benchmark
M. Wagner
Beat-Peter Müller-Stich
A. Kisilenko
Duc Tran
P. Heger
...
M. Frankenberg
F. Mathis-Ullrich
Lena Maier-Hein
Stefanie Speidel
S. Bodenstedt
86
78
0
30 Sep 2021
Does Your Dermatology Classifier Know What It Doesn't Know? Detecting the Long-Tail of Unseen Conditions
Abhijit Guha Roy
Jie Jessie Ren
Shekoofeh Azizi
Aaron Loh
Vivek Natarajan
...
Yun-Hui Liu
Taylan Cemgil
Alan Karthikesalingam
Balaji Lakshminarayanan
Jim Winkens
117
108
0
08 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP, VLM
1.0K
29,926
0
26 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM, CLIP
480
3,906
0
11 Feb 2021
Improved Baselines with Momentum Contrastive Learning
Xinlei Chen
Haoqi Fan
Ross B. Girshick
Kaiming He
SSL
517
3,449
0
09 Mar 2020