ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2502.14886
  4. Cited By
Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review

Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review

24 February 2025
Ufaq Khan
Umair Nawaz
A. Qayyum
Shazad Ashraf
Muhammad Bilal
Junaid Qadir
ArXivPDFHTML

Papers citing "Surgical Scene Understanding in the Era of Foundation AI Models: A Comprehensive Review"

50 / 142 papers shown
Title
Polyp-SAM: Transfer SAM for Polyp Segmentation
Polyp-SAM: Transfer SAM for Polyp Segmentation
Yuheng Li
Mingzhe Hu
Xiaofeng Yang
MedIm
180
84
0
29 Apr 2023
Methods and datasets for segmentation of minimally invasive surgical
  instruments in endoscopic images and videos: A review of the state of the art
Methods and datasets for segmentation of minimally invasive surgical instruments in endoscopic images and videos: A review of the state of the art
Tobias Rueckert
Daniel Rueckert
Christoph Palm
49
17
0
25 Apr 2023
SurgicalGPT: End-to-End Language-Vision GPT for Visual Question
  Answering in Surgery
SurgicalGPT: End-to-End Language-Vision GPT for Visual Question Answering in Surgery
Lalithkumar Seenivasan
Mobarakol Islam
Gokul Kannan
Hongliang Ren
52
41
0
19 Apr 2023
Segment Anything Model (SAM) for Digital Pathology: Assess Zero-shot
  Segmentation on Whole Slide Imaging
Segment Anything Model (SAM) for Digital Pathology: Assess Zero-shot Segmentation on Whole Slide Imaging
Ruining Deng
C. Cui
Quan Liu
Tianyuan Yao
Lucas W. Remedios
...
Shilin Zhao
Agnes B. Fogo
Haichun Yang
Yucheng Tang
Yuankai Huo
VLM
MedIm
33
203
0
09 Apr 2023
Segment Anything
Segment Anything
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
...
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
MLLM
VLM
238
7,047
0
05 Apr 2023
EVA-CLIP: Improved Training Techniques for CLIP at Scale
EVA-CLIP: Improved Training Techniques for CLIP at Scale
Quan-Sen Sun
Yuxin Fang
Ledell Yu Wu
Xinlong Wang
Yue Cao
CLIP
VLM
104
478
0
27 Mar 2023
LLaMA: Open and Efficient Foundation Language Models
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
678
12,840
0
27 Feb 2023
VideoSum: A Python Library for Surgical Video Summarization
VideoSum: A Python Library for Surgical Video Summarization
Luis C. Garcia-Peraza-Herrera
Sebastien Ourselin
Tom Vercauteren
42
2
0
15 Feb 2023
Medical Image Segmentation Review: The success of U-Net
Medical Image Segmentation Review: The success of U-Net
Reza Azad
Ehsan Khodapanah Aghdam
Amelie Rauland
Yiwei Jia
Atlas Haddadi Avval
Afshin Bozorgpour
Sanaz Karimijafarbigloo
Joseph Paul Cohen
Ehsan Adeli
Dorit Merhof
SSeg
67
280
0
27 Nov 2022
From Forks to Forceps: A New Framework for Instance Segmentation of
  Surgical Instruments
From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments
Britty Baby
Daksh Thapar
Mustafa Chasmai
Tamajit Banerjee
Kunal Dargan
A. Suri
Subhashis Banerjee
Chetan Arora
50
27
0
26 Nov 2022
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Prompt Tuning for Parameter-efficient Medical Image Segmentation
Marc Fischer
Alexander Bartler
Bin Yang
SSeg
31
19
0
16 Nov 2022
Unsupervised Model Adaptation for Source-free Segmentation of Medical
  Images
Unsupervised Model Adaptation for Source-free Segmentation of Medical Images
Serban Stan
Mohammad Rostami
OOD
47
11
0
02 Nov 2022
A semi-supervised Teacher-Student framework for surgical tool detection
  and localization
A semi-supervised Teacher-Student framework for surgical tool detection and localization
Mansoor Ali Teevno
Gilberto Ochoa-Ruiz
Sharib Ali
44
9
0
21 Aug 2022
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided
  Surgical Automation in Laparoscopic Hysterectomy
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy
Ziyi Wang
Bo Lu
Yonghao Long
Fangxun Zhong
T. Cheung
Qi Dou
Yunhui Liu
49
58
0
03 Aug 2022
Dissecting Self-Supervised Learning Methods for Surgical Computer Vision
Dissecting Self-Supervised Learning Methods for Surgical Computer Vision
Sanat Ramesh
V. Srivastav
Deepak Alapatt
Tong Yu
Aditya Murali
...
Saurav Sharma
A. Fleurentin
Georgios Exarchakis
Alexandros Karargyris
N. Padoy
64
43
0
01 Jul 2022
Surgical-VQA: Visual Question Answering in Surgical Scenes using
  Transformer
Surgical-VQA: Visual Question Answering in Surgical Scenes using Transformer
Lalithkumar Seenivasan
Mobarakol Islam
Adithya K. Krishna
Hongliang Ren
MedIm
31
46
0
22 Jun 2022
Free Lunch for Surgical Video Understanding by Distilling
  Self-Supervisions
Free Lunch for Surgical Video Understanding by Distilling Self-Supervisions
Xinpeng Ding
Ziwei Liu
Xuelong Li
50
13
0
19 May 2022
RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room
RGB-D Semantic SLAM for Surgical Robot Navigation in the Operating Room
Cong Gao
Dinesh Rabindran
Omid Mohareri
9
4
0
12 Apr 2022
Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network
  for Surgical Tools Segmentation
Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network for Surgical Tools Segmentation
Ange Lou
Kareem O. Tawfik
X. Yao
Ziteng Liu
J. Noble
43
38
0
29 Mar 2022
Advancing Spiking Neural Networks towards Deep Residual Learning
Advancing Spiking Neural Networks towards Deep Residual Learning
Yifan Hu
Lei Deng
Yujie Wu
Man Yao
Guoqi Li
42
89
0
15 Dec 2021
ST-MTL: Spatio-Temporal Multitask Learning Model to Predict Scanpath
  While Tracking Instruments in Robotic Surgery
ST-MTL: Spatio-Temporal Multitask Learning Model to Predict Scanpath While Tracking Instruments in Robotic Surgery
Mobarakol Islam
V. Vibashan
C. Lim
Hongliang Ren
39
40
0
10 Dec 2021
Real-time Instance Segmentation of Surgical Instruments using Attention
  and Multi-scale Feature Fusion
Real-time Instance Segmentation of Surgical Instruments using Attention and Multi-scale Feature Fusion
Juan Carlos Angeles Ceron
Gilberto Ochoa-Ruiz
Leonardo Chang
Sharib Ali
46
36
0
09 Nov 2021
A Critical Study on the Recent Deep Learning Based Semi-Supervised Video
  Anomaly Detection Methods
A Critical Study on the Recent Deep Learning Based Semi-Supervised Video Anomaly Detection Methods
M. Baradaran
R. Bergevin
44
16
0
02 Nov 2021
Comparative Validation of Machine Learning Algorithms for Surgical
  Workflow and Skill Analysis with the HeiChole Benchmark
Comparative Validation of Machine Learning Algorithms for Surgical Workflow and Skill Analysis with the HeiChole Benchmark
M. Wagner
Beat-Peter Müller-Stich
A. Kisilenko
Duc Tran
P. Heger
...
M. Frankenberg
F. Mathis-Ullrich
Lena Maier-Hein
Stefanie Speidel
S. Bodenstedt
45
71
0
30 Sep 2021
Reducing Annotating Load: Active Learning with Synthetic Images in
  Surgical Instrument Segmentation
Reducing Annotating Load: Active Learning with Synthetic Images in Surgical Instrument Segmentation
Haonan Peng
Shan Lin
Daniel King
Yun-Hsuan Su
Randall Bly
K. Moe
Blake Hannaford
MedIm
54
6
0
07 Aug 2021
LoRA: Low-Rank Adaptation of Large Language Models
LoRA: Low-Rank Adaptation of Large Language Models
J. E. Hu
Yelong Shen
Phillip Wallis
Zeyuan Allen-Zhu
Yuanzhi Li
Shean Wang
Lu Wang
Weizhu Chen
OffRL
AI4TS
AI4CE
ALM
AIMat
223
9,946
0
17 Jun 2021
Emerging Properties in Self-Supervised Vision Transformers
Emerging Properties in Self-Supervised Vision Transformers
Mathilde Caron
Hugo Touvron
Ishan Misra
Hervé Jégou
Julien Mairal
Piotr Bojanowski
Armand Joulin
553
5,920
0
29 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
666
28,659
0
26 Feb 2021
Surgical Visual Domain Adaptation: Results from the MICCAI 2020
  SurgVisDom Challenge
Surgical Visual Domain Adaptation: Results from the MICCAI 2020 SurgVisDom Challenge
Aneeq Zia
Kiran D. Bhattacharyya
Xi Liu
Ziheng Wang
S. Kondo
...
Raabid Hussain
Lena Maier-Hein
Danail Stoyanov
Stefanie Speidel
A. Jarc
49
20
0
26 Feb 2021
Image Compositing for Segmentation of Surgical Tools without Manual
  Annotations
Image Compositing for Segmentation of Surgical Tools without Manual Annotations
Luis C. Garcia-Peraza-Herrera
Lucas Fidon
Claudia DrEttorre
Danail Stoyanov
Tom Vercauteren
Sebastien Ourselin
20
40
0
18 Feb 2021
Gesture Recognition in Robotic Surgery: a Review
Gesture Recognition in Robotic Surgery: a Review
Beatrice van Amsterdam
Matthew J. Clarkson
Danail Stoyanov
73
94
0
29 Jan 2021
Towards a Computed-Aided Diagnosis System in Colonoscopy: Automatic
  Polyp Segmentation Using Convolution Neural Networks
Towards a Computed-Aided Diagnosis System in Colonoscopy: Automatic Polyp Segmentation Using Convolution Neural Networks
P. Brandao
Odysseas Zisimopoulos
E. Mazomenos
G. Ciuti
Jorge Bernal
...
P. Dario
Anastasios Koulaouzidis
A. Arezzo
D. Hawkes
Danail Stoyanov
MedIm
63
64
0
15 Jan 2021
Big Self-Supervised Models Advance Medical Image Classification
Big Self-Supervised Models Advance Medical Image Classification
Shekoofeh Azizi
Basil Mustafa
Fiona Ryan
Zach Beaver
Jan Freyberg
...
Alan Karthikesalingam
Simon Kornblith
Ting-Li Chen
Vivek Natarajan
Mohammad Norouzi
SSL
84
511
0
13 Jan 2021
Deep Learning for Medical Anomaly Detection -- A Survey
Deep Learning for Medical Anomaly Detection -- A Survey
Tharindu Fernando
Harshala Gammulle
Simon Denman
Sridha Sridharan
Clinton Fookes
OOD
41
271
0
04 Dec 2020
Anomaly Detection in Video via Self-Supervised and Multi-Task Learning
Anomaly Detection in Video via Self-Supervised and Multi-Task Learning
Mariana-Iuliana Georgescu
Antonio Bărbălău
Radu Tudor Ionescu
Fahad Shahbaz Khan
Marius Popescu
M. Shah
SSL
62
256
0
15 Nov 2020
U-Net and its variants for medical image segmentation: theory and
  applications
U-Net and its variants for medical image segmentation: theory and applications
N. Siddique
Sidike Paheding
Colin P. Elkin
Vijay Devabhaktuni
SSeg
42
1,061
0
02 Nov 2020
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset
  in gastrointestinal endoscopy
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy
Debesh Jha
Sharib Ali
Krister Emanuelsen
Steven A. Hicks
VajiraThambawita
...
Thomas de Lange
P. Schmidt
H. Johansen
Dag Johansen
Pål Halvorsen
25
111
0
23 Oct 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
340
40,217
0
22 Oct 2020
Medical Image Segmentation Using Deep Learning: A Survey
Medical Image Segmentation Using Deep Learning: A Survey
Risheng Wang
Tao Lei
Xiaogang Du
Yong Wan
Hongying Meng
Asoke K. Nandi
SSeg
OOD
59
554
0
28 Sep 2020
Real-Time Segmentation of Non-Rigid Surgical Tools based on Deep
  Learning and Tracking
Real-Time Segmentation of Non-Rigid Surgical Tools based on Deep Learning and Tracking
Luis C. Garcia-Peraza-Herrera
Wenqi Li
Caspar Gruijthuijsen
A. Devreker
G. Attilakos
Jan Deprest
E. V. Poorten
Danail Stoyanov
Tom Vercauteren
Sébastien Ourselin
23
105
0
07 Sep 2020
Detection and Localization of Robotic Tools in Robot-Assisted Surgery
  Videos Using Deep Neural Networks for Region Proposal and Detection
Detection and Localization of Robotic Tools in Robot-Assisted Surgery Videos Using Deep Neural Networks for Region Proposal and Detection
Duygu Sarikaya
Jason J. Corso
K. Guru
38
202
0
29 Jul 2020
Endo-Sim2Real: Consistency learning-based domain adaptation for
  instrument segmentation
Endo-Sim2Real: Consistency learning-based domain adaptation for instrument segmentation
Manish Sahu
Ronja Strömsdörfer
Anirban Mukhopadhyay
S. Zachow
69
35
0
22 Jul 2020
Real-Time Instrument Segmentation in Robotic Surgery using Auxiliary
  Supervised Deep Adversarial Learning
Real-Time Instrument Segmentation in Robotic Surgery using Auxiliary Supervised Deep Adversarial Learning
Mobarakol Islam
Daniel Anojan Atputharuban
Ravikiran Ramesh
Hongliang Ren
MedIm
112
94
0
22 Jul 2020
Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery
Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery
Emanuele Colleoni
P. J. Eddie Edwards
Danail Stoyanov
MedIm
43
61
0
17 Jul 2020
ISINet: An Instance-Based Approach for Surgical Instrument Segmentation
ISINet: An Instance-Based Approach for Surgical Instrument Segmentation
Cristina González
Laura Bravo-Sánchez
Pablo Arbeláez
47
79
0
10 Jul 2020
Recognition of Instrument-Tissue Interactions in Endoscopic Videos via
  Action Triplets
Recognition of Instrument-Tissue Interactions in Endoscopic Videos via Action Triplets
C. Nwoye
Cristians Gonzalez
Tong Yu
Pietro Mascagni
Didier Mutter
J. Marescaux
N. Padoy
42
77
0
10 Jul 2020
Searching for Efficient Architecture for Instrument Segmentation in
  Robotic Surgery
Searching for Efficient Architecture for Instrument Segmentation in Robotic Surgery
D. Pakhomov
Nassir Navab
13
15
0
08 Jul 2020
Denoising Diffusion Probabilistic Models
Denoising Diffusion Probabilistic Models
Jonathan Ho
Ajay Jain
Pieter Abbeel
DiffM
279
17,550
0
19 Jun 2020
Language Models are Few-Shot Learners
Language Models are Few-Shot Learners
Tom B. Brown
Benjamin Mann
Nick Ryder
Melanie Subbiah
Jared Kaplan
...
Christopher Berner
Sam McCandlish
Alec Radford
Ilya Sutskever
Dario Amodei
BDL
467
41,106
0
28 May 2020
End-to-End Object Detection with Transformers
End-to-End Object Detection with Transformers
Nicolas Carion
Francisco Massa
Gabriel Synnaeve
Nicolas Usunier
Alexander Kirillov
Sergey Zagoruyko
ViT
3DV
PINN
275
12,847
0
26 May 2020
Previous
123
Next