ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2403.05949
  4. Cited By
General surgery vision transformer: A video pre-trained foundation model
  for general surgery
v1v2v3 (latest)

General surgery vision transformer: A video pre-trained foundation model for general surgery

9 March 2024
Samuel Schmidgall
Ji Woong Kim
Jeffery Jopling
Axel Krieger
    ViTMedIm
ArXiv (abs)PDFHTML

Papers citing "General surgery vision transformer: A video pre-trained foundation model for general surgery"

22 / 22 papers shown
Title
Addressing cognitive bias in medical language models
Addressing cognitive bias in medical language models
Samuel Schmidgall
Carl Harris
Ime Essien
Daniel Olshvang
Tawsifur Rahman
Ji Woong Kim
Rojin Ziaei
Jason K. Eshraghian
Peter M Abadir
Rama Chellappa
ELM
71
26
0
12 Feb 2024
General-purpose foundation models for increased autonomy in
  robot-assisted surgery
General-purpose foundation models for increased autonomy in robot-assisted surgery
Samuel Schmidgall
Ji Woong Kim
Alan Kuntz
A. Ghazi
Axel Krieger
MedIm
86
14
0
01 Jan 2024
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case
  Study in Medicine
Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine
Harsha Nori
Yin Tat Lee
Sheng Zhang
Dean Carignan
Richard Edgar
...
Hoifung Poon
Tao Qin
Naoto Usuyama
Chris White
Eric Horvitz
LM&MAAI4MHMedImELM
88
323
0
28 Nov 2023
Language models are susceptible to incorrect patient self-diagnosis in
  medical applications
Language models are susceptible to incorrect patient self-diagnosis in medical applications
Rojin Ziaei
Samuel Schmidgall
ELMLM&MA
64
9
0
17 Sep 2023
LLaVA-Med: Training a Large Language-and-Vision Assistant for
  Biomedicine in One Day
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day
Chunyuan Li
Cliff Wong
Sheng Zhang
Naoto Usuyama
Haotian Liu
Jianwei Yang
Tristan Naumann
Hoifung Poon
Jianfeng Gao
LM&MAMedIm
118
792
0
01 Jun 2023
LoViT: Long Video Transformer for Surgical Phase Recognition
LoViT: Long Video Transformer for Surgical Phase Recognition
Yang Liu
Maxence Boels
Luis C. Garcia-Peraza-Herrera
Tom Vercauteren
P. Dasgupta
Alejandro Granados
Sebastien Ourselin
89
35
0
15 May 2023
EfficientViT: Memory Efficient Vision Transformer with Cascaded Group
  Attention
EfficientViT: Memory Efficient Vision Transformer with Cascaded Group Attention
Xinyu Liu
Houwen Peng
Ningxin Zheng
Yuqing Yang
Han Hu
Yixuan Yuan
ViT
74
305
0
11 May 2023
Whether and When does Endoscopy Domain Pretraining Make Sense?
Whether and When does Endoscopy Domain Pretraining Make Sense?
Dominik Batić
Felix Holm
Ege Özsoy
Tobias Czempiel
Nassir Navab
22
7
0
30 Mar 2023
ViT-AE++: Improving Vision Transformer Autoencoder for Self-supervised
  Medical Image Representations
ViT-AE++: Improving Vision Transformer Autoencoder for Self-supervised Medical Image Representations
Chinmay Prabhakar
Hongwei Bran Li
Jiancheng Yang
Suprosana Shit
Benedikt Wiestler
Bjoern Menze
ViTMedIm
62
11
0
18 Jan 2023
MaskViT: Masked Visual Pre-Training for Video Prediction
MaskViT: Masked Visual Pre-Training for Video Prediction
Agrim Gupta
Stephen Tian
Yunzhi Zhang
Jiajun Wu
Roberto Martín-Martín
Li Fei-Fei
169
120
0
23 Jun 2022
SimVP: Simpler yet Better Video Prediction
SimVP: Simpler yet Better Video Prediction
Zhangyang Gao
Cheng Tan
Lirong Wu
Stan Z. Li
93
219
0
09 Jun 2022
Large Language Models are Few-Shot Clinical Information Extractors
Large Language Models are Few-Shot Clinical Information Extractors
Monica Agrawal
S. Hegselmann
Hunter Lang
Yoon Kim
David Sontag
BDLLM&MA
238
346
0
25 May 2022
GLiT: Neural Architecture Search for Global and Local Image Transformer
GLiT: Neural Architecture Search for Global and Local Image Transformer
Boyu Chen
Peixia Li
Chuming Li
Baopu Li
Lei Bai
Chen Lin
Ming Sun
Junjie Yan
Wanli Ouyang
ViT
76
86
0
07 Jul 2021
TransUNet: Transformers Make Strong Encoders for Medical Image
  Segmentation
TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation
Jieneng Chen
Yongyi Lu
Qihang Yu
Xiangde Luo
Ehsan Adeli
Yan Wang
Le Lu
Alan Yuille
Yuyin Zhou
ViTMedIm
98
3,497
0
08 Feb 2021
CholecSeg8k: A Semantic Segmentation Dataset for Laparoscopic
  Cholecystectomy Based on Cholec80
CholecSeg8k: A Semantic Segmentation Dataset for Laparoscopic Cholecystectomy Based on Cholec80
W.-Y. Hong
Chang-Lung Kao
Y.-H. Kuo
J.-R. Wang
Wanxing Chang
C.-S. Shih
48
103
0
23 Dec 2020
TeCNO: Surgical Phase Recognition with Multi-Stage Temporal
  Convolutional Networks
TeCNO: Surgical Phase Recognition with Multi-Stage Temporal Convolutional Networks
Tobias Czempiel
Magdalini Paschali
Matthias Keicher
Walter Simson
H. Feußner
S. T. Kim
Nassir Navab
77
186
0
24 Mar 2020
2018 Robotic Scene Segmentation Challenge
2018 Robotic Scene Segmentation Challenge
M. Allan
S. Kondo
S. Bodenstedt
S. Leger
Rahim Kadkhodamohammadi
...
Sang Hyun Park
M. Azizian
Danail Stoyanov
Lena Maier-Hein
Stefanie Speidel
66
135
0
30 Jan 2020
Multi-Task Recurrent Convolutional Network with Correlation Loss for
  Surgical Video Analysis
Multi-Task Recurrent Convolutional Network with Correlation Loss for Surgical Video Analysis
Yueming Jin
Huaxia Li
Qi Dou
Hao Chen
J. Qin
Chi-Wing Fu
Pheng-Ann Heng
69
177
0
13 Jul 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DVMedIm
144
18,168
0
28 May 2019
Layer Normalization
Layer Normalization
Jimmy Lei Ba
J. Kiros
Geoffrey E. Hinton
422
10,526
0
21 Jul 2016
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic
  Videos
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos
A. P. Twinanda
S. Shehata
Didier Mutter
J. Marescaux
M. de Mathelin
N. Padoy
241
864
0
09 Feb 2016
Fast and Accurate Deep Network Learning by Exponential Linear Units
  (ELUs)
Fast and Accurate Deep Network Learning by Exponential Linear Units (ELUs)
Djork-Arné Clevert
Thomas Unterthiner
Sepp Hochreiter
305
5,532
0
23 Nov 2015
1