ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2302.06294
30
17

CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection

13 February 2023
C. Nwoye
Tong Yu
Saurav Sharma
Aditya Murali
Deepak Alapatt
Armine Vardazaryan
Kun Yuan
Jonas Hajek
Wolfgang Reiter
Amine Yamlahi
Finn-Henri Smidt
Xiaoyang Zou
G. Zheng
Bruno Oliveira
Helena R. Torres
Satoshi Kondo
Satoshi Kasai
Felix Holm
Ege Ozsoy
Shuangchun Gui
Han Li
Sista Raviteja
R. Sathish
Pranav Poudel
Binod Bhattarai
Ziheng Wang
Guo Rui
Melanie Schellenberg
Joao L. Vilacca
Tobias Czempiel
Zhenkun Wang
Debdoot Sheet
S. Thapa
Max Berniker
Patrick Godau
Pedro Morais
Sudarshan Regmi
T. Tran
J. Fonseca
Jan-Hinrich Nolke
Estevão Lima
E. Vazquez
Lena Maier-Hein
Nassir Navab
Pietro Mascagni
B. Seeliger
Cristians Gonzalez
Didier Mutter
N. Padoy
ArXivPDFHTML
Abstract

Formalizing surgical activities as triplets of the used instruments, actions performed, and target anatomies is becoming a gold standard approach for surgical activity modeling. The benefit is that this formalization helps to obtain a more detailed understanding of tool-tissue interaction which can be used to develop better Artificial Intelligence assistance for image-guided surgery. Earlier efforts and the CholecTriplet challenge introduced in 2021 have put together techniques aimed at recognizing these triplets from surgical footage. Estimating also the spatial locations of the triplets would offer a more precise intraoperative context-aware decision support for computer-assisted intervention. This paper presents the CholecTriplet2022 challenge, which extends surgical action triplet modeling from recognition to detection. It includes weakly-supervised bounding box localization of every visible surgical instrument (or tool), as the key actors, and the modeling of each tool-activity in the form of <instrument, verb, target> triplet. The paper describes a baseline method and 10 new deep learning algorithms presented at the challenge to solve the task. It also provides thorough methodological comparisons of the methods, an in-depth analysis of the obtained results across multiple metrics, visual and procedural challenges; their significance, and useful insights for future research directions and applications in surgery.

View on arXiv
Comments on this paper