ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.02643
  4. Cited By
Segment Anything

Segment Anything

5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
    MLLM
    VLM
ArXivPDFHTML

Papers citing "Segment Anything"

50 / 4,200 papers shown
Title
MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation
MARS: a Multimodal Alignment and Ranking System for Few-Shot Segmentation
Nico Catalano
Stefano Samele
Paolo Pertino
Matteo Matteucci
3DPC
53
0
0
10 Apr 2025
FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation
FlexIP: Dynamic Control of Preservation and Personality for Customized Image Generation
Linyan Huang
Haonan Lin
Yanning Zhou
Kaiwen Xiao
49
0
0
10 Apr 2025
VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding
VideoExpert: Augmented LLM for Temporal-Sensitive Video Understanding
Henghao Zhao
Ge-Peng Ji
Rui Yan
Huan Xiong
Zechao Li
29
0
0
10 Apr 2025
HoloPart: Generative 3D Part Amodal Segmentation
HoloPart: Generative 3D Part Amodal Segmentation
Yanting Yang
Yu Guo
Yukun Huang
Zi-Xin Zou
Zhipeng Yu
Yangguang Li
Yan-Pei Cao
Xihui Liu
DiffM
50
1
0
10 Apr 2025
Towards Unconstrained 2D Pose Estimation of the Human Spine
Towards Unconstrained 2D Pose Estimation of the Human Spine
Muhammad Gul Zain Ali Khan
Stephan Krauß
Didier Stricker
3DH
61
0
0
10 Apr 2025
A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology
A Comparison of Deep Learning Methods for Cell Detection in Digital Cytology
Marco Acerbis
Natasa Sladoje
Joakim Lindblad
32
0
0
09 Apr 2025
MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning
MultiADS: Defect-aware Supervision for Multi-type Anomaly Detection and Segmentation in Zero-Shot Learning
Ylli Sadikaj
Hongkuan Zhou
Lavdim Halilaj
Stefan Schmid
Steffen Staab
Claudia Plant
28
0
0
09 Apr 2025
RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration
RayFronts: Open-Set Semantic Ray Frontiers for Online Scene Understanding and Exploration
Omar Alama
A. Bhattacharya
Haoyang He
Seungchan Kim
Yuheng Qiu
Wenshan Wang
Cherie Ho
Nikhil Varma Keetha
Sebastian A. Scherer
33
0
0
09 Apr 2025
Domain Generalization through Attenuation of Domain-Specific Information
Domain Generalization through Attenuation of Domain-Specific Information
Reiji Saito
Kazuhiro Hotta
33
0
0
09 Apr 2025
DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning
DUKAE: DUal-level Knowledge Accumulation and Ensemble for Pre-Trained Model-Based Continual Learning
Songze Li
Tonghua Su
Xu-Yao Zhang
Qixing Xu
Zhongjie Wang
CLL
43
0
0
09 Apr 2025
Human-like compositional learning of visually-grounded concepts using synthetic environments
Human-like compositional learning of visually-grounded concepts using synthetic environments
Zijun Lin
M Ganesh Kumar
Cheston Tan
OCL
CoGe
80
0
0
09 Apr 2025
Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting
Wheat3DGS: In-field 3D Reconstruction, Instance Segmentation and Phenotyping of Wheat Heads with Gaussian Splatting
Daiwei Zhang
Joaquin Gajardo
Tomislav Medic
Isinsu Katircioglu
Mike Boss
Norbert Kirchgessner
Achim Walter
Lukas Roth
34
0
0
09 Apr 2025
Are We Done with Object-Centric Learning?
Are We Done with Object-Centric Learning?
Alexander Rubinstein
Ameya Prabhu
Matthias Bethge
Seong Joon Oh
OCL
739
0
0
09 Apr 2025
MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking
MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking
Chang Nie
Yiqing Xu
Guangming Wang
Zhe Liu
Yanzi Miao
Hesheng Wang
VLM
41
0
0
09 Apr 2025
Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception
Patch Matters: Training-free Fine-grained Image Caption Enhancement via Local Perception
Ruotian Peng
Haiying He
Yake Wei
Yandong Wen
D. Hu
VLM
41
0
0
09 Apr 2025
ASHiTA: Automatic Scene-grounded HIerarchical Task Analysis
ASHiTA: Automatic Scene-grounded HIerarchical Task Analysis
Yun Chang
Leonor Fermoselle
Duy Ta
Bernadette Bucher
Luca Carlone
Jiuguang Wang
42
0
0
09 Apr 2025
PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering
PosterMaker: Towards High-Quality Product Poster Generation with Accurate Text Rendering
Y. Gao
Zihang Lin
Chuanbin Liu
Min Zhou
T. Ge
Bo Zheng
Hongtao Xie
DiffM
45
0
0
09 Apr 2025
MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs
MedSegFactory: Text-Guided Generation of Medical Image-Mask Pairs
Jiawei Mao
Yucheng Wang
Yucheng Tang
Daguang Xu
Kang Wang
Yang Yang
Zongwei Zhou
Yuyin Zhou
MedIm
29
0
0
09 Apr 2025
GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes
GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes
S. Back
J. Lee
Kangmin Kim
Heeseon Rho
Geonhyup Lee
...
S. Lee
Sangjun Noh
Youngjin Lee
Taeyeop Lee
K. Lee
3DV
51
0
0
09 Apr 2025
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding
Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding
Pedro Hermosilla
Christian Stippel
Leon Sick
SSL
3DPC
84
0
0
09 Apr 2025
KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection
KAN-SAM: Kolmogorov-Arnold Network Guided Segment Anything Model for RGB-T Salient Object Detection
Xingyuan Li
Ruichao Hou
Tongwei Ren
Gangshan Wu
29
0
0
08 Apr 2025
Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation
Transferable Mask Transformer: Cross-domain Semantic Segmentation with Region-adaptive Transferability Estimation
Enming Zhang
Zhu Li
Yanru Wu
Jun Wang
Yang Tan
Ruizhe Zhao
Guan Wang
Yang Li
ViT
43
0
0
08 Apr 2025
PromptHMR: Promptable Human Mesh Recovery
PromptHMR: Promptable Human Mesh Recovery
Yufu Wang
Yu Sun
Priyanka Patel
Kostas Daniilidis
Michael J. Black
Muhammed Kocabas
3DH
64
0
0
08 Apr 2025
HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling
HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling
Qing Xu
Zhenye Lou
Chenxin Li
Xiangjian He
Rong Qu
Tesema Fiseha Berhanu
Yi Wang
Wenting Duan
Zhen Chen
MedIm
38
0
0
08 Apr 2025
On the Importance of Conditioning for Privacy-Preserving Data Augmentation
On the Importance of Conditioning for Privacy-Preserving Data Augmentation
Julian Lorenz
K. Ludwig
Valentin Haug
Rainer Lienhart
DiffM
45
0
0
08 Apr 2025
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
TAPNext: Tracking Any Point (TAP) as Next Token Prediction
Artem Zholus
Carl Doersch
Yi Yang
Skanda Koppula
Viorica Patraucean
Xu He
Ignacio Rocco
Mehdi S. M. Sajjadi
Sarath Chandar
Ross Goroshin
40
0
0
08 Apr 2025
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation
Xiaoxing Hu
Ziyang Gong
Yansen Wang
Yuru Jia
Gen Luo
Xue Yang
184
0
0
08 Apr 2025
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
Yuandong Pu
Le Zhuo
Kaiwen Zhu
Liangbin Xie
Wenlong Zhang
Xiangyu Chen
Peng Gao
Yu Qiao
Chao Dong
Yihao Liu
MLLM
71
1
0
07 Apr 2025
InteractVLM: 3D Interaction Reasoning from 2D Foundational Models
InteractVLM: 3D Interaction Reasoning from 2D Foundational Models
Sai Kumar Dwivedi
Dimitrije Antić
Shashank Tripathi
Omid Taheri
Cordelia Schmid
M. Black
Dimitrios Tzionas
45
1
0
07 Apr 2025
TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment
TactileNet: Bridging the Accessibility Gap with AI-Generated Tactile Graphics for Individuals with Vision Impairment
Adnan Khan
Alireza Choubineh
Mai A. Shaaban
Abbas Akkasi
Majid Komeili
DiffM
40
0
0
07 Apr 2025
Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection
Inverse++: Vision-Centric 3D Semantic Occupancy Prediction Assisted with 3D Object Detection
Zhenxing Ming
J. S. Berrio
Mao Shan
Stewart Worrall
3DPC
53
2
0
07 Apr 2025
DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
DiCoTTA: Domain-invariant Learning for Continual Test-time Adaptation
Sohyun Lee
N. Kim
Juwon Kang
Seong Joon Oh
Suha Kwak
99
0
0
07 Apr 2025
Prior2Former -- Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation
Prior2Former -- Evidential Modeling of Mask Transformers for Assumption-Free Open-World Panoptic Segmentation
Sebastian Schmidt
Julius Körner
Dominik Fuchsgruber
Stefano Gasperini
F. Tombari
Stephan Günnemann
31
0
0
07 Apr 2025
DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal
DeclutterNeRF: Generative-Free 3D Scene Recovery for Occlusion Removal
Wanzhou Liu
Zhexiao Xiong
Xinyu Li
Nathan Jacobs
43
0
0
07 Apr 2025
URECA: Unique Region Caption Anything
URECA: Unique Region Caption Anything
Sangbeom Lim
J. Kim
Heeji Yoon
Jaewoo Jung
Seungryong Kim
43
0
0
07 Apr 2025
CMaP-SAM: Contraction Mapping Prior for SAM-driven Few-shot Segmentation
CMaP-SAM: Contraction Mapping Prior for SAM-driven Few-shot Segmentation
Shuai Chen
Fanman Meng
Haoran Wei
Chenhao Wu
Qingbo Wu
Linfeng Xu
Haoyang Li
37
0
0
07 Apr 2025
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
Mengchao Wang
Qiang Wang
Fan Jiang
Yaqi Fan
Yunpeng Zhang
Yonggang Qi
Kun Zhao
Mu Xu
DiffM
VGen
41
0
0
07 Apr 2025
Embodied Perception for Test-time Grasping Detection Adaptation with Knowledge Infusion
Embodied Perception for Test-time Grasping Detection Adaptation with Knowledge Infusion
Jin Liu
Jialong Xie
Leibing Xiao
Chaoqun Wang
Fengyu Zhou
30
0
0
07 Apr 2025
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
Studying Image Diffusion Features for Zero-Shot Video Object Segmentation
Thanos Delatolas
Vicky S. Kalogeiton
Dim P. Papadopoulos
DiffM
VOS
57
1
0
07 Apr 2025
Playing Non-Embedded Card-Based Games with Reinforcement Learning
Playing Non-Embedded Card-Based Games with Reinforcement Learning
Tianyang Wu
Lipeng Wan
Yuhang Wang
Qiang Wan
Xuguang Lan
OffRL
32
0
0
07 Apr 2025
LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts
LEO-MINI: An Efficient Multimodal Large Language Model using Conditional Token Reduction and Mixture of Multi-Modal Experts
Yimu Wang
Mozhgan Nasr Azadani
Sean Sedwards
Krzysztof Czarnecki
MLLM
MoE
57
0
0
07 Apr 2025
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
S^4M: Boosting Semi-Supervised Instance Segmentation with SAM
Heeji Yoon
Heeseong Shin
Eunbeen Hong
Hyunwook Choi
Hansang Cho
Daun Jeong
Seungryong Kim
33
0
0
07 Apr 2025
SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation
SAM2MOT: A Novel Paradigm of Multi-Object Tracking by Segmentation
Junjie Jiang
Zelin Wang
Manqi Zhao
Yin Li
Dongsheng Jiang
48
0
0
06 Apr 2025
PRISM: Probabilistic Representation for Integrated Shape Modeling and Generation
PRISM: Probabilistic Representation for Integrated Shape Modeling and Generation
Lei Cheng
Mahdi Saleh
Qing Cheng
Lu Sang
Hongli Xu
Daniel Cremers
F. Tombari
25
0
0
06 Apr 2025
Targetless LiDAR-Camera Calibration with Anchored 3D Gaussians
Targetless LiDAR-Camera Calibration with Anchored 3D Gaussians
Haebeom Jung
Namtae Kim
Jungwoo Kim
Jaesik Park
3DGS
186
0
0
06 Apr 2025
The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?
The Point, the Vision and the Text: Does Point Cloud Boost Spatial Reasoning of Large Language Models?
Weichen Zhang
Ruiying Peng
Chen Gao
Jianjie Fang
Xin Zeng
...
Ziyi Wang
Jinqiang Cui
Xin Wang
Xinlei Chen
Yongqian Li
LRM
81
0
0
06 Apr 2025
UCS: A Universal Model for Curvilinear Structure Segmentation
UCS: A Universal Model for Curvilinear Structure Segmentation
Dianshuo Li
Li Chen
Yuhang Cao
Kai Zhu
Jun Cheng
45
0
0
05 Apr 2025
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
DocSAM: Unified Document Image Segmentation via Query Decomposition and Heterogeneous Mixed Learning
Xiao-Hui Li
Fei Yin
Cheng-Lin Liu
49
0
0
05 Apr 2025
Resilience of Vision Transformers for Domain Generalisation in the Presence of Out-of-Distribution Noisy Images
Resilience of Vision Transformers for Domain Generalisation in the Presence of Out-of-Distribution Noisy Images
Hamza Riaz
Alan F. Smeaton
41
0
0
05 Apr 2025
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments
Jianhao Zheng
Zihan Zhu
Valentin Bieri
Marc Pollefeys
Songyou Peng
Iro Armeni
3DGS
31
0
0
04 Apr 2025
Previous
123...567...828384
Next