ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2304.02643
  4. Cited By
Segment Anything

Segment Anything

5 April 2023
A. Kirillov
Eric Mintun
Nikhila Ravi
Hanzi Mao
Chloe Rolland
Laura Gustafson
Tete Xiao
Spencer Whitehead
Alexander C. Berg
Wan-Yen Lo
Piotr Dollár
Ross B. Girshick
    MLLMVLM
ArXiv (abs)PDFHTML

Papers citing "Segment Anything"

50 / 1,373 papers shown
Title
HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback
HQ-SMem: Video Segmentation and Tracking Using Memory Efficient Object Embedding With Selective Update and Self-Supervised Distillation Feedback
Elham Soltani Kazemi
Imad Eddine Toubal
Gani Rahmon
Jaired Collins
K. Palaniappan
VOS
10
0
0
25 Jul 2025
Adversarial Distribution Matching for Diffusion Distillation Towards Efficient Image and Video Synthesis
Adversarial Distribution Matching for Diffusion Distillation Towards Efficient Image and Video Synthesis
Yanzuo Lu
Yuxi Ren
Xin Xia
Shanchuan Lin
Xing Wang
Xuefeng Xiao
Andy J. Ma
Xiaohua Xie
Jian-Huang Lai
DiffM
7
0
0
24 Jul 2025
Flow Stochastic Segmentation Networks
Flow Stochastic Segmentation Networks
Fabio De Sousa Ribeiro
Omar Todd
Charles Jones
Avinash Kori
Raghav Mehta
Ben Glocker
0
0
0
24 Jul 2025
Q-Former Autoencoder: A Modern Framework for Medical Anomaly Detection
Q-Former Autoencoder: A Modern Framework for Medical Anomaly Detection
Francesco Dalmonte
Emirhan Bayar
Emre Akbas
Mariana-Iuliana Georgescu
ViTMedIm
11
0
0
24 Jul 2025
Object segmentation in the wild with foundation models: application to vision assisted neuro-prostheses for upper limbs
Object segmentation in the wild with foundation models: application to vision assisted neuro-prostheses for upper limbs
Bolutife Atoki
J. Benois-Pineau
Renaud Péteri
Fabien Baldacci
A. Rugy
VLM
0
0
0
24 Jul 2025
Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding
Adaptive Articulated Object Manipulation On The Fly with Foundation Model Reasoning and Part Grounding
Xiaojie Zhang
Yuanfei Wang
Ruihai Wu
Kunqi Xu
Yu Li
Liuyu Xiang
Hao Dong
Zhaofeng He
5
0
0
24 Jul 2025
Synthetic Data Matters: Re-training with Geo-typical Synthetic Labels for Building Detection
Synthetic Data Matters: Re-training with Geo-typical Synthetic Labels for Building Detection
Shuang Song
Yang Tang
R. Qin
0
0
0
22 Jul 2025
Part Segmentation of Human Meshes via Multi-View Human Parsing
Part Segmentation of Human Meshes via Multi-View Human Parsing
James Dickens
Kamyar Hamad
3DH
14
0
0
22 Jul 2025
CLEVER: Stream-based Active Learning for Robust Semantic Perception from Human Instructions
CLEVER: Stream-based Active Learning for Robust Semantic Perception from Human Instructions
Jongseok Lee
Timo Birr
Rudolph Triebel
Tamim Asfour
0
0
0
21 Jul 2025
AutoPartGen: Autogressive 3D Part Generation and Discovery
AutoPartGen: Autogressive 3D Part Generation and Discovery
Minghao Chen
Jianyuan Wang
Roman Shapovalov
Tom Monnier
Hyunyoung Jung
Dilin Wang
Rakesh Ranjan
Iro Laina
Andrea Vedaldi
3DPC
12
0
0
17 Jul 2025
EEG Foundation Models: A Critical Review of Current Progress and Future Directions
EEG Foundation Models: A Critical Review of Current Progress and Future Directions
Gayal Kuruppu
Neeraj Wagh
Y. Varatharajah
7
0
0
15 Jul 2025
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion
Aleksandar Jevtić
Christoph Reich
Felix Wimbauer
Oliver Hahn
Christian Rupprecht
Stefan Roth
Daniel Cremers
13
0
0
08 Jul 2025
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement
Yuqi Liu
Bohao Peng
Zhisheng Zhong
Zihao Yue
Fanbin Lu
Bei Yu
Jiaya Jia
LRMVLM
135
46
0
01 Jul 2025
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark
Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark
Yi Xin
Jianjiang Yang
Haodi Zhou
Junlong Du
Qi Qin
...
Bin Fu
Xiaokang Yang
Guangtao Zhai
Ming-Hsuan Yang
Xiaohong Liu
VLM
190
86
0
01 Jul 2025
Vision Technologies with Applications in Traffic Surveillance Systems: A Holistic Survey
Wei Zhou
Lei Zhao
Lei Zhao
Runyu Zhang
Yifan Cui
Hongpu Huang
Kun Qie
Chen Wang
AI4TS
207
0
0
01 Jul 2025
Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts
Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts
Shiu-hong Kao
Yu-Wing Tai
Chi-Keung Tang
MLLMLRM
304
1
0
01 Jul 2025
3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views
3DRealCar: An In-the-wild RGB-D Car Dataset with 360-degree Views
Xiaobiao Du
Yida Wang
Shuyun Wang
Zhuojie Wu
Hongwei Sheng
...
Jiaying Ying
Tianqing Zhu
Tianqing Zhu
Kun Zhan
Xin Yu
3DPC
100
7
0
01 Jul 2025
Object Retrieval for Visual Question Answering with Outside Knowledge
Object Retrieval for Visual Question Answering with Outside Knowledge
Shichao Kan
Yuhai Deng
Yixiong Liang
Lihui Cen
Zhe Qu
Linna Zhang
Zhihai He
Yigang Cen
97
0
0
01 Jul 2025
Emergent Temporal Correspondences from Video Diffusion Transformers
Emergent Temporal Correspondences from Video Diffusion Transformers
Jisu Nam
Soowon Son
Dahyun Chung
Jiyoung Kim
Siyoon Jin
Junhwa Hur
Seungryong Kim
VGen
54
0
0
20 Jun 2025
Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping
Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping
Teng Guo
Baichuan Huang
Jingjin Yu
42
0
0
20 Jun 2025
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
Teng Li
Quanfeng Lu
Lirui Zhao
Hao Li
X. Zhu
Yu Qiao
Jun Zhang
Wenqi Shao
36
0
0
20 Jun 2025
Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps
Revisiting LoRA through the Lens of Parameter Redundancy: Spectral Encoding Helps
Jiashun Cheng
Aochuan Chen
Nuo Chen
Ziqi Gao
Yuhan Li
Jia Li
Fugee Tsung
29
0
0
20 Jun 2025
With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You
With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You
Fabian Gröger
Shuo Wen
Huyen Le
Maria Brbic
33
0
0
20 Jun 2025
From Lab to Factory: Pitfalls and Guidelines for Self-/Unsupervised Defect Detection on Low-Quality Industrial Images
From Lab to Factory: Pitfalls and Guidelines for Self-/Unsupervised Defect Detection on Low-Quality Industrial Images
Sebastian Hönel
Jonas Nordqvist
29
0
0
20 Jun 2025
Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation
Co-Seg++: Mutual Prompt-Guided Collaborative Learning for Versatile Medical Segmentation
Qing Xu
Yuxiang Luo
Wenting Duan
Zhen Chen
36
0
0
20 Jun 2025
AutoV: Learning to Retrieve Visual Prompt for Large Vision-Language Models
AutoV: Learning to Retrieve Visual Prompt for Large Vision-Language Models
Yuan Zhang
Chun-Kai Fan
Tao Huang
Ming Lu
Sicheng Yu
Junwen Pan
Kuan Cheng
Qi She
Shanghang Zhang
VLMLRM
36
0
0
19 Jun 2025
From Semantic To Instance: A Semi-Self-Supervised Learning Approach
From Semantic To Instance: A Semi-Self-Supervised Learning Approach
Keyhan Najafian
F. Maleki
Lingling Jin
Ian Stavness
ISeg
41
0
0
19 Jun 2025
MBA: Multimodal Bidirectional Attack for Referring Expression Segmentation Models
MBA: Multimodal Bidirectional Attack for Referring Expression Segmentation Models
Xingbai Chen
Tingchao Fu
Renyang Liu
Wei Zhou
Chao Yi
AAML
38
0
0
19 Jun 2025
Segment Anything for Satellite Imagery: A Strong Baseline and a Regional Dataset for Automatic Field Delineation
Segment Anything for Satellite Imagery: A Strong Baseline and a Regional Dataset for Automatic Field Delineation
Carmelo Scribano
Elena Govi
Paolo Bertellini
Simone Parisi
Giorgia Franchini
Marko Bertogna
27
0
0
19 Jun 2025
Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging
Heterogeneous-Modal Unsupervised Domain Adaptation via Latent Space Bridging
Jiawen Yang
Shuhao Chen
Yucong Duan
K. Tang
Yu Zhang
30
0
0
19 Jun 2025
Polyline Path Masked Attention for Vision Transformer
Polyline Path Masked Attention for Vision Transformer
Zhongchen Zhao
Chaodong Xiao
H. Lin
Qi Xie
Lei Zhang
Deyu Meng
Mamba
55
0
0
19 Jun 2025
SynPo: Boosting Training-Free Few-Shot Medical Segmentation via High-Quality Negative Prompts
SynPo: Boosting Training-Free Few-Shot Medical Segmentation via High-Quality Negative Prompts
Yufei Liu
Haoke Xiao
Jiaxing Chai
Yongcun Zhang
Rong Wang
Zijie Meng
Shaozi Li
MedImVLM
27
0
0
18 Jun 2025
Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos
Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos
Kaifeng Zhang
Baoyu Li
Kris Hauser
Yunzhu Li
AI4CE
56
0
0
18 Jun 2025
BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion
BoxFusion: Reconstruction-Free Open-Vocabulary 3D Object Detection via Real-Time Multi-View Box Fusion
Yuqing Lan
Chenyang Zhu
Zhirui Gao
JIazhao Zhang
Yihan Cao
Renjiao Yi
Yijie Wang
Kai Xu
3DPC
68
0
0
18 Jun 2025
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models
GenRecal: Generation after Recalibration from Large to Small Vision-Language Models
Byung-Kwan Lee
Ryo Hachiuma
Yong Man Ro
Yu-Chun Wang
Yueh-Hua Wu
VLM
55
0
0
18 Jun 2025
GRIM: Task-Oriented Grasping with Conditioning on Generative Examples
GRIM: Task-Oriented Grasping with Conditioning on Generative Examples
Shailesh
Alok Raj
Nayan Kumar
Priya Shukla
Andrew Melnik
Micheal Beetz
G. C. Nandi
54
0
0
18 Jun 2025
MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System
MCOO-SLAM: A Multi-Camera Omnidirectional Object SLAM System
Miaoxin Pan
Jinnan Li
Yaowen Zhang
Yi Yang
Yufeng Yue
29
0
0
18 Jun 2025
BCRNet: Enhancing Landmark Detection in Laparoscopic Liver Surgery via Bezier Curve Refinement
BCRNet: Enhancing Landmark Detection in Laparoscopic Liver Surgery via Bezier Curve Refinement
Qian Li
Feng Liu
Shuojue Yang
Daiyun Shen
Yueming Jin
MedIm
31
0
0
18 Jun 2025
Efficient Retail Video Annotation: A Robust Key Frame Generation Approach for Product and Customer Interaction Analysis
Efficient Retail Video Annotation: A Robust Key Frame Generation Approach for Product and Customer Interaction Analysis
Varun Mannam
Zhenyu Shi
30
0
0
17 Jun 2025
Brain Imaging Foundation Models, Are We There Yet? A Systematic Review of Foundation Models for Brain Imaging and Biomedical Research
Brain Imaging Foundation Models, Are We There Yet? A Systematic Review of Foundation Models for Brain Imaging and Biomedical Research
Salah Ghamizi
G. Kanli
Yu Deng
Magali Perquin
O. Keunen
MedImAI4CE
46
0
0
16 Jun 2025
Scaling Algorithm Distillation for Continuous Control with Mamba
Scaling Algorithm Distillation for Continuous Control with Mamba
Samuel Beaussant
Mehdi Mounsif
38
0
0
16 Jun 2025
Anomaly Object Segmentation with Vision-Language Models for Steel Scrap Recycling
Anomaly Object Segmentation with Vision-Language Models for Steel Scrap Recycling
Daichi Tanaka
Takumi Karasawa
Shu Takenouchi
Rei Kawakami
33
0
0
16 Jun 2025
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects
A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects
Guohuan Xie
Syed Ariff Syed Hesham
Wenya Guo
Bing Li
Ming-Ming Cheng
Guolei Sun
Yun-Hai Liu
39
0
0
16 Jun 2025
Unleashing Diffusion and State Space Models for Medical Image Segmentation
Unleashing Diffusion and State Space Models for Medical Image Segmentation
Rong Wu
Ziqi Chen
Liming Zhong
Heng Li
Hai Shu
MedIm
50
0
0
15 Jun 2025
Benchmarking Image Similarity Metrics for Novel View Synthesis Applications
Benchmarking Image Similarity Metrics for Novel View Synthesis Applications
Charith Wickrema
Sara Leary
Shivangi Sarkar
Mark Giglio
Eric Bianchi
Eliza Mace
Michael Twardowski
28
0
0
14 Jun 2025
Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling
Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling
Yunhan Ren
Ruihuang Li
Lingbo Liu
Changwen Chen
32
0
0
13 Jun 2025
OV-MAP : Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots
OV-MAP : Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots
Juno Kim
Yesol Park
Hye Jung Yoon
Byoung-Tak Zhang
78
0
0
13 Jun 2025
SPLATART: Articulated Gaussian Splatting with Estimated Object Structure
SPLATART: Articulated Gaussian Splatting with Estimated Object Structure
Stanley Lewis
Vishal Chandra
Tom Gao
Odest Chadwicke Jenkins
26
0
0
13 Jun 2025
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection
Semantic-decoupled Spatial Partition Guided Point-supervised Oriented Object Detection
Xinyuan Liu
Hang Xu
Yike Ma
Yucheng Zhang
Feng Dai
116
0
0
12 Jun 2025
Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement
Uncertainty-Masked Bernoulli Diffusion for Camouflaged Object Detection Refinement
Yuqi Shen
Fengyang Xiao
Sujie Hu
Youwei Pang
Yifan Pu
Chengyu Fang
Xiu Li
Chunming He
112
0
0
12 Jun 2025
1234...262728
Next