ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.18325
  4. Cited By
Towards Training-free Anomaly Detection with Vision and Language Foundation Models

Towards Training-free Anomaly Detection with Vision and Language Foundation Models

24 March 2025
Jinjin Zhang
Guodong Wang
Yizhou Jin
Di Huang
ArXivPDFHTML

Papers citing "Towards Training-free Anomaly Detection with Vision and Language Foundation Models"

9 / 9 papers shown
Title
LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection
LAD-Reasoner: Tiny Multimodal Models are Good Reasoners for Logical Anomaly Detection
Weijia Li
Guanglei Chu
Jiong Chen
Guo-Sen Xie
Caifeng Shan
Fang Zhao
LRM
64
1
0
17 Apr 2025
Few Shot Part Segmentation Reveals Compositional Logic for Industrial
  Anomaly Detection
Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection
Soopil Kim
Sion An
Philip Chikontwe
Myeongkyun Kang
Ehsan Adeli
K. Pohl
Sanghyun Park
57
17
0
21 Dec 2023
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
473
13,788
0
15 Mar 2023
When and why vision-language models behave like bags-of-words, and what
  to do about it?
When and why vision-language models behave like bags-of-words, and what to do about it?
Mert Yuksekgonul
Federico Bianchi
Pratyusha Kalluri
Dan Jurafsky
James Zou
VLM
CoGe
54
378
0
04 Oct 2022
Winoground: Probing Vision and Language Models for Visio-Linguistic
  Compositionality
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality
Tristan Thrush
Ryan Jiang
Max Bartolo
Amanpreet Singh
Adina Williams
Douwe Kiela
Candace Ross
CoGe
80
413
0
07 Apr 2022
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models
Jason W. Wei
Xuezhi Wang
Dale Schuurmans
Maarten Bosma
Brian Ichter
F. Xia
Ed H. Chi
Quoc Le
Denny Zhou
LM&Ro
LRM
AI4CE
ReLM
570
9,009
0
28 Jan 2022
Learning Transferable Visual Models From Natural Language Supervision
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
666
28,659
0
26 Feb 2021
PaDiM: a Patch Distribution Modeling Framework for Anomaly Detection and
  Localization
PaDiM: a Patch Distribution Modeling Framework for Anomaly Detection and Localization
Thomas Defard
Aleksandr Setkov
Angélique Loesch
Romaric Audigier
UQCV
57
827
0
17 Nov 2020
An Image is Worth 16x16 Words: Transformers for Image Recognition at
  Scale
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale
Alexey Dosovitskiy
Lucas Beyer
Alexander Kolesnikov
Dirk Weissenborn
Xiaohua Zhai
...
Matthias Minderer
G. Heigold
Sylvain Gelly
Jakob Uszkoreit
N. Houlsby
ViT
317
40,217
0
22 Oct 2020
1