arXiv: 2503.13847
Disentangling Fine-Tuning from Pre-Training in Visual Captioning with Hybrid Markov Logic

18 March 2025
Monika Shah
Somdeb Sarkhel
Deepak Venugopal
    MLLM
    BDL
    VLM

Papers citing "Disentangling Fine-Tuning from Pre-Training in Visual Captioning with Hybrid Markov Logic"

21 / 21 papers shown
Knowledge Acquisition Disentanglement for Knowledge-based Visual Question Answering with Large Language Models
Wenbin An
Feng Tian
Jiahao Nie
Wenkai Shi
Haonan Lin
Yan Chen
Qianying Wang
Y. Wu
Guang Dai
Ping Chen
VLM
79
4
0
22 Jul 2024
On the verification of Embeddings using Hybrid Markov Logic
Anup Shakya
Abisha Thapa Magar
Somdeb Sarkhel
Deepak Venugopal
29
2
0
13 Dec 2023
CLAIR: Evaluating Image Captions with Large Language Models
David M. Chan
Suzanne Petryk
Joseph E. Gonzalez
Trevor Darrell
John F. Canny
59
20
0
19 Oct 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
401
4,508
0
30 Jan 2023
A Baseline for Detecting Out-of-Distribution Examples in Image Captioning
Gabi Shalev
Gal-Lev Shalev
Joseph Keshet
OODD
52
7
0
12 Jul 2022
OPT: Open Pre-trained Transformer Language Models
Susan Zhang
Stephen Roller
Naman Goyal
Mikel Artetxe
Moya Chen
...
Daniel Simig
Punit Singh Koura
Anjali Sridhar
Tianlu Wang
Luke Zettlemoyer
VLM
OSLM
AI4CE
292
3,634
0
02 May 2022
CLIPScore: A Reference-free Evaluation Metric for Image Captioning
Jack Hessel
Ari Holtzman
Maxwell Forbes
Ronan Le Bras
Yejin Choi
CLIP
117
1,545
0
18 Apr 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
808
29,167
0
26 Feb 2021
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
426
1,120
0
17 Feb 2021
X-Linear Attention Networks for Image Captioning
Yingwei Pan
Ting Yao
Yehao Li
Tao Mei
92
510
0
31 Mar 2020
Meshed-Memory Transformer for Image Captioning
Marcella Cornia
Matteo Stefanini
Lorenzo Baraldi
Rita Cucchiara
59
874
0
17 Dec 2019
Attention on Attention for Image Captioning
Lun Huang
Wenmin Wang
Jie Chen
Xiao-Yong Wei
56
829
0
19 Aug 2019
VIFIDEL: Evaluating the Visual Fidelity of Image Descriptions
Pranava Madhyastha
Josiah Wang
Lucia Specia
32
32
0
22 Jul 2019
Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting
Aditya Grover
Jiaming Song
Alekh Agarwal
Kenneth Tran
Ashish Kapoor
Eric Horvitz
Stefano Ermon
51
124
0
23 Jun 2019
Auto-Encoding Scene Graphs for Image Captioning
Xu Yang
Kaihua Tang
Hanwang Zhang
Jianfei Cai
140
698
0
06 Dec 2018
Explanation in Artificial Intelligence: Insights from the Social Sciences
Tim Miller
XAI
236
4,249
0
22 Jun 2017
SPICE: Semantic Propositional Image Caption Evaluation
Peter Anderson
Basura Fernando
Mark Johnson
Stephen Gould
EGVM
84
1,909
0
29 Jul 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
194
5,726
0
23 Feb 2016
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Shaoqing Ren
Kaiming He
Ross B. Girshick
Jian Sun
AIMat
ObjD
461
62,122
0
04 Jun 2015
Hinge-Loss Markov Random Fields and Probabilistic Soft Logic
Stephen H. Bach
Matthias Broecheler
Bert Huang
Lise Getoor
TPM
AI4CE
89
386
0
17 May 2015
Semi-Supervised Learning with Deep Generative Models
Diederik P. Kingma
Danilo Jimenez Rezende
S. Mohamed
Max Welling
GAN
SSL
BDL
83
2,738
0
20 Jun 2014