ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2004.03708
  4. Cited By
Context-Aware Group Captioning via Self-Attention and Contrastive
  Features

Context-Aware Group Captioning via Self-Attention and Contrastive Features

Computer Vision and Pattern Recognition (CVPR), 2020
7 April 2020
Zhuowan Li
Quan Hung Tran
Long Mai
Zhe Lin
Alan Yuille
    VLM
ArXiv (abs)PDFHTML

Papers citing "Context-Aware Group Captioning via Self-Attention and Contrastive Features"

21 / 21 papers shown
Group-based Distinctive Image Captioning with Memory Difference Encoding and Attention
Group-based Distinctive Image Captioning with Memory Difference Encoding and AttentionInternational Journal of Computer Vision (IJCV), 2024
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
445
2
0
03 Apr 2025
ImageSet2Text: Describing Sets of Images through Text
ImageSet2Text: Describing Sets of Images through Text
Piera Riccio
F. Galati
Kajetan Schweighofer
Noa Garcia
Nuria Oliver
VLMCoGe
527
1
0
25 Mar 2025
Maybe you are looking for CroQS: Cross-modal Query Suggestion for
  Text-to-Image Retrieval
Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval
Giacomo Pacini
F. Carrara
Nicola Messina
Nicola Tonellotto
Giuseppe Amato
Fabrizio Falchi
282
1
0
18 Dec 2024
Semantic Alignment for Multimodal Large Language Models
Semantic Alignment for Multimodal Large Language ModelsACM Multimedia (MM), 2024
Tao Wu
Mengze Li
Jingyuan Chen
Wei Ji
Wang Lin
Jinyang Gao
Kun Kuang
Zhou Zhao
Fei Wu
275
20
0
23 Aug 2024
Towards More Faithful Natural Language Explanation Using Multi-Level
  Contrastive Learning in VQA
Towards More Faithful Natural Language Explanation Using Multi-Level Contrastive Learning in VQA
Chengen Lai
Shengli Song
Shiqi Meng
Jingyang Li
Sitong Yan
Guangneng Hu
235
11
0
21 Dec 2023
Visual Commonsense based Heterogeneous Graph Contrastive Learning
Visual Commonsense based Heterogeneous Graph Contrastive Learning
Zongzhao Li
Xiangyu Zhu
Xi Zhang
Zhaoxiang Zhang
Zhen Lei
218
1
0
11 Nov 2023
COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face
  Forgery Detection
COMICS: End-to-end Bi-grained Contrastive Learning for Multi-face Forgery Detection
Cong Zhang
H. Qi
Shuhui Wang
Yuezun Li
Siwei Lyu
CVBM
242
14
0
03 Aug 2023
Improving Reference-based Distinctive Image Captioning with Contrastive
  Rewards
Improving Reference-based Distinctive Image Captioning with Contrastive Rewards
Yangjun Mao
Jun Xiao
Dong Zhang
Meng Cao
Jian Shao
Yueting Zhuang
Long Chen
EGVM
217
10
0
25 Jun 2023
Neighborhood Contrastive Transformer for Change Captioning
Neighborhood Contrastive Transformer for Change CaptioningIEEE transactions on multimedia (IEEE TMM), 2023
Yunbin Tu
Liang Li
Li Su
Kelvin Lu
Qin Huang
ViT
195
29
0
06 Mar 2023
Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image
  Captioning
Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image CaptioningComputer Vision and Image Understanding (CVIU), 2023
Jingqiang Chen
308
7
0
04 Feb 2023
Rethinking the Reference-based Distinctive Image Captioning
Rethinking the Reference-based Distinctive Image CaptioningACM Multimedia (ACM MM), 2022
Yangjun Mao
Long Chen
Zhihong Jiang
Dong Zhang
Zhimeng Zhang
Jian Shao
Jun Xiao
DiffM
259
23
0
22 Jul 2022
On Distinctive Image Captioning via Comparing and Reweighting
On Distinctive Image Captioning via Comparing and ReweightingIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
197
23
0
08 Apr 2022
Spot the Difference: A Cooperative Object-Referring Game in
  Non-Perfectly Co-Observable Scene
Spot the Difference: A Cooperative Object-Referring Game in Non-Perfectly Co-Observable Scene
Duo Zheng
Fandong Meng
Q. Si
Hairun Fan
Zipeng Xu
Jie Zhou
Fangxiang Feng
Xiaojie Wang
197
0
0
16 Mar 2022
Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real
  Images
Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images
Zhuowan Li
Elias Stengel-Eskin
Yixiao Zhang
Cihang Xie
Q. Tran
Benjamin Van Durme
Alan Yuille
VLM
182
19
0
01 Oct 2021
Group-based Distinctive Image Captioning with Memory Attention
Group-based Distinctive Image Captioning with Memory Attention
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
288
19
0
20 Aug 2021
Tackling the Challenges in Scene Graph Generation with Local-to-Global
  Interactions
Tackling the Challenges in Scene Graph Generation with Local-to-Global Interactions
Sangmin Woo
Junhyug Noh
Kangil Kim
439
21
0
16 Jun 2021
Domain Adaptation for Semantic Segmentation via Patch-Wise Contrastive
  Learning
Domain Adaptation for Semantic Segmentation via Patch-Wise Contrastive Learning
Weizhe Liu
David Ferstl
S. Schulter
L. Zebedin
Pascal Fua
C. Leistner
283
41
0
22 Apr 2021
Quantifying Learnability and Describability of Visual Concepts Emerging
  in Representation Learning
Quantifying Learnability and Describability of Visual Concepts Emerging in Representation LearningNeural Information Processing Systems (NeurIPS), 2020
Iro Laina
Ruth C. Fong
Andrea Vedaldi
OCL
177
14
0
27 Oct 2020
Multimodal Research in Vision and Language: A Review of Current and
  Emerging Trends
Multimodal Research in Vision and Language: A Review of Current and Emerging Trends
Shagun Uppal
Sarthak Bhagat
Devamanyu Hazarika
Navonil Majumdar
Soujanya Poria
Roger Zimmermann
Amir Zadeh
324
6
0
19 Oct 2020
Neural Architecture Search for Lightweight Non-Local Networks
Neural Architecture Search for Lightweight Non-Local NetworksComputer Vision and Pattern Recognition (CVPR), 2020
Yingwei Li
Xiaojie Jin
Jieru Mei
Xiaochen Lian
Linjie Yang
Cihang Xie
Qihang Yu
Yuyin Zhou
S. Bai
Alan Yuille
177
55
0
04 Apr 2020
Deconfounded Image Captioning: A Causal Retrospect
Deconfounded Image Captioning: A Causal RetrospectIEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2020
Xu Yang
Hanwang Zhang
Jianfei Cai
CML
236
152
0
09 Mar 2020
1
Page 1 of 1