Show, Interpret and Tell: Entity-aware Contextualised Image Captioning
in Wikipedia

Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia

21 September 2022

Ali Furkan Biten

Dimosthenis Karatzas

Papers citing "Show, Interpret and Tell: Entity-aware Contextualised Image Captioning in Wikipedia"

13 / 13 papers shown

Title
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach Mathilde Caron Alireza Fathi Cordelia Schmid Ahmet Iscen 34 1 0 31 Oct 2024
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives Èric Śanchez Adrià Molina O. R. Terrades 29 0 0 05 Sep 2024
Surveying the Landscape of Image Captioning Evaluation: A Comprehensive Taxonomy, Trends and Metrics Analysis Uri Berger Gabriel Stanovsky Omri Abend Lea Frermann 32 0 0 09 Aug 2024
Enhancing Journalism with AI: A Study of Contextualized Image Captioning for News Articles using LLMs and LMMs Aliki Anagnostopoulou Thiago S. Gouvêa Daniel Sonntag 37 1 0 08 Aug 2024
Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights Shunqi Mao Chaoyi Zhang Hang Su Hwanjun Song Igor Shalyminov Weidong Cai 30 1 0 16 Jul 2024
MIKE: A New Benchmark for Fine-grained Multimodal Entity Knowledge Editing Jiaqi Li Miaozeng Du Chuanyi Zhang Yongrui Chen Nan Hu Guilin Qi Haiyun Jiang Siyuan Cheng Bo Tian 20 14 0 18 Feb 2024
WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset Andrea Burns Krishna Srinivasan Joshua Ainslie Geoff Brown Bryan A. Plummer Kate Saenko Jianmo Ni Mandy Guo VLM 29 4 0 09 May 2023
A Suite of Generative Tasks for Multi-Level Multimodal Webpage Understanding Andrea Burns Krishna Srinivasan Joshua Ainslie Geoff Brown Bryan A. Plummer Kate Saenko Jianmo Ni Mandy Guo 3DV 42 11 0 05 May 2023
WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning Krishna Srinivasan K. Raman Jiecao Chen Michael Bendersky Marc Najork VLM 208 310 0 02 Mar 2021
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network Jiayi Ji Yunpeng Luo Xiaoshuai Sun Fuhai Chen Gen Luo Yongjian Wu Yue Gao Rongrong Ji ViT 43 170 0 13 Dec 2020
OpenNMT: Open-Source Toolkit for Neural Machine Translation Guillaume Klein Yoon Kim Yuntian Deng Jean Senellart Alexander M. Rush 259 1,896 0 10 Jan 2017
Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning Jiasen Lu Caiming Xiong Devi Parikh R. Socher 85 1,442 0 06 Dec 2016
Google's Neural Machine Translation System: Bridging the Gap between Human and Machine Translation Yonghui Wu M. Schuster Z. Chen Quoc V. Le Mohammad Norouzi ... Alex Rudnick Oriol Vinyals G. Corrado Macduff Hughes J. Dean AIMat 716 6,743 0 26 Sep 2016