STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset

2 May 2017

Papers citing "STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset"

11 / 61 papers shown

| Title | Authors | Tags | | Date |
|---|---|---|---|---|
| Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task | Alireza Mohammadshahi, R. Lebret, Karl Aberer | | 20 10 0 | 08 Oct 2019 |
| Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods | Aditya Mogadala, M. Kalimuthu, Dietrich Klakow | VLM | 25 132 0 | 22 Jul 2019 |
| Unsupervised Bilingual Lexicon Induction from Mono-lingual Multimodal Data | Shizhe Chen, Qin Jin, Alexander G. Hauptmann | SSL | 9 9 0 | 02 Jun 2019 |
| Models of Visually Grounded Speech Signal Pay Attention To Nouns: a Bilingual Experiment on English and Japanese | William N. Havard, Jean-Pierre Chevrot, Laurent Besacier | | 23 24 0 | 08 Feb 2019 |
| How2: A Large-scale Dataset for Multimodal Language Understanding | Ramon Sanabria, Ozan Caglayan, Shruti Palaskar, Desmond Elliott, Loïc Barrault, Lucia Specia, Florian Metze | VGen, MLLM | 24 286 0 | 01 Nov 2018 |
| Neural Joking Machine : Humorous image captioning | Kota Yoshida, Munetaka Minoguchi, Kenichiro Wani, Akio Nakamura, Hirokatsu Kataoka | | 14 11 0 | 30 May 2018 |
| COCO-CN for Cross-Lingual Image Tagging, Captioning and Retrieval | Xirong Li, Chaoxi Xu, Xiaoxu Wang, Weiyu Lan, Zhengxiong Jia, Gang Yang, Jieping Xu | | 22 149 0 | 22 May 2018 |
| Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description | Desmond Elliott, Stella Frank, Loïc Barrault, Fethi Bougares, Lucia Specia | VLM | 25 218 0 | 19 Oct 2017 |
| Emergent Translation in Multi-Agent Communication | Jason D. Lee, Kyunghyun Cho, Jason Weston, Douwe Kiela | | 24 68 0 | 12 Oct 2017 |
| Image Pivoting for Learning Multilingual Multimodal Representations | Spandana Gella, Rico Sennrich, Frank Keller, Mirella Lapata | SSL | 30 78 0 | 24 Jul 2017 |
| Cross-linguistic differences and similarities in image descriptions | Emiel van Miltenburg, Desmond Elliott, Piek Vossen | VLM | 16 33 0 | 06 Jul 2017 |