ResearchTrend.AI

arXiv: 1705.00823

STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset

2 May 2017
Yuya Yoshikawa
Yutaro Shigeto
A. Takeuchi
    3DV

Papers citing "STAIR Captions: Constructing a Large-Scale Japanese Image Caption Dataset"

11 / 61 papers shown
Aligning Multilingual Word Embeddings for Cross-Modal Retrieval Task
Alireza Mohammadshahi
R. Lebret
Karl Aberer
20
10
0
08 Oct 2019
Trends in Integration of Vision and Language Research: A Survey of Tasks, Datasets, and Methods
Aditya Mogadala
M. Kalimuthu
Dietrich Klakow
VLM
25
132
0
22 Jul 2019
Unsupervised Bilingual Lexicon Induction from Mono-lingual Multimodal Data
Shizhe Chen
Qin Jin
Alexander G. Hauptmann
SSL
9
9
0
02 Jun 2019
Models of Visually Grounded Speech Signal Pay Attention To Nouns: a Bilingual Experiment on English and Japanese
William N. Havard
Jean-Pierre Chevrot
Laurent Besacier
23
24
0
08 Feb 2019
How2: A Large-scale Dataset for Multimodal Language Understanding
Ramon Sanabria
Ozan Caglayan
Shruti Palaskar
Desmond Elliott
Loïc Barrault
Lucia Specia
Florian Metze
VGen
MLLM
24
286
0
01 Nov 2018
Neural Joking Machine : Humorous image captioning
Kota Yoshida
Munetaka Minoguchi
Kenichiro Wani
Akio Nakamura
Hirokatsu Kataoka
14
11
0
30 May 2018
COCO-CN for Cross-Lingual Image Tagging, Captioning and Retrieval
Xirong Li
Chaoxi Xu
Xiaoxu Wang
Weiyu Lan
Zhengxiong Jia
Gang Yang
Jieping Xu
22
149
0
22 May 2018
Findings of the Second Shared Task on Multimodal Machine Translation and Multilingual Image Description
Desmond Elliott
Stella Frank
Loïc Barrault
Fethi Bougares
Lucia Specia
VLM
25
218
0
19 Oct 2017
Emergent Translation in Multi-Agent Communication
Jason D. Lee
Kyunghyun Cho
Jason Weston
Douwe Kiela
24
68
0
12 Oct 2017
Image Pivoting for Learning Multilingual Multimodal Representations
Spandana Gella
Rico Sennrich
Frank Keller
Mirella Lapata
SSL
30
78
0
24 Jul 2017
Cross-linguistic differences and similarities in image descriptions
Emiel van Miltenburg
Desmond Elliott
Piek Vossen
VLM
16
33
0
06 Jul 2017