ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1809.00681
  4. Cited By
Diverse and Coherent Paragraph Generation from Images

Diverse and Coherent Paragraph Generation from Images

3 September 2018
Moitreya Chatterjee
Alex Schwing
ArXivPDFHTML

Papers citing "Diverse and Coherent Paragraph Generation from Images"

15 / 15 papers shown
Title
IC3: Image Captioning by Committee Consensus
IC3: Image Captioning by Committee Consensus
David M. Chan
Austin Myers
Sudheendra Vijayanarasimhan
David A. Ross
John F. Canny
34
17
0
02 Feb 2023
Bypass Network for Semantics Driven Image Paragraph Captioning
Bypass Network for Semantics Driven Image Paragraph Captioning
Qinjie Zheng
Chaoyue Wang
Dadong Wang
32
1
0
21 Jun 2022
Language Models Can See: Plugging Visual Controls in Text Generation
Language Models Can See: Plugging Visual Controls in Text Generation
Yixuan Su
Tian Lan
Yahui Liu
Fangyu Liu
Dani Yogatama
Yan Wang
Lingpeng Kong
Nigel Collier
VLM
MLLM
62
97
0
05 May 2022
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic
  Arithmetic
ZeroCap: Zero-Shot Image-to-Text Generation for Visual-Semantic Arithmetic
Yoad Tewel
Yoav Shalev
Idan Schwartz
Lior Wolf
VLM
36
193
0
29 Nov 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
69
256
0
14 Jul 2021
Matching Visual Features to Hierarchical Semantic Topics for Image
  Paragraph Captioning
Matching Visual Features to Hierarchical Semantic Topics for Image Paragraph Captioning
D. Guo
Ruiying Lu
Bo Chen
Zequn Zeng
Mingyuan Zhou
VLM
30
9
0
10 May 2021
Knowledge driven Description Synthesis for Floor Plan Interpretation
Knowledge driven Description Synthesis for Floor Plan Interpretation
Shreya Goyal
Chiranjoy Chattopadhyay
Gaurav Bhatnagar
3DV
23
13
0
15 Mar 2021
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing
  Functional Entropies
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies
Itai Gat
Idan Schwartz
Alex Schwing
Tamir Hazan
63
90
0
21 Oct 2020
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual
  Sequence Generation
Sparse Graph to Sequence Learning for Vision Conditioned Long Textual Sequence Generation
Aditya Mogadala
Marius Mosbach
Dietrich Klakow
VLM
196
0
0
12 Jul 2020
Sequential Latent Spaces for Modeling the Intention During Diverse Image
  Captioning
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning
J. Aneja
Harsh Agrawal
Dhruv Batra
Alex Schwing
BDL
VLM
26
66
0
22 Aug 2019
Encoder-Agnostic Adaptation for Conditional Language Generation
Encoder-Agnostic Adaptation for Conditional Language Generation
Zachary M. Ziegler
Luke Melas-Kyriazi
Sebastian Gehrmann
Alexander M. Rush
AI4CE
16
57
0
19 Aug 2019
Convolutional Auto-encoding of Sentence Topics for Image Paragraph
  Generation
Convolutional Auto-encoding of Sentence Topics for Image Paragraph Generation
Jing Wang
Yingwei Pan
Ting Yao
Jinhui Tang
Tao Mei
VLM
BDL
DiffM
27
36
0
01 Aug 2019
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Context-Aware Visual Policy Network for Fine-Grained Image Captioning
Zhengjun Zha
Daqing Liu
Hanwang Zhang
Yongdong Zhang
Feng Wu
25
120
0
06 Jun 2019
Factor Graph Attention
Factor Graph Attention
Idan Schwartz
Seunghak Yu
Tamir Hazan
Alex Schwing
30
110
0
11 Apr 2019
A Simple Baseline for Audio-Visual Scene-Aware Dialog
A Simple Baseline for Audio-Visual Scene-Aware Dialog
Idan Schwartz
Alex Schwing
Tamir Hazan
27
69
0
11 Apr 2019
1