ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1704.06485
  4. Cited By
Attend to You: Personalized Image Captioning with Context Sequence
  Memory Networks

Attend to You: Personalized Image Captioning with Context Sequence Memory Networks

21 April 2017
C. C. Park
Byeongchang Kim
Gunhee Kim
ArXivPDFHTML

Papers citing "Attend to You: Personalized Image Captioning with Context Sequence Memory Networks"

30 / 30 papers shown
Title
MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation
MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation
R. Yasarla
H. Cai
Jisoo Jeong
Y. Shi
Risheek Garrepalli
Fatih Porikli
MDE
71
16
0
17 Jan 2025
PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
Junghyun Kim
Gi-Cheon Kang
Jaein Kim
Seoyun Yang
Minjoon Jung
Byoung-Tak Zhang
36
0
0
19 Oct 2023
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based
  Polishing
ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing
Zequn Zeng
Hao Zhang
Zhengjue Wang
Ruiying Lu
Dongsheng Wang
Bo Chen
BDL
DiffM
19
33
0
04 Mar 2023
Motion-aware Memory Network for Fast Video Salient Object Detection
Motion-aware Memory Network for Fast Video Salient Object Detection
Xingke Zhao
Haoran Liang
Peipei Li
Guodao Sun
Dongdong Zhao
Ronghua Liang
Xiaofei He
30
11
0
01 Aug 2022
On Distinctive Image Captioning via Comparing and Reweighting
On Distinctive Image Captioning via Comparing and Reweighting
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
38
16
0
08 Apr 2022
Cross Modal Retrieval with Querybank Normalisation
Cross Modal Retrieval with Querybank Normalisation
Simion-Vlad Bogolin
Ioana Croitoru
Hailin Jin
Yang Liu
Samuel Albanie
27
84
0
23 Dec 2021
Consensus Graph Representation Learning for Better Grounded Image
  Captioning
Consensus Graph Representation Learning for Better Grounded Image Captioning
Wenqiao Zhang
Haochen Shi
Siliang Tang
Jun Xiao
Qiang Yu
Yueting Zhuang
15
54
0
02 Dec 2021
Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context
  Images via Online Resources
Open-Domain, Content-based, Multi-modal Fact-checking of Out-of-Context Images via Online Resources
Sahar Abdelnabi
Rakibul Hasan
Mario Fritz
26
74
0
30 Nov 2021
Universal Face Restoration With Memorized Modulation
Universal Face Restoration With Memorized Modulation
Jia Li
Huaibo Huang
Xiaofei Jia
Ran He
CVBM
36
2
0
03 Oct 2021
Personalized Image Semantic Segmentation
Personalized Image Semantic Segmentation
Yu Zhang
Chang-Bin Zhang
Peng-Tao Jiang
Mingg-Ming Cheng
Feng Mao
21
4
0
24 Jul 2021
From Show to Tell: A Survey on Deep Learning-based Image Captioning
From Show to Tell: A Survey on Deep Learning-based Image Captioning
Matteo Stefanini
Marcella Cornia
Lorenzo Baraldi
S. Cascianelli
G. Fiameni
Rita Cucchiara
3DV
VLM
MLLM
67
254
0
14 Jul 2021
Human-like Controllable Image Captioning with Verb-specific Semantic
  Roles
Human-like Controllable Image Captioning with Verb-specific Semantic Roles
Long Chen
Zhihong Jiang
Jun Xiao
Wei Liu
30
74
0
22 Mar 2021
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
New Ideas and Trends in Deep Multimodal Content Understanding: A Review
Wei Chen
Weiping Wang
Li Liu
M. Lew
VLM
118
31
0
16 Oct 2020
Enriching Video Captions With Contextual Text
Enriching Video Captions With Contextual Text
Philipp Rimle
Pelin Dogan
Markus Gross
30
3
0
29 Jul 2020
Compare and Reweight: Distinctive Image Captioning Using Similar Images
  Sets
Compare and Reweight: Distinctive Image Captioning Using Similar Images Sets
Jiuniu Wang
Wenjia Xu
Qingzhong Wang
Antoni B. Chan
37
45
0
14 Jul 2020
Clue: Cross-modal Coherence Modeling for Caption Generation
Clue: Cross-modal Coherence Modeling for Caption Generation
Malihe Alikhani
Piyush Sharma
Shengjie Li
Radu Soricut
Matthew Stone
38
56
0
02 May 2020
Transferring Cross-domain Knowledge for Video Sign Language Recognition
Transferring Cross-domain Knowledge for Video Sign Language Recognition
Dongxu Li
Xin Yu
Chenchen Xu
L. Petersson
Hongdong Li
SLR
33
104
0
08 Mar 2020
Audio-driven Talking Face Video Generation with Learning-based
  Personalized Head Pose
Audio-driven Talking Face Video Generation with Learning-based Personalized Head Pose
Ran Yi
Zipeng Ye
Juyong Zhang
Hujun Bao
Yong-jin Liu
CVBM
27
122
0
24 Feb 2020
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic
  Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO
  Framework
Gaussian Smoothen Semantic Features (GSSF) -- Exploring the Linguistic Aspects of Visual Captioning in Indian Languages (Bengali) Using MSCOCO Framework
C. Sur
27
7
0
16 Feb 2020
MRRC: Multiple Role Representation Crossover Interpretation for Image
  Captioning With R-CNN Feature Distribution Composition (FDC)
MRRC: Multiple Role Representation Crossover Interpretation for Image Captioning With R-CNN Feature Distribution Composition (FDC)
C. Sur
25
16
0
15 Feb 2020
Unpaired Image-to-Speech Synthesis with Multimodal Information
  Bottleneck
Unpaired Image-to-Speech Synthesis with Multimodal Information Bottleneck
Shuang Ma
Daniel J. McDuff
Yale Song
25
22
0
19 Aug 2019
Coloring With Limited Data: Few-Shot Colorization via Memory-Augmented
  Networks
Coloring With Limited Data: Few-Shot Colorization via Memory-Augmented Networks
Seungjoo Yoo
Hyojin Bahng
Sunghyo Chung
Junsoo Lee
Jaehyuk Chang
Jaegul Choo
VLM
MQ
30
122
0
09 Jun 2019
Reasoning Visual Dialogs with Structural and Partial Observations
Reasoning Visual Dialogs with Structural and Partial Observations
Zilong Zheng
Wenguan Wang
Siyuan Qi
Song-Chun Zhu
39
117
0
11 Apr 2019
Video Object Segmentation using Space-Time Memory Networks
Video Object Segmentation using Space-Time Memory Networks
Seoung Wug Oh
Joon-Young Lee
N. Xu
Seon Joo Kim
VOS
23
699
0
01 Apr 2019
Abstractive Summarization of Reddit Posts with Multi-level Memory
  Networks
Abstractive Summarization of Reddit Posts with Multi-level Memory Networks
Byeongchang Kim
Hyunwoo J. Kim
Gunhee Kim
12
181
0
02 Nov 2018
Engaging Image Captioning Via Personality
Engaging Image Captioning Via Personality
Kurt Shuster
Samuel Humeau
Hexiang Hu
Antoine Bordes
Jason Weston
37
149
0
25 Oct 2018
A Comprehensive Survey of Deep Learning for Image Captioning
A Comprehensive Survey of Deep Learning for Image Captioning
Md Zakir Hossain
Ferdous Sohel
M. Shiratuddin
Hamid Laga
VLM
3DV
45
760
0
06 Oct 2018
Sometimes You Want to Go Where Everybody Knows your Name
Sometimes You Want to Go Where Everybody Knows your Name
Reuben Brasher
Nat Roth
Justin Wagle
25
0
0
30 Jan 2018
A Read-Write Memory Network for Movie Story Understanding
A Read-Write Memory Network for Movie Story Understanding
Seil Na
Sangho Lee
Jisung Kim
Gunhee Kim
AIMat
24
98
0
27 Sep 2017
Self-Guiding Multimodal LSTM - when we do not have a perfect training
  dataset for image captioning
Self-Guiding Multimodal LSTM - when we do not have a perfect training dataset for image captioning
Yang Xian
Yingli Tian
VLM
25
22
0
15 Sep 2017
1