Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.06880
Cited By
Poet: Product-oriented Video Captioner for E-commerce
16 August 2020
Shengyu Zhang
Ziqi Tan
Jin Yu
Zhou Zhao
Kun Kuang
Jie Liu
Jingren Zhou
Hongxia Yang
Fei Wu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Poet: Product-oriented Video Captioner for E-commerce"
32 / 32 papers shown
Title
Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition
Xuzheng Yu
Chen Jiang
Wei Zhang
Tian Gan
Linlin Chao
Jianan Zhao
Yuan Cheng
Qingpei Guo
Wei Chu
51
0
0
09 Jan 2024
Graph Convolutional Networks for Temporal Action Localization
Runhao Zeng
Wenbing Huang
Mingkui Tan
Yu Rong
P. Zhao
Junzhou Huang
Chuang Gan
GNN
72
476
0
07 Sep 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
160
3,659
0
06 Aug 2019
Watch It Twice: Video Captioning with a Refocused Video Encoder
Xiangxi Shi
Jianfei Cai
Shafiq Joty
Jiuxiang Gu
26
28
0
21 Jul 2019
Object-aware Aggregation with Bidirectional Temporal Graph for Video Captioning
Junchao Zhang
Yuxin Peng
42
170
0
11 Jun 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
49
544
0
06 Apr 2019
Towards Knowledge-Based Personalized Product Description Generation in E-commerce
Qibin Chen
Junyang Lin
Yichang Zhang
Hongxia Yang
Jingren Zhou
Jie Tang
33
97
0
29 Mar 2019
LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts
Shuming Ma
Lei Cui
Damai Dai
Furu Wei
Xu Sun
VGen
31
62
0
13 Sep 2018
Videos as Space-Time Region Graphs
Xinyu Wang
Abhinav Gupta
52
753
0
05 Jun 2018
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
Xin Eric Wang
Wenhu Chen
Yuan-fang Wang
William Yang Wang
32
158
0
24 Apr 2018
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
106
317
0
30 Mar 2018
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
78
933
0
04 Aug 2017
Convolutional Sequence to Sequence Learning
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
111
3,279
0
08 May 2017
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
107
1,225
0
02 May 2017
Towards Automatic Learning of Procedures from Web Instructional Videos
Luowei Zhou
Chenliang Xu
Jason J. Corso
EgoV
45
812
0
28 Mar 2017
Adversarial Learning for Neural Dialogue Generation
Jiwei Li
Will Monroe
Tianlin Shi
Sébastien Jean
Alan Ritter
Dan Jurafsky
33
898
0
23 Jan 2017
ConceptNet 5.5: An Open Multilingual Graph of General Knowledge
R. Speer
Joshua Chin
Catherine Havasi
101
2,874
0
12 Dec 2016
Semi-Supervised Classification with Graph Convolutional Networks
Thomas Kipf
Max Welling
GNN
SSL
310
28,795
0
09 Sep 2016
Title Generation for User Generated Videos
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
46
69
0
25 Aug 2016
Fashion Landmark Detection in the Wild
Ziwei Liu
Sijie Yan
Ping Luo
Xiaogang Wang
Xiaoou Tang
36
171
0
10 Aug 2016
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
Gunnar Sigurdsson
Gül Varol
Xinyu Wang
Ali Farhadi
Ivan Laptev
Abhinav Gupta
VGen
54
1,232
0
06 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
35
117
0
06 Apr 2016
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning
Pingbo Pan
Zhongwen Xu
Yi Yang
Fei Wu
Yueting Zhuang
33
385
0
11 Nov 2015
Sequence to Sequence -- Video to Text
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
76
1,417
0
03 May 2015
Using Descriptive Video Services to Create a Large Data Source for Video Annotation Research
Atousa Torabi
C. Pal
Hugo Larochelle
Aaron Courville
VGen
51
205
0
03 Mar 2015
Describing Videos by Exploiting Temporal Structure
L. Yao
Atousa Torabi
Kyunghyun Cho
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
105
1,063
0
27 Feb 2015
A Dataset for Movie Description
Anna Rohrbach
Marcus Rohrbach
Niket Tandon
Bernt Schiele
VGen
66
499
0
12 Jan 2015
Translating Videos to Natural Language Using Deep Recurrent Neural Networks
Subhashini Venugopalan
Huijuan Xu
Jeff Donahue
Marcus Rohrbach
Raymond J. Mooney
Kate Saenko
74
951
0
15 Dec 2014
CIDEr: Consensus-based Image Description Evaluation
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
173
4,451
0
20 Nov 2014
Memory Networks
Jason Weston
S. Chopra
Antoine Bordes
GNN
KELM
101
1,702
0
15 Oct 2014
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE
AIMat
82
6,748
0
03 Sep 2014
Coherent Multi-Sentence Video Description with Variable Level of Detail
Anna Rohrbach
Marcus Rohrbach
Weijian Qiu
Annemarie Friedrich
Sikandar Amin
Mykhaylo Andriluka
Manfred Pinkal
Bernt Schiele
40
217
0
24 Mar 2014
1