ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.06880
  4. Cited By
Poet: Product-oriented Video Captioner for E-commerce

Poet: Product-oriented Video Captioner for E-commerce

16 August 2020
Shengyu Zhang
Ziqi Tan
Jin Yu
Zhou Zhao
Kun Kuang
Jie Liu
Jingren Zhou
Hongxia Yang
Fei Wu
ArXivPDFHTML

Papers citing "Poet: Product-oriented Video Captioner for E-commerce"

32 / 32 papers shown
Title
Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition
Knowledge-enhanced Multi-perspective Video Representation Learning for Scene Recognition
Xuzheng Yu
Chen Jiang
Wei Zhang
Tian Gan
Linlin Chao
Jianan Zhao
Yuan Cheng
Qingpei Guo
Wei Chu
51
0
0
09 Jan 2024
Graph Convolutional Networks for Temporal Action Localization
Graph Convolutional Networks for Temporal Action Localization
Runhao Zeng
Wenbing Huang
Mingkui Tan
Yu Rong
P. Zhao
Junzhou Huang
Chuang Gan
GNN
72
476
0
07 Sep 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for
  Vision-and-Language Tasks
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
160
3,659
0
06 Aug 2019
Watch It Twice: Video Captioning with a Refocused Video Encoder
Watch It Twice: Video Captioning with a Refocused Video Encoder
Xiangxi Shi
Jianfei Cai
Shafiq Joty
Jiuxiang Gu
26
28
0
21 Jul 2019
Object-aware Aggregation with Bidirectional Temporal Graph for Video
  Captioning
Object-aware Aggregation with Bidirectional Temporal Graph for Video Captioning
Junchao Zhang
Yuxin Peng
42
170
0
11 Jun 2019
VATEX: A Large-Scale, High-Quality Multilingual Dataset for
  Video-and-Language Research
VATEX: A Large-Scale, High-Quality Multilingual Dataset for Video-and-Language Research
Xin Eric Wang
Jiawei Wu
Junkun Chen
Lei Li
Yuan-fang Wang
William Yang Wang
49
544
0
06 Apr 2019
Towards Knowledge-Based Personalized Product Description Generation in
  E-commerce
Towards Knowledge-Based Personalized Product Description Generation in E-commerce
Qibin Chen
Junyang Lin
Yichang Zhang
Hongxia Yang
Jingren Zhou
Jie Tang
33
97
0
29 Mar 2019
LiveBot: Generating Live Video Comments Based on Visual and Textual
  Contexts
LiveBot: Generating Live Video Comments Based on Visual and Textual Contexts
Shuming Ma
Lei Cui
Damai Dai
Furu Wei
Xu Sun
VGen
31
62
0
13 Sep 2018
Videos as Space-Time Region Graphs
Videos as Space-Time Region Graphs
Xinyu Wang
Abhinav Gupta
52
753
0
05 Jun 2018
No Metrics Are Perfect: Adversarial Reward Learning for Visual
  Storytelling
No Metrics Are Perfect: Adversarial Reward Learning for Visual Storytelling
Xin Eric Wang
Wenhu Chen
Yuan-fang Wang
William Yang Wang
32
158
0
24 Apr 2018
Reconstruction Network for Video Captioning
Reconstruction Network for Video Captioning
Bairui Wang
Lin Ma
Wei Zhang
Wen Liu
106
317
0
30 Mar 2018
Localizing Moments in Video with Natural Language
Localizing Moments in Video with Natural Language
Lisa Anne Hendricks
Oliver Wang
Eli Shechtman
Josef Sivic
Trevor Darrell
Bryan C. Russell
78
933
0
04 Aug 2017
Convolutional Sequence to Sequence Learning
Convolutional Sequence to Sequence Learning
Jonas Gehring
Michael Auli
David Grangier
Denis Yarats
Yann N. Dauphin
AIMat
111
3,279
0
08 May 2017
Dense-Captioning Events in Videos
Dense-Captioning Events in Videos
Ranjay Krishna
Kenji Hata
F. Ren
Li Fei-Fei
Juan Carlos Niebles
107
1,225
0
02 May 2017
Towards Automatic Learning of Procedures from Web Instructional Videos
Towards Automatic Learning of Procedures from Web Instructional Videos
Luowei Zhou
Chenliang Xu
Jason J. Corso
EgoV
45
812
0
28 Mar 2017
Adversarial Learning for Neural Dialogue Generation
Adversarial Learning for Neural Dialogue Generation
Jiwei Li
Will Monroe
Tianlin Shi
Sébastien Jean
Alan Ritter
Dan Jurafsky
33
898
0
23 Jan 2017
ConceptNet 5.5: An Open Multilingual Graph of General Knowledge
ConceptNet 5.5: An Open Multilingual Graph of General Knowledge
R. Speer
Joshua Chin
Catherine Havasi
101
2,874
0
12 Dec 2016
Semi-Supervised Classification with Graph Convolutional Networks
Semi-Supervised Classification with Graph Convolutional Networks
Thomas Kipf
Max Welling
GNN
SSL
310
28,795
0
09 Sep 2016
Title Generation for User Generated Videos
Title Generation for User Generated Videos
Kuo-Hao Zeng
Tseng-Hung Chen
Juan Carlos Niebles
Min Sun
46
69
0
25 Aug 2016
Fashion Landmark Detection in the Wild
Fashion Landmark Detection in the Wild
Ziwei Liu
Sijie Yan
Ping Luo
Xiaogang Wang
Xiaoou Tang
36
171
0
10 Aug 2016
Hollywood in Homes: Crowdsourcing Data Collection for Activity
  Understanding
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding
Gunnar Sigurdsson
Gül Varol
Xinyu Wang
Ali Farhadi
Ivan Laptev
Abhinav Gupta
VGen
54
1,232
0
06 Apr 2016
Improving LSTM-based Video Description with Linguistic Knowledge Mined
  from Text
Improving LSTM-based Video Description with Linguistic Knowledge Mined from Text
Subhashini Venugopalan
Lisa Anne Hendricks
Raymond J. Mooney
Kate Saenko
VLM
35
117
0
06 Apr 2016
Hierarchical Recurrent Neural Encoder for Video Representation with
  Application to Captioning
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning
Pingbo Pan
Zhongwen Xu
Yi Yang
Fei Wu
Yueting Zhuang
33
385
0
11 Nov 2015
Sequence to Sequence -- Video to Text
Sequence to Sequence -- Video to Text
Subhashini Venugopalan
Marcus Rohrbach
Jeff Donahue
Raymond J. Mooney
Trevor Darrell
Kate Saenko
76
1,417
0
03 May 2015
Using Descriptive Video Services to Create a Large Data Source for Video
  Annotation Research
Using Descriptive Video Services to Create a Large Data Source for Video Annotation Research
Atousa Torabi
C. Pal
Hugo Larochelle
Aaron Courville
VGen
51
205
0
03 Mar 2015
Describing Videos by Exploiting Temporal Structure
Describing Videos by Exploiting Temporal Structure
L. Yao
Atousa Torabi
Kyunghyun Cho
Nicolas Ballas
C. Pal
Hugo Larochelle
Aaron Courville
105
1,063
0
27 Feb 2015
A Dataset for Movie Description
A Dataset for Movie Description
Anna Rohrbach
Marcus Rohrbach
Niket Tandon
Bernt Schiele
VGen
66
499
0
12 Jan 2015
Translating Videos to Natural Language Using Deep Recurrent Neural
  Networks
Translating Videos to Natural Language Using Deep Recurrent Neural Networks
Subhashini Venugopalan
Huijuan Xu
Jeff Donahue
Marcus Rohrbach
Raymond J. Mooney
Kate Saenko
74
951
0
15 Dec 2014
CIDEr: Consensus-based Image Description Evaluation
CIDEr: Consensus-based Image Description Evaluation
Ramakrishna Vedantam
C. L. Zitnick
Devi Parikh
173
4,451
0
20 Nov 2014
Memory Networks
Memory Networks
Jason Weston
S. Chopra
Antoine Bordes
GNN
KELM
101
1,702
0
15 Oct 2014
On the Properties of Neural Machine Translation: Encoder-Decoder
  Approaches
On the Properties of Neural Machine Translation: Encoder-Decoder Approaches
Kyunghyun Cho
B. V. Merrienboer
Dzmitry Bahdanau
Yoshua Bengio
AI4CE
AIMat
82
6,748
0
03 Sep 2014
Coherent Multi-Sentence Video Description with Variable Level of Detail
Coherent Multi-Sentence Video Description with Variable Level of Detail
Anna Rohrbach
Marcus Rohrbach
Weijian Qiu
Annemarie Friedrich
Sikandar Amin
Mykhaylo Andriluka
Manfred Pinkal
Bernt Schiele
40
217
0
24 Mar 2014
1