Multimedia Generative Script Learning for Task Planning


25 August 2022
Qingyun Wang
Manling Li
Hou Pong Chan
Lifu Huang
J. Hockenmaier
Girish Chowdhary
Heng Ji
    VGen

Papers citing "Multimedia Generative Script Learning for Task Planning"

10 / 10 papers shown
Long-horizon Visual Instruction Generation with Logic and Attribute Self-reflection
Yucheng Suo
Fan Ma
Kaixin Shen
Linchao Zhu
Yi Yang
VLM
52
0
0
12 Mar 2025
Will GPT-4 Run DOOM?
Adrian de Wynter
LM&Ro
MLLM
43
5
0
08 Mar 2024
Chem-FINESE: Validating Fine-Grained Few-shot Entity Extraction through Text Reconstruction
Qingyun Wang
Zixuan Zhang
Hongxiang Li
Xuan Liu
Jiawei Han
Huimin Zhao
Heng Ji
49
1
0
18 Jan 2024
Non-Sequential Graph Script Induction via Multimedia Grounding
Yu Zhou
Sha Li
Manling Li
Xudong Lin
Shih-Fu Chang
Mohit Bansal
Heng Ji
30
8
0
27 May 2023
Procedure-Aware Pretraining for Instructional Video Understanding
Honglu Zhou
Roberto Martín-Martín
Mubbasir Kapadia
Silvio Savarese
Juan Carlos Niebles
28
38
0
31 Mar 2023
CoNT: Contrastive Neural Text Generation
Chenxin An
Jiangtao Feng
Kai Lv
Lingpeng Kong
Xipeng Qiu
Xuanjing Huang
100
23
0
29 May 2022
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Junnan Li
Dongxu Li
Caiming Xiong
S. Hoi
MLLM
BDL
VLM
CLIP
392
4,137
0
28 Jan 2022
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts
Soravit Changpinyo
P. Sharma
Nan Ding
Radu Soricut
VLM
290
1,084
0
17 Feb 2021
Unifying Vision-and-Language Tasks via Text Generation
Jaemin Cho
Jie Lei
Hao Tan
Mohit Bansal
MLLM
256
525
0
04 Feb 2021
Augmented SBERT: Data Augmentation Method for Improving Bi-Encoders for Pairwise Sentence Scoring Tasks
Nandan Thakur
Nils Reimers
Johannes Daxenberger
Iryna Gurevych
205
241
0
16 Oct 2020