ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2103.12975
  4. Cited By
VLGrammar: Grounded Grammar Induction of Vision and Language

VLGrammar: Grounded Grammar Induction of Vision and Language

24 March 2021
Yining Hong
Qing Li
Song-Chun Zhu
Siyuan Huang
    VLM
ArXivPDFHTML

Papers citing "VLGrammar: Grounded Grammar Induction of Vision and Language"

19 / 19 papers shown
Title
SPIN: Hierarchical Segmentation with Subpart Granularity in Natural
  Images
SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images
Josh Myers-Dean
Jarek Reynolds
Brian Price
Yifei Fan
Danna Gurari
49
2
0
12 Jul 2024
Text-to-3D Shape Generation
Text-to-3D Shape Generation
Han-Hung Lee
Manolis Savva
Angel X. Chang
34
11
0
20 Mar 2024
Activity Grammars for Temporal Action Segmentation
Activity Grammars for Temporal Action Segmentation
Dayoung Gong
Joonseok Lee
Deunsol Jung
Suha Kwak
Minsu Cho
50
7
0
07 Dec 2023
On the Transferability of Visually Grounded PCFGs
On the Transferability of Visually Grounded PCFGs
Yanpeng Zhao
Ivan Titov
22
1
0
21 Oct 2023
State of the Art on Diffusion Models for Visual Computing
State of the Art on Diffusion Models for Visual Computing
Ryan Po
Wang Yifan
Vladislav Golyanik
Kfir Aberman
Jonathan T. Barron
...
Matthias Nießner
Bjorn Ommer
Christian Theobalt
Peter Wonka
Gordon Wetzstein
38
103
0
11 Oct 2023
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
3D-VisTA: Pre-trained Transformer for 3D Vision and Text Alignment
Ziyu Zhu
Xiaojian Ma
Yixin Chen
Zhidong Deng
Siyuan Huang
Qing Li
LM&Ro
34
106
0
08 Aug 2023
Semantic Composition in Visually Grounded Language Models
Semantic Composition in Visually Grounded Language Models
Rohan Pandey
CoGe
26
1
0
15 May 2023
Cross-modal Attention Congruence Regularization for Vision-Language
  Relation Alignment
Cross-modal Attention Congruence Regularization for Vision-Language Relation Alignment
Rohan Pandey
Rulin Shao
Paul Pu Liang
Ruslan Salakhutdinov
Louis-Philippe Morency
31
12
0
20 Dec 2022
Unsupervised Discontinuous Constituency Parsing with Mildly
  Context-Sensitive Grammars
Unsupervised Discontinuous Constituency Parsing with Mildly Context-Sensitive Grammars
Aaron Courville
R. Levy
Yoon Kim
39
5
0
18 Dec 2022
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved
  Visio-Linguistic Models in 3D Scenes
ScanEnts3D: Exploiting Phrase-to-3D-Object Correspondences for Improved Visio-Linguistic Models in 3D Scenes
Ahmed Abdelreheem
Kyle Olszewski
Hsin-Ying Lee
Peter Wonka
Panos Achlioptas
3DPC
24
28
0
12 Dec 2022
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes
HUMANISE: Language-conditioned Human Motion Generation in 3D Scenes
Zan Wang
Yixin Chen
Tengyu Liu
Yixin Zhu
Wei Liang
Siyuan Huang
48
104
0
18 Oct 2022
SQA3D: Situated Question Answering in 3D Scenes
SQA3D: Situated Question Answering in 3D Scenes
Xiaojian Ma
Silong Yong
Zilong Zheng
Qing Li
Yitao Liang
Song-Chun Zhu
Siyuan Huang
LM&Ro
38
134
0
14 Oct 2022
VALHALLA: Visual Hallucination for Machine Translation
VALHALLA: Visual Hallucination for Machine Translation
Yi Li
Yikang Shen
Yoon Kim
Chun-Fu Chen
Rogerio Feris
David D. Cox
Nuno Vasconcelos
MLLM
40
38
0
31 May 2022
Fixing Malfunctional Objects With Learned Physical Simulation and
  Functional Prediction
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction
Yining Hong
Kaichun Mo
L. Yi
Leonidas J. Guibas
Antonio Torralba
J. Tenenbaum
Chuang Gan
42
5
0
05 May 2022
Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene
  Graphs with Language Structures via Dependency Relationships
Unsupervised Vision-Language Parsing: Seamlessly Bridging Visual Scene Graphs with Language Structures via Dependency Relationships
Chao Lou
Wenjuan Han
Yuh-Chen Lin
Zilong Zheng
CoGe
23
10
0
27 Mar 2022
PartGlot: Learning Shape Part Segmentation from Language Reference Games
PartGlot: Learning Shape Part Segmentation from Language Reference Games
Juil Koo
Ian Huang
Panos Achlioptas
Leonidas J. Guibas
Minhyuk Sung
3DPC
40
28
0
13 Dec 2021
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical
  Reasoning
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning
Yining Hong
Li Yi
J. Tenenbaum
Antonio Torralba
Chuang Gan
9
39
0
09 Dec 2021
Sequence-to-Sequence Learning with Latent Neural Grammars
Sequence-to-Sequence Learning with Latent Neural Grammars
Yoon Kim
33
40
0
02 Sep 2021
Parts2Words: Learning Joint Embedding of Point Clouds and Texts by
  Bidirectional Matching between Parts and Words
Parts2Words: Learning Joint Embedding of Point Clouds and Texts by Bidirectional Matching between Parts and Words
Chuan Tang
Xi Yang
Bojian Wu
Zhizhong Han
Yi Chang
3DPC
35
13
0
05 Jul 2021
1