Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2304.07647
Cited By
LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision
15 April 2023
Jiani Huang
Ziyang Li
Mayur Naik
Ser-Nam Lim
Re-assign community
ArXiv
PDF
HTML
Papers citing
"LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision"
47 / 47 papers shown
Title
What can Off-the-Shelves Large Multi-Modal Models do for Dynamic Scene Graph Generation?
Xuanming Cui
Jaiminkumar Ashokbhai Bhoi
Chionh Wei Peng
Adriel Kuek
Ser-Nam Lim
84
0
0
20 Mar 2025
Neuro-Symbolic AI in 2024: A Systematic Review
Brandon C. Colelough
William Regli
NAI
101
10
0
09 Jan 2025
DeiSAM: Segment Anything with Deictic Prompting
Hikaru Shindo
Manuel Brack
Gopika Sudhakaran
Devendra Singh Dhami
P. Schramowski
Kristian Kersting
VLM
60
3
0
21 Feb 2024
Panoptic Video Scene Graph Generation
Jingkang Yang
Wen-Hsiao Peng
Xiangtai Li
Zujin Guo
Liangyu Chen
...
Zheng Ma
Kaiyang Zhou
Wayne Zhang
Chen Change Loy
Ziwei Liu
VOS
91
42
0
28 Nov 2023
Neuro-symbolic Empowered Denoising Diffusion Probabilistic Models for Real-time Anomaly Detection in Industry 4.0
Luigi Capogrosso
Alessio Mascolini
Federico Girella
Geri Skenderi
Sebastiano Gaiardelli
...
S. D. Cataldo
Sara Vinco
Enrico Macii
Franco Fummi
Marco Cristani
AI4CE
DiffM
43
3
0
13 Jul 2023
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Hang Zhang
Xin Li
Lidong Bing
MLLM
124
1,006
0
05 Jun 2023
TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments
Yu Sun
Qian Bao
Wu Liu
Tao Mei
Michael J. Black
3DH
56
61
0
05 Jun 2023
Auto-GPT for Online Decision Making: Benchmarks and Additional Opinions
Hui Yang
Sifu Yue
Yunzhong He
RALM
25
157
0
04 Jun 2023
Improved Logical Reasoning of Language Models via Differentiable Symbolic Programming
Hanlin Zhang
Jiani Huang
Ziyang Li
Mayur Naik
Eric P. Xing
ReLM
LRM
63
28
0
05 May 2023
Scallop: A Language for Neurosymbolic Programming
Ziyang Li
Jiani Huang
Mayur Naik
ReLM
LRM
NAI
47
31
0
10 Apr 2023
Unbiased Scene Graph Generation in Videos
Sayak Nag
Kyle Min
Subarna Tripathi
Amit K. Roy-Chowdhury
47
29
0
03 Apr 2023
Sigmoid Loss for Language Image Pre-Training
Xiaohua Zhai
Basil Mustafa
Alexander Kolesnikov
Lucas Beyer
CLIP
VLM
113
1,076
0
27 Mar 2023
Meta Spatio-Temporal Debiasing for Video Scene Graph Generation
Li Xu
Haoxuan Qu
Jason Kuen
Jiuxiang Gu
Jun Liu
CML
72
27
0
23 Jul 2022
Temporal Alignment Networks for Long-term Video
Tengda Han
Weidi Xie
Andrew Zisserman
AI4TS
58
85
0
06 Apr 2022
Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Dohwan Ko
Joonmyung Choi
Juyeon Ko
Shinyeong Noh
Kyoung-Woon On
Eun-Sol Kim
Hyunwoo J. Kim
VGen
AI4TS
52
22
0
31 Mar 2022
Scene Graph Generation: A Comprehensive Survey
Guangming Zhu
Liang Zhang
Youliang Jiang
Yixuan Dang
Haoran Hou
...
Mingtao Feng
Xia Zhao
Qiguang Miao
Syed Afaq Ali Shah
Bennamoun
3DV
70
84
0
03 Jan 2022
VIOLET : End-to-End Video-Language Transformers with Masked Visual-token Modeling
Tsu-Jui Fu
Linjie Li
Zhe Gan
Kevin Qinghong Lin
Wenjie Wang
Lijuan Wang
Zicheng Liu
VLM
85
219
0
24 Nov 2021
Ego4D: Around the World in 3,000 Hours of Egocentric Video
Kristen Grauman
Andrew Westbury
Eugene Byrne
Zachary Chavis
Antonino Furnari
...
Mike Zheng Shou
Antonio Torralba
Lorenzo Torresani
Mingfei Yan
Jitendra Malik
EgoV
359
1,056
0
13 Oct 2021
Grounding Predicates through Actions
Toki Migimatsu
Jeannette Bohg
171
35
0
29 Sep 2021
VideoCLIP: Contrastive Pre-training for Zero-shot Video-Text Understanding
Hu Xu
Gargi Ghosh
Po-Yao (Bernie) Huang
Dmytro Okhonko
Armen Aghajanyan
Florian Metze
Luke Zettlemoyer
Florian Metze Luke Zettlemoyer Christoph Feichtenhofer
CLIP
VLM
303
567
0
28 Sep 2021
Drop-DTW: Aligning Common Signal Between Sequences While Dropping Outliers
Nikita Dvornik
Isma Hadji
Konstantinos G. Derpanis
Animesh Garg
Allan D. Jepson
27
50
0
26 Aug 2021
Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Yuren Cong
Wentong Liao
H. Ackermann
Bodo Rosenhahn
M. Yang
ViT
26
126
0
26 Jul 2021
Align before Fuse: Vision and Language Representation Learning with Momentum Distillation
Junnan Li
Ramprasaath R. Selvaraju
Akhilesh Deepak Gotmare
Shafiq Joty
Caiming Xiong
Guosheng Lin
FaML
161
1,915
0
16 Jul 2021
DeepStochLog: Neural Stochastic Logic Programming
Thomas Winters
G. Marra
Robin Manhaeve
Luc de Raedt
BDL
NAI
58
64
0
23 Jun 2021
Fully Convolutional Scene Graph Generation
Hengyue Liu
Ning Yan
Masood S. Mortazavi
B. Bhanu
46
107
0
30 Mar 2021
Visual Distant Supervision for Scene Graph Generation
Yuan Yao
Ao Zhang
Xu Han
Mengdi Li
C. Weber
Zhiyuan Liu
S. Wermter
Maosong Sun
37
40
0
29 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
767
28,659
0
26 Feb 2021
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
407
3,778
0
11 Feb 2021
Rescaling Egocentric Vision
Dima Damen
Hazel Doughty
G. Farinella
Antonino Furnari
Evangelos Kazakos
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
52
444
0
23 Jun 2020
Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning
Qing Li
Siyuan Huang
Yining Hong
Yixin Chen
Ying Nian Wu
Song-Chun Zhu
NAI
56
83
0
11 Jun 2020
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
46
174
0
20 Dec 2019
End-to-End Learning of Visual Representations from Uncurated Instructional Videos
Antoine Miech
Jean-Baptiste Alayrac
Lucas Smaira
Ivan Laptev
Josef Sivic
Andrew Zisserman
VGen
SSL
111
710
0
13 Dec 2019
Neural Probabilistic Logic Programming in DeepProbLog
Robin Manhaeve
Sebastijan Dumancic
Angelika Kimmig
T. Demeester
Luc de Raedt
NAI
67
550
0
18 Jul 2019
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision
Jiayuan Mao
Chuang Gan
Pushmeet Kohli
J. Tenenbaum
Jiajun Wu
NAI
101
694
0
26 Apr 2019
Temporal Cycle-Consistency Learning
Debidatta Dwibedi
Y. Aytar
Jonathan Tompson
P. Sermanet
Andrew Zisserman
SSL
AI4TS
73
275
0
16 Apr 2019
VideoBERT: A Joint Model for Video and Language Representation Learning
Chen Sun
Austin Myers
Carl Vondrick
Kevin Patrick Murphy
Cordelia Schmid
VLM
SSL
56
1,238
0
03 Apr 2019
D3TW: Discriminative Differentiable Dynamic Time Warping for Weakly Supervised Action Alignment and Segmentation
C. Chang
De-An Huang
Yanan Sui
Li Fei-Fei
Juan Carlos Niebles
90
156
0
09 Jan 2019
Quantifying Generalization in Reinforcement Learning
K. Cobbe
Oleg Klimov
Christopher Hesse
Taehoon Kim
John Schulman
OffRL
87
662
0
06 Dec 2018
Graph R-CNN for Scene Graph Generation
Jianwei Yang
Jiasen Lu
Stefan Lee
Dhruv Batra
Devi Parikh
GNN
101
839
0
01 Aug 2018
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning
Alexander Richard
Hilde Kuehne
Ahsan Iqbal
Juergen Gall
74
137
0
17 May 2018
Scaling Egocentric Vision: The EPIC-KITCHENS Dataset
Dima Damen
Hazel Doughty
G. Farinella
Sanja Fidler
Antonino Furnari
...
Davide Moltisanti
Jonathan Munro
Toby Perrett
Will Price
Michael Wray
EgoV
95
1,011
0
08 Apr 2018
Weakly-Supervised Action Segmentation with Iterative Soft Boundary Assignment
Li Ding
Chenliang Xu
81
180
0
28 Mar 2018
Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification
Saining Xie
Chen Sun
Jonathan Huang
Zhuowen Tu
Kevin Patrick Murphy
3DH
137
1,317
0
13 Dec 2017
The "something something" video database for learning and evaluating visual common sense
Raghav Goyal
Samira Ebrahimi Kahou
Vincent Michalski
Joanna Materzynska
S. Westphal
...
Moritz Mueller-Freitag
F. Hoppe
Christian Thurau
Ingo Bax
Roland Memisevic
VLM
79
1,516
0
13 Jun 2017
The Kinetics Human Action Video Dataset
W. Kay
João Carreira
Karen Simonyan
Brian Zhang
Chloe Hillier
...
Tim Green
T. Back
Apostol Natsev
Mustafa Suleyman
Andrew Zisserman
209
3,771
0
19 May 2017
Visual Relationship Detection with Language Priors
Cewu Lu
Ranjay Krishna
Michael S. Bernstein
Li Fei-Fei
VLM
65
1,137
0
31 Jul 2016
Connectionist Temporal Modeling for Weakly Supervised Action Labeling
De-An Huang
Li Fei-Fei
Juan Carlos Niebles
66
237
0
28 Jul 2016
1