Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2009.14558
Cited By
Learning Object Detection from Captions via Textual Scene Attributes
30 September 2020
Achiya Jerbi
Roei Herzig
Jonathan Berant
Gal Chechik
Amir Globerson
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Learning Object Detection from Captions via Textual Scene Attributes"
38 / 38 papers shown
Title
Enhancing Few-Shot Vision-Language Classification with Large Multimodal Model Features
Chancharik Mitra
Brandon Huang
Tianning Chai
Zhiqiu Lin
Assaf Arbelle
Rogerio Feris
Leonid Karlinsky
Trevor Darrell
Deva Ramanan
Roei Herzig
VLM
320
4
0
28 Nov 2024
3VL: Using Trees to Improve Vision-Language Models' Interpretability
Nir Yellinek
Leonid Karlinsky
Raja Giryes
CoGe
VLM
255
3
0
28 Dec 2023
Compositional Video Synthesis with Action Graphs
Amir Bar
Roei Herzig
Xiaolong Wang
Anna Rohrbach
Gal Chechik
Trevor Darrell
Amir Globerson
79
44
0
27 Jun 2020
Contrastive Learning for Weakly Supervised Phrase Grounding
Tanmay Gupta
Arash Vahdat
Gal Chechik
Xiaodong Yang
Jan Kautz
Derek Hoiem
ObjD
SSL
119
144
0
17 Jun 2020
Instance-aware, Context-focused, and Memory-efficient Weakly Supervised Object Detection
Zhongzheng Ren
Zhiding Yu
Xiaodong Yang
Xuan Li
Yong Jae Lee
Alex Schwing
Jan Kautz
WSOD
62
201
0
09 Apr 2020
Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks
Joanna Materzynska
Tete Xiao
Roei Herzig
Huijuan Xu
Xiaolong Wang
Trevor Darrell
CoGe
53
176
0
20 Dec 2019
Learning Canonical Representations for Scene Graph to Image Generation
Roei Herzig
Amir Bar
Huijuan Xu
Gal Chechik
Trevor Darrell
Amir Globerson
GNN
OCL
69
108
0
16 Dec 2019
UNITER: UNiversal Image-TExt Representation Learning
Yen-Chun Chen
Linjie Li
Licheng Yu
Ahmed El Kholy
Faisal Ahmed
Zhe Gan
Yu Cheng
Jingjing Liu
VLM
OT
112
447
0
25 Sep 2019
Triplet-Aware Scene Graph Embeddings
Brigit Schroeder
Subarna Tripathi
Hanlin Tang
3DPC
59
16
0
19 Sep 2019
Scene Graph Parsing by Attention Graph
Martin Andrews
Yew Ken Chia
Sam Witteveen
GNN
32
12
0
13 Sep 2019
VL-BERT: Pre-training of Generic Visual-Linguistic Representations
Weijie Su
Xizhou Zhu
Yue Cao
Bin Li
Lewei Lu
Furu Wei
Jifeng Dai
VLM
MLLM
SSL
169
1,666
0
22 Aug 2019
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
Hao Hao Tan
Joey Tianyi Zhou
VLM
MLLM
247
2,488
0
20 Aug 2019
ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks
Jiasen Lu
Dhruv Batra
Devi Parikh
Stefan Lee
SSL
VLM
237
3,693
0
06 Aug 2019
Cap2Det: Learning to Amplify Weak Caption Supervision for Object Detection
Keren Ye
Ruotong Wang
Adriana Kovashka
Wei Li
Danfeng Qin
Jesse Berent
111
60
0
23 Jul 2019
Precise Detection in Densely Packed Scenes
Eran Goldman
Roei Herzig
Aviv Eisenschtat
Oria Ratzon
Itsik Levi
Jacob Goldberger
Tal Hassner
71
187
0
01 Apr 2019
Differentiable Scene Graphs
Moshiko Raboh
Roei Herzig
Gal Chechik
Jonathan Berant
Amir Globerson
OCL
66
34
0
26 Feb 2019
Grounded Video Description
Luowei Zhou
Yannis Kalantidis
Xinlei Chen
Jason J. Corso
Marcus Rohrbach
83
193
0
17 Dec 2018
Multi-level Multimodal Common Semantic Space for Image-Phrase Grounding
Hassan Akbari
Svebor Karaman
Surabhi Bhargava
Brian Chen
Carl Vondrick
Shih-Fu Chang
55
83
0
28 Nov 2018
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
1.8K
95,114
0
11 Oct 2018
SNIPER: Efficient Multi-Scale Training
Bharat Singh
Mahyar Najibi
L. Davis
ObjD
75
486
0
23 May 2018
Exploring the Limits of Weakly Supervised Pretraining
D. Mahajan
Ross B. Girshick
Vignesh Ramanathan
Kaiming He
Manohar Paluri
Yixuan Li
Ashwin R. Bharambe
Laurens van der Maaten
VLM
193
1,369
0
02 May 2018
Image Generation from Scene Graphs
Justin Johnson
Agrim Gupta
Li Fei-Fei
GNN
303
820
0
04 Apr 2018
Referring Relationships
Ranjay Krishna
Ines Chami
Michael S. Bernstein
Li Fei-Fei
77
94
0
28 Mar 2018
Scene Graph Parsing as Dependency Parsing
Yu-Siang Wang
Chenxi Liu
Fangyin Wei
Alan Yuille
GNN
3DV
45
53
0
25 Mar 2018
Mapping Images to Scene Graphs with Permutation-Invariant Structured Prediction
Roei Herzig
Moshiko Raboh
Gal Chechik
Jonathan Berant
Amir Globerson
GNN
OCL
73
135
0
15 Feb 2018
Neural Motifs: Scene Graph Parsing with Global Context
Rowan Zellers
Mark Yatskar
Sam Thomson
Yejin Choi
GNN
88
999
0
17 Nov 2017
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
Peter Anderson
Xiaodong He
Chris Buehler
Damien Teney
Mark Johnson
Stephen Gould
Lei Zhang
AIMat
121
4,220
0
25 Jul 2017
Multiple Instance Detection Network with Online Instance Classifier Refinement
Peng Tang
Xinggang Wang
X. Bai
Wenyu Liu
WSOD
75
442
0
01 Apr 2017
Mask R-CNN
Kaiming He
Georgia Gkioxari
Piotr Dollár
Ross B. Girshick
ObjD
352
27,230
0
20 Mar 2017
Scene Graph Generation by Iterative Message Passing
Danfei Xu
Yuke Zhu
Chris Choy
Li Fei-Fei
GNN
3DV
97
1,224
0
10 Jan 2017
Modeling Relationships in Referential Expressions with Compositional Modular Networks
Ronghang Hu
Marcus Rohrbach
Jacob Andreas
Trevor Darrell
Kate Saenko
82
406
0
30 Nov 2016
On Support Relations and Semantic Scene Graphs
M. Yang
Wentong Liao
H. Ackermann
Bodo Rosenhahn
GNN
50
60
0
19 Sep 2016
Audio Event Detection using Weakly Labeled Data
Anurag Kumar
Bhiksha Raj
56
173
0
09 May 2016
Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations
Ranjay Krishna
Yuke Zhu
Oliver Groth
Justin Johnson
Kenji Hata
...
Yannis Kalantidis
Li Li
David A. Shamma
Michael S. Bernstein
Fei-Fei Li
223
5,761
0
23 Feb 2016
Learning Deep Features for Discriminative Localization
Bolei Zhou
A. Khosla
Àgata Lapedriza
A. Oliva
Antonio Torralba
SSL
SSeg
FAtt
250
9,326
0
14 Dec 2015
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
886
27,412
0
02 Dec 2015
Weakly Supervised Deep Detection Networks
Hakan Bilen
Andrea Vedaldi
WSOD
93
790
0
09 Nov 2015
Microsoft COCO Captions: Data Collection and Evaluation Server
Xinlei Chen
Hao Fang
Nayeon Lee
Ramakrishna Vedantam
Saurabh Gupta
Piotr Dollar
C. L. Zitnick
215
2,489
0
01 Apr 2015
1