ResearchTrend.AI
LAION-5B: An open large-scale dataset for training next generation image-text models

16 October 2022
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
Mehdi Cherti
Theo Coombes
Aarush Katta
Clayton Mullis
Mitchell Wortsman
P. Schramowski
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
    VLM
    MLLM
    CLIP
arXiv:2210.08402

Papers citing "LAION-5B: An open large-scale dataset for training next generation image-text models"

50 / 665 papers shown
Effective Data Augmentation With Diffusion Models
Brandon Trabucco
Kyle Doherty
Max Gurinas
Ruslan Salakhutdinov
VLM
DiffM
32
232
0
07 Feb 2023
Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness
Felix Friedrich
Manuel Brack
Lukas Struppek
Dominik Hintersdorf
P. Schramowski
Sasha Luccioni
Kristian Kersting
38
120
0
07 Feb 2023
Mixture of Diffusers for scene composition and high resolution image generation
Á. Jiménez
DiffM
21
45
0
05 Feb 2023
Eliminating Contextual Prior Bias for Semantic Image Editing via Dual-Cycle Diffusion
Zuopeng Yang
Tianshu Chu
Xin Lin
Erdun Gao
Daqing Liu
J. Yang
Chaoyue Wang
DiffM
34
16
0
05 Feb 2023
Contrast with Reconstruct: Contrastive 3D Representation Learning Guided by Generative Pretraining
Zekun Qi
Runpei Dong
Guo Fan
Zheng Ge
Xiangyu Zhang
Kaisheng Ma
Li Yi
38
118
0
05 Feb 2023
Semantic-Guided Generative Image Augmentation Method with Diffusion Models for Image Classification
Bohan Li
Xiao Xu
Xinghao Wang
Yutai Hou
Yunlong Feng
Feng Wang
Xuanliang Zhang
Qingfu Zhu
Wanxiang Che
DiffM
VLM
26
10
0
04 Feb 2023
TEXTure: Text-Guided Texturing of 3D Shapes
Elad Richardson
G. Metzer
Yuval Alaluf
Raja Giryes
Daniel Cohen-Or
DiffM
35
260
0
03 Feb 2023
Are Diffusion Models Vulnerable to Membership Inference Attacks?
Jinhao Duan
Fei Kong
Shiqi Wang
Xiaoshuang Shi
Kaidi Xu
35
109
0
02 Feb 2023
Debiasing Vision-Language Models via Biased Prompts
Ching-Yao Chuang
Varun Jampani
Yuanzhen Li
Antonio Torralba
Stefanie Jegelka
VLM
30
97
0
31 Jan 2023
Discovering and Mitigating Visual Biases through Keyword Explanation
Younghyun Kim
Sangwoo Mo
Minkyu Kim
Kyungmin Lee
Jaeho Lee
Jinwoo Shin
40
33
0
26 Jan 2023
StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
Axel Sauer
Tero Karras
S. Laine
Andreas Geiger
Timo Aila
37
209
0
23 Jan 2023
Human-Timescale Adaptation in an Open-Ended Task Space
Adaptive Agent Team
Jakob Bauer
Kate Baumli
Satinder Baveja
Feryal M. P. Behbahani
...
Jakub Sygnowski
K. Tuyls
Sarah York
Alexander Zacherl
Lei Zhang
LM&Ro
OffRL
AI4CE
LRM
38
109
0
18 Jan 2023
RILS: Masked Visual Reconstruction in Language Semantic Space
Shusheng Yang
Yixiao Ge
Kun Yi
Dian Li
Ying Shan
Xiaohu Qie
Xinggang Wang
CLIP
43
11
0
17 Jan 2023
Toward Building General Foundation Models for Language, Vision, and Vision-Language Understanding Tasks
Xinsong Zhang
Yan Zeng
Jipeng Zhang
Hang Li
VLM
AI4CE
LRM
22
17
0
12 Jan 2023
CiT: Curation in Training for Effective Vision-Language Data
Hu Xu
Saining Xie
Po-Yao (Bernie) Huang
Licheng Yu
Russ Howes
Gargi Ghosh
Luke Zettlemoyer
Christoph Feichtenhofer
VLM
DiffM
33
25
0
05 Jan 2023
Muse: Text-To-Image Generation via Masked Generative Transformers
Huiwen Chang
Han Zhang
Jarred Barber
AJ Maschinot
José Lezama
...
Kevin Patrick Murphy
William T. Freeman
Michael Rubinstein
Yuanzhen Li
Dilip Krishnan
DiffM
197
521
0
02 Jan 2023
Exploring Vision Transformers as Diffusion Learners
He Cao
Jianan Wang
Tianhe Ren
Xianbiao Qi
Yihao Chen
Yuan Yao
Lefei Zhang
44
10
0
28 Dec 2022
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Jay Zhangjie Wu
Yixiao Ge
Xintao Wang
Weixian Lei
Yuchao Gu
Yufei Shi
W. Hsu
Ying Shan
Xiaohu Qie
Mike Zheng Shou
VGen
62
692
0
22 Dec 2022
Diffusing Surrogate Dreams of Video Scenes to Predict Video Memorability
Lorin Sweeney
Graham Healy
Alan F. Smeaton
DiffM
22
2
0
19 Dec 2022
Transferring General Multimodal Pretrained Models to Text Recognition
Junyang Lin
Xuancheng Ren
Yichang Zhang
Gao Liu
Peng Wang
An Yang
Chang Zhou
34
4
0
19 Dec 2022
Objaverse: A Universe of Annotated 3D Objects
Matt Deitke
Dustin Schwenk
Jordi Salvador
Luca Weihs
Oscar Michel
Eli VanderBilt
Ludwig Schmidt
Kiana Ehsani
Aniruddha Kembhavi
Ali Farhadi
29
890
0
15 Dec 2022
The Stable Artist: Steering Semantics in Diffusion Latent Space
Manuel Brack
P. Schramowski
Felix Friedrich
Dominik Hintersdorf
Kristian Kersting
DiffM
19
25
0
12 Dec 2022
A Whac-A-Mole Dilemma: Shortcuts Come in Multiples Where Mitigating One Amplifies Others
Zhiheng Li
Ivan Evtimov
Albert Gordo
C. Hazirbas
Tal Hassner
Cristian Canton Ferrer
Chenliang Xu
Mark Ibrahim
39
71
0
09 Dec 2022
Refiner: Data Refining against Gradient Leakage Attacks in Federated Learning
Mingyuan Fan
Cen Chen
Chengyu Wang
Ximeng Liu
Wenmeng Zhou
Jun Huang
AAML
FedML
34
0
0
05 Dec 2022
Scaling Language-Image Pre-training via Masking
Yanghao Li
Haoqi Fan
Ronghang Hu
Christoph Feichtenhofer
Kaiming He
CLIP
VLM
27
318
0
01 Dec 2022
Score Jacobian Chaining: Lifting Pretrained 2D Diffusion Models for 3D Generation
Haochen Wang
Xiaodan Du
Jiahao Li
Raymond A. Yeh
Gregory Shakhnarovich
DiffM
60
527
0
01 Dec 2022
One-shot recognition of any material anywhere using contrastive learning with physics-based rendering
Manuel S. Drehwald
S. Eppel
Jolina Li
Han Hao
Alán Aspuru-Guzik
33
6
0
01 Dec 2022
DATID-3D: Diversity-Preserved Domain Adaptation Using Text-to-Image Diffusion for 3D Generative Model
Gwanghyun Kim
S. Chun
DiffM
33
39
0
29 Nov 2022
Context-Aware Robust Fine-Tuning
Xiaofeng Mao
YueFeng Chen
Xiaojun Jia
Rong Zhang
Hui Xue
Zhao Li
VLM
CLIP
35
25
0
29 Nov 2022
Peekaboo: Text to Image Diffusion Models are Zero-Shot Segmentors
R. Burgert
Kanchana Ranasinghe
Xiang Li
Michael S. Ryoo
DiffM
VLM
34
37
0
23 Nov 2022
ReCo: Region-Controlled Text-to-Image Generation
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
...
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
DiffM
56
140
0
23 Nov 2022
Open-vocabulary Attribute Detection
M. A. Bravo
Sudhanshu Mittal
Simon Ging
Thomas Brox
VLM
ObjD
19
30
0
23 Nov 2022
RoentGen: Vision-Language Foundation Model for Chest X-ray Generation
Pierre J. Chambon
Christian Blüthgen
Jean-Benoit Delbrouck
Rogier van der Sluijs
M. Polacin
Juan Manuel Zambrano Chaves
Tanishq Mathew Abraham
Shivanshu Purohit
C. Langlotz
Akshay S. Chaudhari
LM&MA
DiffM
MedIm
37
98
0
23 Nov 2022
Investigating Prompt Engineering in Diffusion Models
Sam Witteveen
Martin Andrews
11
58
0
21 Nov 2022
Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
Xichen Pan
Pengda Qin
Yuhong Li
Hui Xue
Wenhu Chen
DiffM
29
62
0
20 Nov 2022
InstructPix2Pix: Learning to Follow Image Editing Instructions
Tim Brooks
Aleksander Holynski
Alexei A. Efros
DiffM
83
1,709
0
17 Nov 2022
GLAMI-1M: A Multilingual Image-Text Fashion Dataset
Vaclav Kosar
A. Hoskovec
Milan Šulc
Radek Bartyzal
VLM
32
3
0
17 Nov 2022
EVA: Exploring the Limits of Masked Visual Representation Learning at Scale
Yuxin Fang
Wen Wang
Binhui Xie
Quan-Sen Sun
Ledell Yu Wu
Xinggang Wang
Tiejun Huang
Xinlong Wang
Yue Cao
VLM
CLIP
87
679
0
14 Nov 2022
AltCLIP: Altering the Language Encoder in CLIP for Extended Language Capabilities
Zhongzhi Chen
Guangyi Liu
Bo-Wen Zhang
Fulong Ye
Qinghong Yang
Ledell Yu Wu
VLM
37
80
0
12 Nov 2022
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion Models
P. Schramowski
Manuel Brack
Bjorn Deiseroth
Kristian Kersting
42
272
0
09 Nov 2022
Rickrolling the Artist: Injecting Backdoors into Text Encoders for Text-to-Image Synthesis
Lukas Struppek
Dominik Hintersdorf
Kristian Kersting
SILM
22
36
0
04 Nov 2022
Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models
Muyang Li
Ji Lin
Chenlin Meng
Stefano Ermon
Song Han
Jun-Yan Zhu
DiffM
40
45
0
03 Nov 2022
DiffusionDB: A Large-scale Prompt Gallery Dataset for Text-to-Image Generative Models
Zijie J. Wang
Evan Montoya
David Munechika
Haoyang Yang
Benjamin Hoover
Duen Horng Chau
41
288
0
26 Oct 2022
Conditional Diffusion with Less Explicit Guidance via Model Predictive Control
Max W. Shen
Ehsan Hajiramezanali
Gabriele Scalia
Alex Tseng
N. Diamant
Tommaso Biancalani
Andreas Loukas
34
1
0
21 Oct 2022
Language Does More Than Describe: On The Lack Of Figurative Speech in Text-To-Image Models
Ricardo Kleinlein
Cristina Luna Jiménez
Fernando Fernández-Martínez
DiffM
20
3
0
19 Oct 2022
5th Place Solution to Kaggle Google Universal Image Embedding Competition
Noriaki Ota
Shingo Yokoi
Shinsuke Yamaoka
123
2
0
18 Oct 2022
1st Place Solution in Google Universal Images Embedding
Shihao Shao
Qinghua Cui
3DGS
25
7
0
16 Oct 2022
Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance
Chen Henry Wu
Fernando de la Torre
DiffM
33
67
0
11 Oct 2022
What the DAAM: Interpreting Stable Diffusion Using Cross Attention
Raphael Tang
Linqing Liu
Akshat Pandey
Zhiying Jiang
Gefei Yang
K. Kumar
Pontus Stenetorp
Jimmy J. Lin
Ferhan Ture
34
167
0
10 Oct 2022
DreamFusion: Text-to-3D using 2D Diffusion
Ben Poole
Ajay Jain
Jonathan T. Barron
B. Mildenhall
82
2,319
0
29 Sep 2022