Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2403.01693
Cited By
HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances
4 March 2024
Supreeth Narasimhaswamy
Uttaran Bhattacharya
Xiang Chen
Ishita Dasgupta
Saayan Mitra
Minh Hoai
DiffM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances"
33 / 33 papers shown
Title
Adding Conditional Control to Text-to-Image Diffusion Models
Lvmin Zhang
Anyi Rao
Maneesh Agrawala
AI4CE
112
4,104
1
10 Feb 2023
IMos: Intent-Driven Full-Body Motion Synthesis for Human-Object Interactions
Anindita Ghosh
Rishabh Dabral
Vladislav Golyanik
Christian Theobalt
P. Slusallek
DiffM
113
94
0
14 Dec 2022
ReCo: Region-Controlled Text-to-Image Generation
Zhengyuan Yang
Jianfeng Wang
Zhe Gan
Linjie Li
Kevin Qinghong Lin
...
Nan Duan
Zicheng Liu
Ce Liu
Michael Zeng
Lijuan Wang
DiffM
80
149
0
23 Nov 2022
LAION-5B: An open large-scale dataset for training next generation image-text models
Christoph Schuhmann
Romain Beaumont
Richard Vencu
Cade Gordon
Ross Wightman
...
Srivatsa Kundurthy
Katherine Crowson
Ludwig Schmidt
R. Kaczmarczyk
J. Jitsev
VLM
MLLM
CLIP
146
3,438
0
16 Oct 2022
Human Motion Diffusion Model
Guy Tevet
Sigal Raab
Brian Gordon
Yonatan Shafir
Daniel Cohen-Or
Amit H. Bermano
DiffM
VGen
260
756
0
29 Sep 2022
FLAME: Free-form Language-based Motion Synthesis & Editing
Jihoon Kim
Jiseob Kim
Sungjoon Choi
VGen
79
210
0
01 Sep 2022
MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model
Mingyuan Zhang
Zhongang Cai
Liang Pan
Fangzhou Hong
Xinying Guo
Lei Yang
Ziwei Liu
DiffM
VGen
94
568
0
31 Aug 2022
Classifier-Free Diffusion Guidance
Jonathan Ho
Tim Salimans
FaML
180
3,885
0
26 Jul 2022
HaGRID - HAnd Gesture Recognition Image Dataset
A. Kapitanov
Karina Kvanchiani
Alexander Nagaev
Roman Kraynov
Andrew Makhlyarchuk
50
61
0
16 Jun 2022
AvatarCLIP: Zero-Shot Text-Driven Generation and Animation of 3D Avatars
Fangzhou Hong
Mingyuan Zhang
Liang Pan
Zhongang Cai
Lei Yang
Ziwei Liu
CLIP
112
83
0
17 May 2022
TEMOS: Generating diverse human motions from textual descriptions
Mathis Petrovich
Michael J. Black
Gül Varol
98
386
0
25 Apr 2022
Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
Oran Gafni
Adam Polyak
Oron Ashual
Shelly Sheynin
Devi Parikh
Yaniv Taigman
DiffM
57
520
0
24 Mar 2022
Pseudo Numerical Methods for Diffusion Models on Manifolds
Luping Liu
Yi Ren
Zhijie Lin
Zhou Zhao
DiffM
92
646
0
20 Feb 2022
Embodied Hands: Modeling and Capturing Hands and Bodies Together
Javier Romero
Dimitrios Tzionas
Michael J. Black
3DH
CVBM
75
402
0
07 Jan 2022
High-Resolution Image Synthesis with Latent Diffusion Models
Robin Rombach
A. Blattmann
Dominik Lorenz
Patrick Esser
Bjorn Ommer
3DV
388
15,454
0
20 Dec 2021
GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
Alex Nichol
Prafulla Dhariwal
Aditya A. Ramesh
Pranav Shyam
Pamela Mishkin
Bob McGrew
Ilya Sutskever
Mark Chen
313
3,594
0
20 Dec 2021
Pix2seq: A Language Modeling Framework for Object Detection
Ting-Li Chen
Saurabh Saxena
Lala Li
David J. Fleet
Geoffrey E. Hinton
MLLM
ViT
VLM
262
347
0
22 Sep 2021
Design Guidelines for Prompt Engineering Text-to-Image Generative Models
Vivian Liu
Lydia B. Chilton
61
489
0
14 Sep 2021
SDEdit: Guided Image Synthesis and Editing with Stochastic Differential Equations
Chenlin Meng
Yutong He
Yang Song
Jiaming Song
Jiajun Wu
Jun-Yan Zhu
Stefano Ermon
DiffM
132
1,485
0
02 Aug 2021
BABEL: Bodies, Action and Behavior with English Labels
Abhinanda R. Punnakkal
Arjun Chandrasekaran
Nikos Athanasiou
Alejandra Quiros-Ramirez
Michael J. Black Max Planck Institute for Intelligent Systems
57
217
0
17 Jun 2021
Cascaded Diffusion Models for High Fidelity Image Generation
Jonathan Ho
Chitwan Saharia
William Chan
David J. Fleet
Mohammad Norouzi
Tim Salimans
145
1,218
0
30 May 2021
Diffusion Models Beat GANs on Image Synthesis
Prafulla Dhariwal
Alex Nichol
193
7,818
0
11 May 2021
Action-Conditioned 3D Human Motion Synthesis with Transformer VAE
Mathis Petrovich
Michael J. Black
Gül Varol
ViT
88
503
0
12 Apr 2021
Zero-Shot Text-to-Image Generation
Aditya A. Ramesh
Mikhail Pavlov
Gabriel Goh
Scott Gray
Chelsea Voss
Alec Radford
Mark Chen
Ilya Sutskever
VLM
387
4,937
0
24 Feb 2021
Monocular, One-stage, Regression of Multiple 3D People
Yu Sun
Qian Bao
Wu Liu
Yili Fu
Michael J. Black
Tao Mei
3DH
67
277
0
27 Aug 2020
DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis
Ming Tao
Hao Tang
Leilei Gan
Xiaoyuan Jing
Bingkun Bao
Changsheng Xu
91
213
0
13 Aug 2020
MediaPipe Hands: On-device Real-time Hand Tracking
Fan Zhang
Valentin Bazarevsky
Andrey Vakunov
A. Tkachenka
George Sung
Chuo-Ling Chang
Matthias Grundmann
3DH
MedIm
73
738
0
18 Jun 2020
NTU RGB+D 120: A Large-Scale Benchmark for 3D Human Activity Understanding
Jun Liu
Amir Shahroudy
Mauricio Perez
G. Wang
Ling-yu Duan
Alex C. Kot
77
1,289
0
12 May 2019
Contextual Attention for Hand Detection in the Wild
Supreeth Narasimhaswamy
Zheng Wei
Yanjie Wang
Justin Zhang
Minh Hoai
HAI
46
55
0
09 Apr 2019
AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks
Tao Xu
Pengchuan Zhang
Qiuyuan Huang
Han Zhang
Zhe Gan
Xiaolei Huang
Xiaodong He
GAN
ViT
105
1,716
0
28 Nov 2017
The KIT Motion-Language Dataset
Matthias Plappert
Christian Mandery
Tamim Asfour
233
285
0
13 Jul 2016
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
Jascha Narain Sohl-Dickstein
Eric A. Weiss
Niru Maheswaranathan
Surya Ganguli
SyDa
DiffM
281
6,925
0
12 Mar 2015
Microsoft COCO: Common Objects in Context
Nayeon Lee
Michael Maire
Serge J. Belongie
Lubomir Bourdev
Ross B. Girshick
James Hays
Pietro Perona
Deva Ramanan
C. L. Zitnick
Piotr Dollár
ObjD
398
43,619
0
01 May 2014
1