2504.15624
Cited By
v1
v2 (latest)
FaceInsight: A Multimodal Large Language Model for Face Perception
22 April 2025
Jingzhi Li
Changjiang Luo
Ruoyu Chen
Hua Zhang
Wenqi Ren
Jianhou Gan
Xiaochun Cao
CVBM
LRM
Papers citing
"FaceInsight: A Multimodal Large Language Model for Face Perception"
37 / 37 papers shown
Title
Qwen2.5-VL Technical Report
S. Bai
Keqin Chen
Xuejing Liu
Jialin Wang
Wenbin Ge
...
Zesen Cheng
Hang Zhang
Zhibo Yang
Haiyang Xu
Junyang Lin
VLM
441
699
0
20 Feb 2025
Visual Large Language Models for Generalized and Specialized Applications
Yifan Li
Zhixin Lai
Wentao Bao
Zhen Tan
Anh Dao
Kewei Sui
Jiayi Shen
Dong Liu
Huan Liu
Yu Kong
VLM
179
15
0
06 Jan 2025
Face-MLLM: A Large Face Perception Model
Haomiao Sun
Mingjie He
Tianheng Lian
Hu Han
Shiguang Shan
VLM
CVBM
LRM
70
6
0
28 Oct 2024
EMO-LLaMA: Enhancing Facial Emotion Understanding with Instruction Tuning
Bohao Xing
Zitong Yu
Xin Liu
Kaishen Yuan
Qilang Ye
Weicheng Xie
Huanjing Yue
Jingyu Yang
Heikki Kälviäinen
101
13
0
21 Aug 2024
LLaVA-OneVision: Easy Visual Task Transfer
Bo Li
Yuanhan Zhang
Dong Guo
Renrui Zhang
Feng Li
Hao Zhang
Kaichen Zhang
Yanwei Li
Ziwei Liu
Chunyuan Li
MLLM
SyDa
VLM
174
865
0
06 Aug 2024
MiniCPM-V: A GPT-4V Level MLLM on Your Phone
Yuan Yao
Tianyu Yu
Ao Zhang
Chongyi Wang
Junbo Cui
...
Xu Han
Guoyang Zeng
Dahai Li
Zhiyuan Liu
Maosong Sun
VLM
MLLM
149
481
0
03 Aug 2024
Task-adaptive Q-Face
Haomiao Sun
Mingjie He
Shiguang Shan
Hu Han
Xilin Chen
CVBM
95
4
0
15 May 2024
MANTIS: Interleaved Multi-Image Instruction Tuning
Dongfu Jiang
Xuan He
Huaye Zeng
Cong Wei
Max Ku
Qian Liu
Wenhu Chen
VLM
MLLM
125
125
0
02 May 2024
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone
Marah Abdin
Sam Ade Jacobs
A. A. Awan
J. Aneja
Ahmed Hassan Awadallah
...
Li Zhang
Yi Zhang
Yue Zhang
Yunan Zhang
Xiren Zhou
LRM
ALM
212
1,278
0
22 Apr 2024
FaceXFormer: A Unified Transformer for Facial Analysis
Kartik Narayan
VS Vibashan
Rama Chellappa
Vishal M. Patel
ViT
140
13
0
19 Mar 2024
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models
Bin Lin
Zhenyu Tang
Yang Ye
Jiaxi Cui
Bin Zhu
...
Jinfa Huang
Junwu Zhang
Yatian Pang
Munan Ning
Li-ming Yuan
VLM
MLLM
MoE
147
180
0
29 Jan 2024
VCoder: Versatile Vision Encoders for Multimodal Large Language Models
Jitesh Jain
Jianwei Yang
Humphrey Shi
MLLM
76
31
0
21 Dec 2023
VILA: On Pre-training for Visual Language Models
Ji Lin
Hongxu Yin
Ming-Yu Liu
Yao Lu
Pavlo Molchanov
Andrew Tao
Huizi Mao
Jan Kautz
Mohammad Shoeybi
Song Han
MLLM
VLM
176
430
0
12 Dec 2023
LION: Empowering Multimodal Large Language Model with Dual-Level Visual Knowledge
Gongwei Chen
Leyang Shen
Rui Shao
Xiang Deng
Liqiang Nie
VLM
MLLM
146
48
0
20 Nov 2023
LogicNet: A Logical Consistency Embedded Face Attribute Learning Network
Haiyu Wu
Sicong Tian
Huayu Li
Kevin W. Bowyer
72
2
0
19 Nov 2023
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models
Zhang Li
Biao Yang
Qiang Liu
Zhiyin Ma
Shuo Zhang
Jingxu Yang
Yabo Sun
Yuliang Liu
Xiang Bai
MLLM
140
278
0
11 Nov 2023
EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition
Niki Maria Foteinopoulou
Ioannis Patras
VLM
74
17
0
25 Oct 2023
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Jun Chen
Deyao Zhu
Xiaoqian Shen
Xiang Li
Zechun Liu
Pengchuan Zhang
Raghuraman Krishnamoorthi
Vikas Chandra
Yunyang Xiong
Mohamed Elhoseiny
MLLM
255
475
0
14 Oct 2023
Improved Baselines with Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Yuheng Li
Yong Jae Lee
VLM
MLLM
246
2,834
0
05 Oct 2023
Prompting Visual-Language Models for Dynamic Facial Expression Recognition
Zengqun Zhao
Ioannis Patras
VLM
98
34
0
25 Aug 2023
SwinFace: A Multi-task Transformer for Face Recognition, Expression Recognition, Age Estimation and Attribute Estimation
Lixiong Qin
Mei Wang
Chao Deng
K. Wang
Xiangshan Chen
Jiani Hu
Weihong Deng
CVBM
ViT
81
47
0
22 Aug 2023
MiVOLO: Multi-input Transformer for Age and Gender Estimation
Maksim Kuprashevich
Irina Tolstykh
91
38
0
10 Jul 2023
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Qinghao Ye
Haiyang Xu
Guohai Xu
Jiabo Ye
Ming Yan
...
Junfeng Tian
Qiang Qi
Ji Zhang
Feiyan Huang
Jingren Zhou
VLM
MLLM
313
957
0
27 Apr 2023
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language Models
Deyao Zhu
Jun Chen
Xiaoqian Shen
Xiang Li
Mohamed Elhoseiny
VLM
MLLM
171
2,080
0
20 Apr 2023
Visual Instruction Tuning
Haotian Liu
Chunyuan Li
Qingyang Wu
Yong Jae Lee
SyDa
VLM
MLLM
584
4,948
0
17 Apr 2023
LLaMA: Open and Efficient Foundation Language Models
Hugo Touvron
Thibaut Lavril
Gautier Izacard
Xavier Martinet
Marie-Anne Lachaux
...
Faisal Azhar
Aurelien Rodriguez
Armand Joulin
Edouard Grave
Guillaume Lample
ALM
PILM
1.7K
13,557
0
27 Feb 2023
Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning
Haiyu Wu
Grace Bezold
Aman Bhatta
Kevin W. Bowyer
CVBM
77
15
0
22 Feb 2023
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models
Junnan Li
Dongxu Li
Silvio Savarese
Steven C. H. Hoi
VLM
MLLM
616
4,679
0
30 Jan 2023
Label2Label: A Language Modeling Framework for Multi-Attribute Learning
Wanhua Li
Zhexuan Cao
Jianjiang Feng
Jie Zhou
Jiwen Lu
VLM
99
28
0
18 Jul 2022
General Facial Representation Learning in a Visual-Linguistic Manner
Yinglin Zheng
Hao Yang
Ting Zhang
Jianmin Bao
Dongdong Chen
Yangyu Huang
Lu Yuan
Dong Chen
Ming Zeng
Fang Wen
CVBM
211
176
0
06 Dec 2021
Pre-training strategies and datasets for facial representation learning
Adrian Bulat
Shiyang Cheng
Jing Yang
A. Garbett
Enrique Sanchez
Georgios Tzimiropoulos
CVBM
SSL
55
36
0
30 Mar 2021
Learning Transferable Visual Models From Natural Language Supervision
Alec Radford
Jong Wook Kim
Chris Hallacy
Aditya A. Ramesh
Gabriel Goh
...
Amanda Askell
Pamela Mishkin
Jack Clark
Gretchen Krueger
Ilya Sutskever
CLIP
VLM
1.1K
30,115
0
26 Feb 2021
MAAD-Face: A Massively Annotated Attribute Dataset for Face Images
Philipp Terhörst
Daniel Fahrmann
Jan Niklas Kolf
Naser Damer
Florian Kirchbuchner
Arjan Kuijper
CVBM
83
37
0
02 Dec 2020
Rank consistent ordinal regression for neural networks with application to age estimation
Wenzhi Cao
Vahid Mirjalili
S. Raschka
184
214
0
20 Jan 2019
Age Progression/Regression by Conditional Adversarial Autoencoder
Zhifei Zhang
Yang Song
Hairong Qi
GAN
CVBM
94
1,126
0
27 Feb 2017
From Facial Expression Recognition to Interpersonal Relation Prediction
Zhanpeng Zhang
Ping Luo
Chen Change Loy
Xiaoou Tang
CVBM
70
273
0
21 Sep 2016
Deep Learning Face Attributes in the Wild
Ziwei Liu
Ping Luo
Xiaogang Wang
Xiaoou Tang
CVBM
282
8,454
0
28 Nov 2014