Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2110.14378
Cited By
Towards artificial general intelligence via a multimodal foundation model
27 October 2021
Nanyi Fei
Zhiwu Lu
Yizhao Gao
Guoxing Yang
Yuqi Huo
Jing Wen
Haoyu Lu
Ruihua Song
Xin Gao
Tao Xiang
Haoran Sun
Jiling Wen
AI4CE
LRM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Towards artificial general intelligence via a multimodal foundation model"
32 / 32 papers shown
Title
Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models
Jiawei Lian
Jianhong Pan
L. Wang
Yi Wang
Shaohui Mei
Lap-Pui Chau
AAML
31
0
0
07 Apr 2025
Taxonomy-Guided Zero-Shot Recommendations with LLMs
Yueqing Liang
Liangwei Yang
Chen Wang
Xiongxiao Xu
Philip S. Yu
Kai Shu
72
6
0
21 Feb 2025
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
Yibo Yan
Shen Wang
Jiahao Huo
Jingheng Ye
Zhendong Chu
Xuming Hu
Philip S. Yu
Carla P. Gomes
B. Selman
Qingsong Wen
LRM
127
9
0
05 Feb 2025
The Jumping Reasoning Curve? Tracking the Evolution of Reasoning Performance in GPT-[n] and o-[n] Models on Multimodal Puzzles
Vernon Y.H. Toh
Yew Ken Chia
Deepanway Ghosal
Soujanya Poria
LRM
ReLM
ELM
84
1
0
03 Feb 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Min Zhang
LM&MA
AILaw
93
154
0
28 Jan 2025
Gensors: Authoring Personalized Visual Sensors with Multimodal Foundation Models and Reasoning
Michael Xieyang Liu
S. Petridis
Vivian Tsai
Alexander J. Fiannaca
Alex Olwal
Michael Terry
Carrie J. Cai
LRM
42
1
0
28 Jan 2025
Combining Knowledge Graph and LLMs for Enhanced Zero-shot Visual Question Answering
Qian Tao
Xiaoyang Fan
Yong Xu
Xingquan Zhu
Yufei Tang
50
0
0
22 Jan 2025
Fine-Tuning Games: Bargaining and Adaptation for General-Purpose Models
Benjamin Laufer
Jon M. Kleinberg
Hoda Heidari
60
8
0
03 Jan 2025
Seamless Optical Cloud Computing across Edge-Metro Network for Generative AI
Sizhe Xing
Aolong Sun
Chengxi Wang
Yizhi Wang
Boyu Dong
...
Xi Xiao
R. Penty
Qixiang Cheng
Nan Chi
Junwen Zhang
113
0
0
04 Dec 2024
Transmission Line Defect Detection Based on UAV Patrol Images and Vision-language Pretraining
Ke Zhang
Zhaoye Zheng
Yurong Guo
Jiacun Wang
Jiyuan Yang
Yangjie Xiao
VLM
79
0
0
18 Nov 2024
Tissue Concepts: supervised foundation models in computational pathology
Till Nicke
Jan Raphael Schaefer
Henning Hoefener
Friedrich Feuerhake
Dorit Merhof
Fabian Kiessling
Johannes Lotz
MedIm
50
0
0
05 Sep 2024
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation Model
Di Wang
Meiqi Hu
Yao Jin
Yuchun Miao
Jiaqi Yang
...
Lefei Zhang
Chen Wu
Bo Du
Dacheng Tao
Liangpei Zhang
66
25
0
17 Jun 2024
Potentials of the Metaverse for Robotized Applications in Industry 4.0 and Industry 5.0
E. Kaigom
32
2
0
31 Mar 2024
When Large Language Models Meet Evolutionary Algorithms: Potential Enhancements and Challenges
Wang Chao
Jiaxuan Zhao
Licheng Jiao
Lingling Li
Fang Liu
Shuyuan Yang
75
13
0
19 Jan 2024
Large Scale Foundation Models for Intelligent Manufacturing Applications: A Survey
Haotian Zhang
S. D. Semujju
Zhicheng Wang
Xianwei Lv
Kang Xu
...
Jing Wu
Zhuo Long
Wensheng Liang
Xiaoguang Ma
Ruiyan Zhuang
UQCV
AI4TS
AI4CE
29
4
0
11 Dec 2023
Language models in molecular discovery
Chaoqi Wang
Yibo Jiang
Chenghao Yang
Han Liu
Yuxin Chen
25
7
0
28 Sep 2023
UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models
Sen Fang
Bowen Gao
Yangjian Wu
T. Teoh
DiffM
34
1
0
29 Jul 2023
A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents
Sukai Huang
N. Lipovetzky
Trevor Cohn
35
2
0
26 May 2023
CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model
Shuai Zhao
Xiaohan Wang
Linchao Zhu
Yezhou Yang
CLIP
VLM
23
25
0
23 May 2023
Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining
Bingqian Lin
Zicong Chen
Mingjie Li
Haokun Lin
Hang Xu
...
Ling-Hao Chen
Xiaojun Chang
Yi Yang
L. Xing
Xiaodan Liang
LM&MA
MedIm
AI4CE
40
14
0
26 Apr 2023
AGI: Artificial General Intelligence for Education
Ehsan Latif
Gengchen Mai
Matthew Nyaaba
Xuansheng Wu
Ninghao Liu
Guoyu Lu
Sheng Li
Tianming Liu
Xiaoming Zhai
ELM
AI4CE
35
22
0
24 Apr 2023
Active Self-Supervised Learning: A Few Low-Cost Relationships Are All You Need
Vivien A. Cabannes
Léon Bottou
Yann LeCun
Randall Balestriero
48
13
0
27 Mar 2023
Borrowing Human Senses: Comment-Aware Self-Training for Social Media Multimodal Classification
Chunpu Xu
Jing Li
VLM
26
5
0
27 Mar 2023
The Shaky Foundations of Clinical Foundation Models: A Survey of Large Language Models and Foundation Models for EMRs
Michael Wornow
Yizhe Xu
Rahul Thapa
Birju S. Patel
E. Steinberg
Scott L. Fleming
M. Pfeffer
Jason Alan Fries
N. Shah
LM&MA
28
32
0
22 Mar 2023
Exploring Efficient-Tuned Learning Audio Representation Method from BriVL
Sen Fang
Yang Wu
Bowen Gao
Jingwen Cai
T. Teoh
DiffM
29
1
0
08 Mar 2023
FAME-ViL: Multi-Tasking Vision-Language Model for Heterogeneous Fashion Tasks
Xiaoping Han
Xiatian Zhu
Licheng Yu
Li Zhang
Yi-Zhe Song
Tao Xiang
VLM
24
38
0
04 Mar 2023
Rejecting Cognitivism: Computational Phenomenology for Deep Learning
P. Beckmann
G. Köstner
Ines Hipólito
34
4
0
16 Feb 2023
Decoding Visual Neural Representations by Multimodal Learning of Brain-Visual-Linguistic Features
Changde Du
Kaicheng Fu
Jinpeng Li
Huiguang He
VLM
45
68
0
13 Oct 2022
A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language
Bing-Huang Su
Dazhao Du
Zhao-Qing Yang
Yujie Zhou
Jiangmeng Li
Anyi Rao
Haoran Sun
Zhiwu Lu
Ji-Rong Wen
46
108
0
12 Sep 2022
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
Chao Jia
Yinfei Yang
Ye Xia
Yi-Ting Chen
Zarana Parekh
Hieu H. Pham
Quoc V. Le
Yun-hsuan Sung
Zhen Li
Tom Duerig
VLM
CLIP
334
3,708
0
11 Feb 2021
Similarity Reasoning and Filtration for Image-Text Matching
Haiwen Diao
Ying Zhang
Lingyun Ma
Huchuan Lu
231
332
0
05 Jan 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,984
0
20 Apr 2018
1