2107.02137
Cited By
ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
5 July 2021
Yu Sun
Shuohuan Wang
Shikun Feng
Siyu Ding
Chao Pang
Junyuan Shang
Jiaxiang Liu
Xuyi Chen
Yanbin Zhao
Yuxiang Lu
Weixin Liu
Zhihua Wu
Weibao Gong
Jianzhong Liang
Zhizhou Shang
Peng Sun
Wei Liu
Ouyang Xuan
Dianhai Yu
Hao Tian
Hua Wu
Haifeng Wang
Papers citing
"ERNIE 3.0: Large-scale Knowledge Enhanced Pre-training for Language Understanding and Generation"
50 / 68 papers shown
Title
Task-Core Memory Management and Consolidation for Long-term Continual Learning
Tianyu Huai
Jie Zhou
Yuxuan Cai
Qin Chen
Wen Wu
Xingjiao Wu
Xipeng Qiu
Liang He
CLL
33
0
0
15 May 2025
Learn to Think: Bootstrapping LLM Reasoning Capability Through Graph Representation Learning
Hang Gao
Chenhao Zhang
Tie Wang
Junsuo Zhao
Fengge Wu
Changwen Zheng
Huaping Liu
LRM
34
0
0
09 May 2025
Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification
Yubo Wang
Haoyang Li
Fei Teng
Lei Chen
91
1
0
17 Feb 2025
A Survey of Large Language Models for Healthcare: from Data, Technology, and Applications to Accountability and Ethics
Kai He
Rui Mao
Qika Lin
Yucheng Ruan
Xiang Lan
Mengling Feng
Min Zhang
LM&MA
AILaw
93
154
0
28 Jan 2025
Hengqin-RA-v1: Advanced Large Language Model for Diagnosis and Treatment of Rheumatoid Arthritis with Dataset based Traditional Chinese Medicine
Yishen Liu
Shengda Luo
Zishao Zhong
Tongtong Wu
Jun Zhang
Peiyao Ou
Yong Liang
Liang Liu
Hudan Pan
LM&MA
40
0
0
05 Jan 2025
ValuesRAG: Enhancing Cultural Alignment Through Retrieval-Augmented Contextual Learning
Wonduk Seo
Zonghao Yuan
Yi Bu
VLM
50
1
0
02 Jan 2025
UniHGKR: Unified Instruction-aware Heterogeneous Knowledge Retrievers
Dehai Min
Zhiyang Xu
Guilin Qi
Lifu Huang
Chenyu You
RALM
75
1
0
26 Oct 2024
Understanding Layer Significance in LLM Alignment
Guangyuan Shi
Zexin Lu
Xiaoyu Dong
Wenlong Zhang
Xuanyu Zhang
Yujie Feng
Xiao-Ming Wu
58
2
0
23 Oct 2024
Large Language Model Inference Acceleration: A Comprehensive Hardware Perspective
Jinhao Li
Jiaming Xu
Shan Huang
Yonghua Chen
Wen Li
...
Jiayi Pan
Li Ding
Hao Zhou
Yu Wang
Guohao Dai
62
16
0
06 Oct 2024
Edu-Values: Towards Evaluating the Chinese Education Values of Large Language Models
Peiyi Zhang
Yazhou Zhang
Bo Wang
Lu Rong
Jing Qin
AI4Ed
ELM
49
1
0
19 Sep 2024
medIKAL: Integrating Knowledge Graphs as Assistants of LLMs for Enhanced Clinical Diagnosis on EMRs
Mingyi Jia
Junwen Duan
Yan Song
Jianxin Wang
37
7
0
20 Jun 2024
DEPTH: Discourse Education through Pre-Training Hierarchically
Zachary Bamberger
Ofek Glick
Chaim Baskin
Yonatan Belinkov
67
0
0
13 May 2024
CRE-LLM: A Domain-Specific Chinese Relation Extraction Framework with Fine-tuned Large Language Model
Zhengpeng Shi
Haoran Luo
LRM
ALM
38
2
0
28 Apr 2024
SOS-1K: A Fine-grained Suicide Risk Classification Dataset for Chinese Social Media Analysis
Hongzhi Qi
Hanfei Liu
Jianqiang Li
Qing Zhao
Wei-dong Zhai
Dan Luo
Tianyu He
Shuo Liu
Bing Xiang Yang
Guanghui Fu
31
1
0
19 Apr 2024
Navigating the Landscape of Large Language Models: A Comprehensive Review and Analysis of Paradigms and Fine-Tuning Strategies
Benjue Weng
LM&MA
46
7
0
13 Apr 2024
Cross-domain Multi-modal Few-shot Object Detection via Rich Text
Zeyu Shangguan
Daniel Seita
Mohammad Rostami
ObjD
55
1
0
24 Mar 2024
LHMKE: A Large-scale Holistic Multi-subject Knowledge Evaluation Benchmark for Chinese Large Language Models
Chuang Liu
Renren Jin
Yuqi Ren
Deyi Xiong
ELM
43
0
0
19 Mar 2024
CPSDBench: A Large Language Model Evaluation Benchmark and Baseline for Chinese Public Security Domain
Xin Tong
Bo Jin
Zhi Lin
Binjun Wang
Ting Yu
Qiang Cheng
ELM
22
0
0
11 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
134
371
0
09 Feb 2024
GeoDecoder: Empowering Multimodal Map Understanding
Feng Qi
Mian Dai
Zixian Zheng
Chao Wang
39
1
0
26 Jan 2024
Lever LM: Configuring In-Context Sequence to Lever Large Vision Language Models
Xu Yang
Yingzhe Peng
Haoxuan Ma
Shuo Xu
Chi Zhang
Yucheng Han
Hanwang Zhang
32
5
0
15 Dec 2023
Oasis: Data Curation and Assessment System for Pretraining of Large Language Models
Tong Zhou
Yubo Chen
Pengfei Cao
Kang Liu
Jun Zhao
Shengping Liu
29
3
0
21 Nov 2023
HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs
Junying Chen
Xidong Wang
Anningzhe Gao
Feng Jiang
Shunian Chen
...
Chuyi Kong
Jianquan Li
Xiang Wan
Haizhou Li
Benyou Wang
LM&MA
24
61
0
16 Nov 2023
Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey
Garima Agrawal
Tharindu Kumarage
Zeyad Alghami
Huanmin Liu
37
81
0
14 Nov 2023
Graph Neural Prompting with Large Language Models
Yijun Tian
Huan Song
Zichen Wang
Haozhu Wang
Ziqing Hu
Fang Wang
Nitesh V. Chawla
Panpan Xu
AI4CE
37
44
0
27 Sep 2023
The Impact of Artificial Intelligence on the Evolution of Digital Education: A Comparative Study of OpenAI Text Generation Tools including ChatGPT, Bing Chat, Bard, and Ernie
Negin Yazdani Motlagh
Matin Khajavi
Abbas Sharifi
Mohsen Ahmadi
30
27
0
05 Sep 2023
CValues: Measuring the Values of Chinese Large Language Models from Safety to Responsibility
Guohai Xu
Jiayi Liu
Mingshi Yan
Haotian Xu
Jinghui Si
...
Rong Zhang
Ji Zhang
Chao Peng
Feiyan Huang
Jingren Zhou
ALM
ELM
33
73
0
19 Jul 2023
AI could create a perfect storm of climate misinformation
V. Galaz
Hannah Metzler
Stefan Daume
A. Olsson
B. Lindström
A. Marklund
23
5
0
22 Jun 2023
GPT-SW3: An Autoregressive Language Model for the Nordic Languages
Ariel Ekgren
Amaru Cuba Gyllensten
Felix Stollenwerk
Joey Öhman
T. Isbister
Evangelia Gogoulou
F. Carlsson
Alice Heiman
Judit Casademont
Magnus Sahlgren
29
13
0
22 May 2023
A Survey of Safety and Trustworthiness of Large Language Models through the Lens of Verification and Validation
Xiaowei Huang
Wenjie Ruan
Wei Huang
Gao Jin
Yizhen Dong
...
Sihao Wu
Peipei Xu
Dengyu Wu
André Freitas
Mustafa A. Mustafa
ALM
45
82
0
19 May 2023
Measuring the Instability of Fine-Tuning
Yupei Du
D. Nguyen
25
4
0
15 Feb 2023
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
Max Ryabinin
Tim Dettmers
Michael Diskin
Alexander Borzunov
MoE
30
31
0
27 Jan 2023
Towards Robust Low-Resource Fine-Tuning with Multi-View Compressed Representations
Linlin Liu
Xingxuan Li
Megh Thakkar
Xin Li
Chenyu You
Luo Si
Lidong Bing
27
2
0
16 Nov 2022
A Survey of Knowledge Enhanced Pre-trained Language Models
Linmei Hu
Zeyi Liu
Ziwang Zhao
Lei Hou
Liqiang Nie
Juanzi Li
KELM
VLM
24
121
0
11 Nov 2022
Discriminative Language Model as Semantic Consistency Scorer for Prompt-based Few-Shot Text Classification
Zhipeng Xie
Yahe Li
26
0
0
23 Oct 2022
Deep Bidirectional Language-Knowledge Graph Pretraining
Michihiro Yasunaga
Antoine Bosselut
Hongyu Ren
Xikun Zhang
Christopher D. Manning
Percy Liang
J. Leskovec
36
193
0
17 Oct 2022
ERNIE-ViL 2.0: Multi-view Contrastive Learning for Image-Text Pre-training
Bin Shan
Weichong Yin
Yu Sun
Hao Tian
Hua Wu
Haifeng Wang
VLM
27
19
0
30 Sep 2022
Large-scale Knowledge Distillation with Elastic Heterogeneous Computing Resources
Ji Liu
Daxiang Dong
Xi Wang
An Qin
Xingjian Li
P. Valduriez
Dejing Dou
Dianhai Yu
34
6
0
14 Jul 2022
Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval
Xiao Dong
Xunlin Zhan
Yunchao Wei
Xiaoyong Wei
Yaowei Wang
Minlong Lu
Xiaochun Cao
Xiaodan Liang
24
11
0
17 Jun 2022
JiuZhang: A Chinese Pre-trained Language Model for Mathematical Problem Understanding
Wayne Xin Zhao
Kun Zhou
Zheng Gong
Beichen Zhang
Yuanhang Zhou
Jing Sha
Zhigang Chen
Shijin Wang
Cong Liu
Ji-Rong Wen
36
18
0
13 Jun 2022
A Survey on Video Action Recognition in Sports: Datasets, Methods and Applications
Fei Wu
Qingzhong Wang
Jian Bian
Haoyi Xiong
Ning Ding
Feixiang Lu
Junqing Cheng
Dejing Dou
AI4TS
28
52
0
02 Jun 2022
On the Role of Bidirectionality in Language Model Pre-Training
Mikel Artetxe
Jingfei Du
Naman Goyal
Luke Zettlemoyer
Ves Stoyanov
27
16
0
24 May 2022
SKILL: Structured Knowledge Infusion for Large Language Models
Fedor Moiseev
Zhe Dong
Enrique Alfonseca
Martin Jaggi
KELM
58
58
0
17 May 2022
GPT-NeoX-20B: An Open-Source Autoregressive Language Model
Sid Black
Stella Biderman
Eric Hallahan
Quentin G. Anthony
Leo Gao
...
Shivanshu Purohit
Laria Reynolds
J. Tow
Benqi Wang
Samuel Weinbach
96
801
0
14 Apr 2022
METRO: Efficient Denoising Pretraining of Large Scale Autoencoding Language Models with Model Generated Signals
Payal Bajaj
Chenyan Xiong
Guolin Ke
Xiaodong Liu
Di He
Saurabh Tiwary
Tie-Yan Liu
Paul N. Bennett
Xia Song
Jianfeng Gao
50
32
0
13 Apr 2022
Token Dropping for Efficient BERT Pretraining
Le Hou
Richard Yuanzhe Pang
Dinesh Manocha
Yuexin Wu
Xinying Song
Xiaodan Song
Denny Zhou
22
43
0
24 Mar 2022
MarkBERT: Marking Word Boundaries Improves Chinese BERT
Linyang Li
Yong Dai
Duyu Tang
Xipeng Qiu
Zenglin Xu
Shuming Shi
VLM
29
11
0
12 Mar 2022
What Makes Reading Comprehension Questions Difficult?
Saku Sugawara
Nikita Nangia
Alex Warstadt
Sam Bowman
ELM
RALM
20
13
0
12 Mar 2022
Context Enhanced Short Text Matching using Clickthrough Data
Mao Yan Chen
Haiyun Jiang
Yujiu Yang
36
4
0
03 Mar 2022
$\mathcal{Y}$-Tuning: An Efficient Tuning Paradigm for Large-Scale Pre-Trained Models via Label Representation Learning
Yitao Liu
Chen An
Xipeng Qiu
29
17
0
20 Feb 2022