2106.07139
Cited By
Pre-Trained Models: Past, Present and Future
14 June 2021
Xu Han
Zhengyan Zhang
Ning Ding
Yuxian Gu
Xiao Liu
Yuqi Huo
J. Qiu
Yuan Yao
Ao Zhang
Liang Zhang
Wentao Han
Minlie Huang
Qin Jin
Yanyan Lan
Yang Liu
Zhiyuan Liu
Zhiwu Lu
Xipeng Qiu
Ruihua Song
Jie Tang
Ji-Rong Wen
Jinhui Yuan
Wayne Xin Zhao
Jun Zhu
AIFin
MQ
AI4MH
Papers citing
"Pre-Trained Models: Past, Present and Future"
50 / 293 papers shown
Title
PersLLM: A Personified Training Approach for Large Language Models
Zheni Zeng
Jiayi Chen
Huimin Chen
Yukun Yan
Yuxuan Chen
Zhenghao Liu
Zhiyuan Liu
Maosong Sun
LLMAG
52
2
0
17 Jul 2024
NoisyAG-News: A Benchmark for Addressing Instance-Dependent Noise in Text Classification
Hongfei Huang
Tingting Liang
Xixi Sun
Zikang Jin
Yuyu Yin
NoLa
39
1
0
09 Jul 2024
Studying the Impact of TensorFlow and PyTorch Bindings on Machine Learning Software Quality
Hao Li
Gopi Krishnan Rajbahadur
C. Bezemer
39
5
0
07 Jul 2024
Self-supervised Pretraining for Partial Differential Equations
Varun Madhavan
Amal S Sebastian
Bharath Ramsundar
Venkatasubramanian Viswanathan
AI4CE
43
0
0
03 Jul 2024
A Depression Detection Method Based on Multi-Modal Feature Fusion Using Cross-Attention
Shengjie Li
Yinhao Xiao
33
1
0
02 Jul 2024
InFiConD: Interactive No-code Fine-tuning with Concept-based Knowledge Distillation
Jinbin Huang
Wenbin He
Liang Gou
Liu Ren
Chris Bryan
52
0
0
25 Jun 2024
Controlling Forgetting with Test-Time Data in Continual Learning
Vaibhav Singh
Rahaf Aljundi
Eugene Belilovsky
CLL
VLM
KELM
48
3
0
19 Jun 2024
Text-space Graph Foundation Models: Comprehensive Benchmarks and New Insights
Zhikai Chen
Haitao Mao
Jingzhe Liu
Yu Song
Bingheng Li
...
Bahare Fatemi
Anton Tsitsulin
Bryan Perozzi
Hui Liu
Jiliang Tang
36
10
0
15 Jun 2024
ProTrain: Efficient LLM Training via Memory-Aware Techniques
Hanmei Yang
Jin Zhou
Yao Fu
Xiaoqun Wang
Ramine Roane
Hui Guan
Tongping Liu
VLM
36
0
0
12 Jun 2024
SecureNet: A Comparative Study of DeBERTa and Large Language Models for Phishing Detection
Sakshi Mahendru
Tejul Pandit
31
1
0
10 Jun 2024
Peer Review as A Multi-Turn and Long-Context Dialogue with Role-Based Interactions
Cheng Tan
Dongxin Lyu
Siyuan Li
Zhangyang Gao
Jingxuan Wei
Siqi Ma
Zicheng Liu
Stan Z. Li
LLMAG
48
10
0
09 Jun 2024
Certified Robustness to Data Poisoning in Gradient-Based Training
Philip Sosnin
Mark N. Müller
Maximilian Baader
Calvin Tsay
Matthew Wicker
AAML
SILM
71
8
0
09 Jun 2024
Extroversion or Introversion? Controlling The Personality of Your Large Language Models
Yanquan Chen
Zhen Wu
Junjie Guo
Shujian Huang
Xinyu Dai
26
0
0
07 Jun 2024
LLMs Could Autonomously Learn Without External Supervision
Ke Ji
Junying Chen
Anningzhe Gao
Wenya Xie
Xiang Wan
Benyou Wang
37
4
0
02 Jun 2024
Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging
Michail Theologitis
Georgios Frangias
Georgios Anestis
V. Samoladas
Antonios Deligiannakis
FedML
40
0
0
31 May 2024
Constrained Ensemble Exploration for Unsupervised Skill Discovery
Chenjia Bai
Rushuai Yang
Qiaosheng Zhang
Kang Xu
Yi Chen
Ting Xiao
Xuelong Li
OffRL
40
3
0
25 May 2024
Large Language Models for Medicine: A Survey
Yanxin Zheng
Wensheng Gan
Zefeng Chen
Zhenlian Qi
Qian Liang
Philip S. Yu
LM&MA
23
15
0
20 May 2024
A Hybrid Deep Learning Framework for Stock Price Prediction Considering the Investor Sentiment of Online Forum Enhanced by Popularity
Huiyu Li
Junhua Hu
19
0
0
17 May 2024
SA-FedLora: Adaptive Parameter Allocation for Efficient Federated Learning with LoRA Tuning
Yuning Yang
Xiaohong Liu
Tianrun Gao
Xiaodong Xu
Guangyu Wang
40
5
0
15 May 2024
The Ghanaian NLP Landscape: A First Look
Sheriff Issaka
Zhaoyi Zhang
Mihir Heda
Keyi Wang
Yinka Ajibola
Ryan DeMar
Xuefeng Du
46
1
0
10 May 2024
A Survey of Time Series Foundation Models: Generalizing Time Series Representation with Large Language Model
Weiqi Zhang
Jiexia Ye
Ke Yi
Yongzi Yu
Ziyue Li
Jia Li
Fugee Tsung
AI4TS
AI4CE
45
22
0
03 May 2024
GUing: A Mobile GUI Search Engine using a Vision-Language Model
Jialiang Wei
A. Courbis
Thomas Lambolais
Binbin Xu
P. Bernard
Gérard Dray
Walid Maalej
DiffM
CLIP
34
6
0
30 Apr 2024
A Partial Replication of MaskFormer in TensorFlow on TPUs for the TensorFlow Model Garden
Vishal Purohit
Wenxin Jiang
Akshath R. Ravikiran
James C. Davis
40
1
0
29 Apr 2024
AI-Enhanced Cognitive Behavioral Therapy: Deep Learning and Large Language Models for Extracting Cognitive Pathways from Social Media Texts
Meng Jiang
Y. Yu
Qing Zhao
Jianqiang Li
Changwei Song
...
Wei-dong Zhai
Dan Luo
Xiaoqin Wang
Guanghui Fu
Bing Xiang Yang
40
1
0
17 Apr 2024
Privacy Preserving Prompt Engineering: A Survey
Kennedy Edemacu
Xintao Wu
49
18
0
09 Apr 2024
Comparing Self-Supervised Learning Techniques for Wearable Human Activity Recognition
Sannara Ek
Riccardo Presotto
Gabriele Civitarese
François Portet
P. Lalanda
Claudio Bettini
HAI
25
1
0
08 Apr 2024
Foundation Model for Advancing Healthcare: Challenges, Opportunities, and Future Directions
Yuting He
Fuxiang Huang
Xinrui Jiang
Yuxiang Nie
Minghao Wang
Jiguang Wang
Hao Chen
LM&MA
AI4CE
76
27
0
04 Apr 2024
DivTOD: Unleashing the Power of LLMs for Diversifying Task-Oriented Dialogue Representations
Weihao Zeng
Dayuan Fu
Keqing He
Yejie Wang
Yukai Xu
Weiran Xu
46
2
0
31 Mar 2024
Change-Agent: Towards Interactive Comprehensive Remote Sensing Change Interpretation and Analysis
Chenyang Liu
Keyan Chen
Haotian Zhang
Zipeng Qi
Zhengxia Zou
Z. Shi
39
30
0
28 Mar 2024
Naive Bayes-based Context Extension for Large Language Models
Jianlin Su
Murtadha Ahmed
Wenbo Luo
Abhishek Rao
Denny Zhou
Hyeontaek Lim
34
5
0
26 Mar 2024
Qibo: A Large Language Model for Traditional Chinese Medicine
Heyi Zhang
Xin Wang
Zhaopeng Meng
Zhe Chen
Pengwei Zhuang
Yongzhe Jia
Dawei Xu
Wenbin Guo
LM&MA
37
10
0
24 Mar 2024
Expandable Subspace Ensemble for Pre-Trained Model-Based Class-Incremental Learning
Da-Wei Zhou
Hai-Long Sun
Han-Jia Ye
De-Chuan Zhan
CLL
38
54
0
18 Mar 2024
Fisher Mask Nodes for Language Model Merging
Thennal D K
Ganesh Nathan
Suchithra M S
MoMe
AI4CE
47
5
0
14 Mar 2024
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
Sun Ao
Weilin Zhao
Xu Han
Cheng Yang
Zhiyuan Liu
Chuan Shi
Maosong Sun
GNN
40
8
0
14 Mar 2024
Quantum Mixed-State Self-Attention Network
Fu Chen
Qinglin Zhao
Li Feng
Chuangtao Chen
Yangbin Lin
Jianhong Lin
42
5
0
05 Mar 2024
MALTO at SemEval-2024 Task 6: Leveraging Synthetic Data for LLM Hallucination Detection
Federico Borra
Claudio Savelli
Giacomo Rosso
Alkis Koudounas
F. Giobergia
HILM
37
3
0
01 Mar 2024
Towards Optimal Learning of Language Models
Yuxian Gu
Li Dong
Y. Hao
Qingxiu Dong
Minlie Huang
Furu Wei
36
7
0
27 Feb 2024
∞Bench: Extending Long Context Evaluation Beyond 100K Tokens
Xinrong Zhang
Yingfa Chen
Shengding Hu
Zihang Xu
Junhao Chen
...
Xu Han
Zhen Leng Thai
Shuo Wang
Zhiyuan Liu
Maosong Sun
RALM
LRM
36
148
0
21 Feb 2024
LLM-Enhanced User-Item Interactions: Leveraging Edge Information for Optimized Recommendations
Xinyuan Wang
Liang Wu
Liangjie Hong
Hao Liu
Yanjie Fu
34
18
0
14 Feb 2024
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
134
371
0
09 Feb 2024
Text-to-Code Generation with Modality-relative Pre-training
Fenia Christopoulou
Guchun Zhang
Gerasimos Lampouras
AI4TS
26
1
0
08 Feb 2024
InfLLM: Training-Free Long-Context Extrapolation for LLMs with an Efficient Context Memory
Chaojun Xiao
Pengle Zhang
Xu Han
Guangxuan Xiao
Yankai Lin
Zhengyan Zhang
Zhiyuan Liu
Maosong Sun
LLMAG
47
35
0
07 Feb 2024
Position: Graph Foundation Models are Already Here
Haitao Mao
Zhikai Chen
Wenzhuo Tang
Jianan Zhao
Yao Ma
Tong Zhao
Neil Shah
Mikhail Galkin
Jiliang Tang
AI4CE
64
27
0
03 Feb 2024
Foundation Model Sherpas: Guiding Foundation Models through Knowledge and Reasoning
D. Bhattacharjya
Junkyu Lee
Don Joven Agravante
Balaji Ganesan
Radu Marinescu
LLMAG
38
1
0
02 Feb 2024
PeaTMOSS: A Dataset and Initial Analysis of Pre-Trained Models in Open-Source Software
Wenxin Jiang
Jerin Yasmin
Jason Jones
Nicholas Synovic
Jiashen Kuo
Nathaniel Bielanski
Yuan Tian
George K. Thiruvathukal
James C. Davis
41
11
0
01 Feb 2024
Deep Learning Model Reuse in the HuggingFace Community: Challenges, Benefit and Trends
Mina Taraghi
Gianolli Dorcelus
A. Foundjem
Florian Tambon
Foutse Khomh
18
14
0
24 Jan 2024
A Novel Prompt-tuning Method: Incorporating Scenario-specific Concepts into a Verbalizer
Yong Ma
Senlin Luo
Yu-Ming Shang
Zhengjun Li
Yong Liu
VLM
28
2
0
10 Jan 2024
Generic Knowledge Boosted Pre-training For Remote Sensing Images
Ziyue Huang
Mingming Zhang
Yuan Gong
Qingjie Liu
Yunhong Wang
VLM
35
14
0
09 Jan 2024
DynaLay: An Introspective Approach to Dynamic Layer Selection for Deep Networks
Mrinal Mathur
Sergey Plis
AI4CE
15
1
0
20 Dec 2023
TrojFSP: Trojan Insertion in Few-shot Prompt Tuning
Meng Zheng
Jiaqi Xue
Xun Chen
YanShan Wang
Qian Lou
Lei Jiang
AAML
32
7
0
16 Dec 2023