2311.16867
Cited By
The Falcon Series of Open Language Models
28 November 2023
Ebtesam Almazrouei
Hamza Alobeidli
Abdulaziz Alshamsi
Alessandro Cappelli
Ruxandra-Aimée Cojocaru
Mérouane Debbah
Étienne Goffinet
Daniel Hesslow
Julien Launay
Quentin Malartic
Daniele Mazzotta
Badreddine Noune
B. Pannier
Guilherme Penedo
AI4TS
ALM
Papers citing "The Falcon Series of Open Language Models"
50 / 306 papers shown
Title
u-μP: The Unit-Scaled Maximal Update Parametrization
Charlie Blake
C. Eichenberg
Josef Dean
Lukas Balles
Luke Y. Prince
Bjorn Deiseroth
Andres Felipe Cruz Salinas
Carlo Luschi
Samuel Weinbach
Douglas Orr
136
10
0
24 Jul 2024
Automatic Generation of Fashion Images using Prompting in Generative Machine Learning Models
Georgia Argyrou
Angeliki Dimitriou
Maria Lymperaiou
Giorgos Filandrianos
Giorgos Stamou
83
4
0
20 Jul 2024
Evaluating the Reliability of Self-Explanations in Large Language Models
Korbinian Randl
John Pavlopoulos
Aron Henriksson
Tony Lindgren
LRM
138
1
0
19 Jul 2024
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
Chaofan Tao
Qian Liu
Longxu Dou
Niklas Muennighoff
Zhongwei Wan
Ping Luo
Min Lin
Ngai Wong
PILM
134
54
0
18 Jul 2024
From Words to Worlds: Compositionality for Cognitive Architectures
Ruchira Dhar
Anders Sogaard
98
0
0
18 Jul 2024
PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation
Branden Butler
Sixing Yu
Arya Mazaheri
Ali Jannesari
LRM
128
7
0
16 Jul 2024
AdaptEval: Evaluating Large Language Models on Domain Adaptation for Text Summarization
Anum Afzal
Ribin Chalumattu
Florian Matthes
Laura Mascarell
ALM
ELM
94
5
0
16 Jul 2024
Inference Optimization of Foundation Models on AI Accelerators
Youngsuk Park
Kailash Budhathoki
Liangfu Chen
Jonas M. Kübler
Jiaji Huang
Matthäus Kleindessner
Jun Huan
Volkan Cevher
Yida Wang
George Karypis
122
5
0
12 Jul 2024
Training on the Test Task Confounds Evaluation and Emergence
Ricardo Dominguez-Olmedo
Florian E. Dorner
Moritz Hardt
ELM
164
9
1
10 Jul 2024
Data, Data Everywhere: A Guide for Pretraining Dataset Construction
Jupinder Parmar
Shrimai Prabhumoye
Pritam Gundecha
Bo Liu
Aastha Jhunjhunwala
Zhilin Wang
M. Patwary
Mohammad Shoeybi
Bryan Catanzaro
126
10
0
08 Jul 2024
Limits to Predicting Online Speech Using Large Language Models
Mina Remeli
Moritz Hardt
Robert C. Williamson
61
0
0
08 Jul 2024
Do Multilingual Large Language Models Mitigate Stereotype Bias?
Shangrui Nie
Michael Fromm
Charles F Welch
Rebekka Görge
Akbar Karimi
Joan Plepi
Nazia Afsan Mowmita
Nicolas Flores-Herr
Mehdi Ali
Lucie Flek
89
4
0
08 Jul 2024
PoPreRo: A New Dataset for Popularity Prediction of Romanian Reddit Posts
Ana-Cristina Rogoz
Maria Ilinca Nechita
Radu Tudor Ionescu
118
0
0
05 Jul 2024
A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations
Md Tahmid Rahman Laskar
Sawsan Alqahtani
M Saiful Bari
Mizanur Rahman
Mohammad Abdullah Matin Khan
...
Chee Wei Tan
Md. Rizwan Parvez
Enamul Hoque
Shafiq Joty
Jimmy Huang
ELM
ALM
105
41
0
04 Jul 2024
LLM Roleplay: Simulating Human-Chatbot Interaction
Hovhannes Tamoyan
Hendrik Schuff
Iryna Gurevych
104
10
0
04 Jul 2024
SOS! Soft Prompt Attack Against Open-Source Large Language Models
Ziqing Yang
Michael Backes
Yang Zhang
Ahmed Salem
AAML
73
5
0
03 Jul 2024
Towards More Realistic Extraction Attacks: An Adversarial Perspective
Yash More
Prakhar Ganesh
G. Farnadi
AAML
126
7
0
02 Jul 2024
ScaleBiO: Scalable Bilevel Optimization for LLM Data Reweighting
Boyao Wang
Dylan Zhang
Hanning Zhang
Xingyuan Pan
Minrui Xu
Jipeng Zhang
Renjie Pi
Xiaoyu Wang
Tong Zhang
143
10
0
28 Jun 2024
Fairness and Bias in Multimodal AI: A Survey
Tosin Adewumi
Lama Alkhaled
Namrata Gurung
G. V. Boven
Irene Pagliai
121
10
0
27 Jun 2024
STBench: Assessing the Ability of Large Language Models in Spatio-Temporal Analysis
Wenbin Li
Di Yao
Ruibo Zhao
Wenjie Chen
Zijie Xu
Chengxue Luo
Chang Gong
Quanliang Jing
Haining Tan
Jingping Bi
85
7
0
27 Jun 2024
FactFinders at CheckThat! 2024: Refining Check-worthy Statement Detection with LLMs through Data Pruning
Yufeng Li
Rrubaa Panchendrarajan
A. Zubiaga
90
4
0
26 Jun 2024
Structured Unrestricted-Rank Matrices for Parameter Efficient Fine-tuning
Arijit Sehanobish
Avinava Dubey
Krzysztof Choromanski
Somnath Basu Roy Chowdhury
Deepali Jain
Vikas Sindhwani
Snigdha Chaturvedi
ALM
93
3
0
25 Jun 2024
BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks
A. Ramesh
Vignesh Ganapathiraman
I. Laradji
Mark Schmidt
141
3
0
25 Jun 2024
Evaluation of Language Models in the Medical Context Under Resource-Constrained Settings
Andrea Posada
Daniel Rueckert
Felix Meissen
Philip Muller
LM&MA
ELM
63
0
0
24 Jun 2024
PORT: Preference Optimization on Reasoning Traces
Salem Lahlou
Abdalgader Abubaker
Hakim Hacid
LRM
124
5
0
23 Jun 2024
A Tale of Trust and Accuracy: Base vs. Instruct LLMs in RAG Systems
Florin Cuconasu
Giovanni Trappolini
Nicola Tonellotto
Fabrizio Silvestri
87
2
0
21 Jun 2024
Talking the Talk Does Not Entail Walking the Walk: On the Limits of Large Language Models in Lexical Entailment Recognition
C. M. Greco
Lucio La Cava
Andrea Tagarelli
71
1
0
21 Jun 2024
Evidence of a log scaling law for political persuasion with large language models
Kobi Hackenburg
Ben M. Tappin
Paul Röttger
Scott Hale
Jonathan Bright
Helen Z. Margetts
82
9
0
20 Jun 2024
Instruction Pre-Training: Language Models are Supervised Multitask Learners
Daixuan Cheng
Yuxian Gu
Shaohan Huang
Junyu Bi
Minlie Huang
Furu Wei
SyDa
137
27
0
20 Jun 2024
Leveraging Large Language Models for Patient Engagement: The Power of Conversational AI in Digital Health
Bo Wen
R. Norel
Julia Liu
Thaddeus Stappenbeck
F. Zulkernine
Huamin Chen
AI4MH
LM&MA
76
4
0
19 Jun 2024
Improving Visual Commonsense in Language Models via Multiple Image Generation
Guy Yariv
Idan Schwartz
Yossi Adi
Sagie Benaim
VLM
LRM
50
0
0
19 Jun 2024
Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators
Matéo Mahaut
Laura Aina
Paula Czarnowska
Momchil Hardalov
Thomas Müller
Lluís Marquez
HILM
99
24
0
19 Jun 2024
Breaking the Ceiling of the LLM Community by Treating Token Generation as a Classification for Ensembling
Yao-Ching Yu
Chun-Chih Kuo
Ziqi Ye
Yu-Cheng Chang
Yueh-Se Li
90
12
0
18 Jun 2024
D2O: Dynamic Discriminative Operations for Efficient Long-Context Inference of Large Language Models
Zhongwei Wan
Xinjian Wu
Yu Zhang
Yi Xin
Chaofan Tao
...
Xin Wang
Siqi Luo
Jing Xiong
Mi Zhang
126
1
0
18 Jun 2024
LiLiuM: eBay's Large Language Models for e-commerce
Christian Herold
Michael Kozielski
Leonid Ekimov
Pavel Petrushkov
P. Vandenbussche
Shahram Khadivi
98
3
0
17 Jun 2024
Large Language Models for Dysfluency Detection in Stuttered Speech
Dominik Wagner
Sebastian P. Bayerl
Ilja Baumann
Korbinian Riedhammer
Elmar Nöth
Tobias Bocklet
132
6
0
16 Jun 2024
IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce
Wenxuan Ding
Weiqi Wang
Sze Heng Douglas Kwok
Minghao Liu
Tianqing Fang
Jiaxin Bai
Junxian He
Yangqiu Song
RALM
88
8
0
14 Jun 2024
GEB-1.3B: Open Lightweight Large Language Model
Jie Wu
Yufeng Zhu
Lei Shen
Xuqing Lu
ALM
48
0
0
14 Jun 2024
OLMES: A Standard for Language Model Evaluations
Yuling Gu
Oyvind Tafjord
Bailey Kuehl
Dany Haddad
Jesse Dodge
Hannaneh Hajishirzi
ELM
134
20
0
12 Jun 2024
MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs
Vera Neplenbroek
Arianna Bisazza
Raquel Fernández
96
8
0
11 Jun 2024
Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain
Brian Hu
Bill Ray
Alice Leung
Amy Summerville
David Joy
Christopher Funk
Arslan Basharat
88
6
0
10 Jun 2024
Aligning Large Language Models with Representation Editing: A Control Perspective
Lingkai Kong
Haorui Wang
Wenhao Mu
Yuanqi Du
Yuchen Zhuang
Yifei Zhou
Yue Song
Rongzhi Zhang
Kai Wang
Chao Zhang
107
26
0
10 Jun 2024
A Fine-tuning Dataset and Benchmark for Large Language Models for Protein Understanding
Yiqing Shen
Zan Chen
Michail Mamalakis
Luhan He
Haiyang Xia
Tianbin Li
Yanzhou Su
Junjun He
Yu Guang Wang
AI4MH
150
10
0
08 Jun 2024
Do Language Models Exhibit Human-like Structural Priming Effects?
Jaap Jumelet
Willem H. Zuidema
Arabella J. Sinclair
93
11
0
07 Jun 2024
CRAG -- Comprehensive RAG Benchmark
Xiao Yang
Kai Sun
Hao Xin
Yushi Sun
Nikita Bhalla
...
Nirav Shah
Rakesh Wanga
Anuj Kumar
Wen-tau Yih
Xin Luna Dong
95
32
0
07 Jun 2024
CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset
Abdelrahman Abdallah
Mahmoud Abdalla
M. Kasem
Mohamed Mahmoud
Ibrahim Abdelhalim
Mohamed Elkasaby
Yasser Elbendary
Adam Jatowt
66
0
0
06 Jun 2024
MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset
Weiqi Wang
Yangqiu Song
LRM
129
10
0
04 Jun 2024
Achieving Sparse Activation in Small Language Models
Jifeng Song
Kai Huang
Xiangyu Yin
Boyuan Yang
Wei Gao
89
4
0
03 Jun 2024
Effective Interplay between Sparsity and Quantization: From Theory to Practice
Simla Burcu Harma
Ayan Chakraborty
Elizaveta Kostenok
Danila Mishin
Dongho Ha
...
Martin Jaggi
Ming Liu
Yunho Oh
Suvinay Subramanian
Amir Yazdanbakhsh
MQ
103
10
0
31 May 2024
The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities
David Stap
Eva Hasler
Bill Byrne
Christof Monz
Ke M. Tran
89
12
0
30 May 2024