ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.16867
  4. Cited By
The Falcon Series of Open Language Models
v1v2 (latest)

The Falcon Series of Open Language Models

28 November 2023
Ebtesam Almazrouei
Hamza Alobeidli
Abdulaziz Alshamsi
Alessandro Cappelli
Ruxandra-Aimée Cojocaru
Mérouane Debbah
Étienne Goffinet
Daniel Hesslow
Julien Launay
Quentin Malartic
Daniele Mazzotta
Badreddine Noune
B. Pannier
Guilherme Penedo
    AI4TSALM
ArXiv (abs)PDFHTML

Papers citing "The Falcon Series of Open Language Models"

50 / 306 papers shown
Title
Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension
Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension
Yulong Wu
Viktor Schlegel
Riza Batista-Navarro
AAML
76
0
0
23 Feb 2025
ELBA-Bench: An Efficient Learning Backdoor Attacks Benchmark for Large Language Models
ELBA-Bench: An Efficient Learning Backdoor Attacks Benchmark for Large Language Models
Xianglong Liu
Siyuan Liang
M. Han
Yong Luo
Aishan Liu
Xiantao Cai
Zheng He
Dacheng Tao
AAMLSILMELM
107
2
0
22 Feb 2025
ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models
ExpliCa: Evaluating Explicit Causal Reasoning in Large Language Models
Martina Miliani
S. Auriemma
Alessandro Bondielli
Emmanuele Chersoni
Lucia Passaro
Irene Sucameli
Alessandro Lenci
LRMELM
86
0
0
21 Feb 2025
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs
Straight to Zero: Why Linearly Decaying the Learning Rate to Zero Works Best for LLMs
Shane Bergsma
Nolan Dey
Gurpreet Gosal
Gavia Gray
Daria Soboleva
Joel Hestness
109
8
0
21 Feb 2025
AutoMedPrompt: A New Framework for Optimizing LLM Medical Prompts Using Textual Gradients
AutoMedPrompt: A New Framework for Optimizing LLM Medical Prompts Using Textual Gradients
Sean Wu
Michael Koo
Fabien Scalzo
Ira Kurtz
LM&MAMedIm
95
2
0
21 Feb 2025
LESA: Learnable LLM Layer Scaling-Up
LESA: Learnable LLM Layer Scaling-Up
Yifei Yang
Zouying Cao
Xinbei Ma
Yao Yao
L. Qin
Zhongfu Chen
Hai Zhao
179
0
0
20 Feb 2025
The Rotary Position Embedding May Cause Dimension Inefficiency in Attention Heads for Long-Distance Retrieval
The Rotary Position Embedding May Cause Dimension Inefficiency in Attention Heads for Long-Distance Retrieval
Ting-Rui Chiang
Dani Yogatama
62
0
0
16 Feb 2025
User Profile with Large Language Models: Construction, Updating, and Benchmarking
User Profile with Large Language Models: Construction, Updating, and Benchmarking
Nusrat Jahan Prottasha
Md. Kowsher
Hafijur Raman
Israt Jahan Anny
Prakash Bhat
Ivan Garibay
O. Garibay
91
0
0
15 Feb 2025
Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon
Forget What You Know about LLMs Evaluations - LLMs are Like a Chameleon
Nurit Cohen-Inger
Yehonatan Elisha
Bracha Shapira
Lior Rokach
Seffi Cohen
ELM
174
1
0
11 Feb 2025
Large Language Models for Multi-Robot Systems: A Survey
Large Language Models for Multi-Robot Systems: A Survey
Peihan Li
Zijian An
Shams Abrar
Lifeng Zhou
LM&RoLRM
138
10
0
06 Feb 2025
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study
Menglong Cui
Pengzhi Gao
Wei Liu
Jian Luan
Bin Wang
LRM
154
6
0
04 Feb 2025
GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments
GUIDE: A Global Unified Inference Engine for Deploying Large Language Models in Heterogeneous Environments
Yanyu Chen
Ganhong Huang
162
0
0
28 Jan 2025
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs
Towards Cross-Tokenizer Distillation: the Universal Logit Distillation Loss for LLMs
Nicolas Boizard
Kevin El Haddad
C´eline Hudelot
Pierre Colombo
167
19
0
28 Jan 2025
Parameter-Efficient Fine-Tuning for Foundation Models
Parameter-Efficient Fine-Tuning for Foundation Models
Dan Zhang
Tao Feng
Lilong Xue
Yuandong Wang
Yuxiao Dong
J. Tang
244
12
0
23 Jan 2025
Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment
Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment
Melissa Kazemi Rad
Huy Nghiem
Andy Luo
Sahil Wadhwa
Mohammad Sorower
Stephen Rawls
AAML
159
5
0
22 Jan 2025
LLMs as Repositories of Factual Knowledge: Limitations and Solutions
Seyed Mahed Mousavi
Simone Alghisi
Giuseppe Riccardi
KELM
110
1
0
22 Jan 2025
Human-like conceptual representations emerge from language prediction
Human-like conceptual representations emerge from language prediction
Ningyu Xu
Qi Zhang
Chao Du
Qiang Luo
Xipeng Qiu
Xuanjing Huang
Menghan Zhang
192
0
0
21 Jan 2025
Exploring Robustness of Multilingual LLMs on Real-World Noisy Data
Exploring Robustness of Multilingual LLMs on Real-World Noisy Data
Amirhossein Aliakbarzadeh
Lucie Flek
Akbar Karimi
117
0
0
14 Jan 2025
SCOT: Self-Supervised Contrastive Pretraining For Zero-Shot Compositional Retrieval
Bhavin Jawade
JOÃO-BRUNO Soares
K. Thadani
D. Mohan
Amir Erfan Eshratifar
Benjamin Culpepper
Paloma de Juan
S. Setlur
V. Govindaraju
94
0
0
12 Jan 2025
Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts
Tailored-LLaMA: Optimizing Few-Shot Learning in Pruned LLaMA Models with Task-Specific Prompts
Danyal Aftab
Steven Davy
ALM
122
1
0
10 Jan 2025
iServe: An Intent-based Serving System for LLMs
iServe: An Intent-based Serving System for LLMs
Dimitrios Liakopoulos
Tianrui Hu
Prasoon Sinha
N. Yadwadkar
VLM
532
0
0
08 Jan 2025
Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions
Integrating LLMs with ITS: Recent Advances, Potentials, Challenges, and Future Directions
Doaa Mahmud
Hadeel Hajmohamed
Shamma Almentheri
Shamma Alqaydi
Lameya Aldhaheri
R. A. Khalil
Nasir Saeed
AI4TS
111
12
0
08 Jan 2025
HuRef: HUman-REadable Fingerprint for Large Language Models
HuRef: HUman-REadable Fingerprint for Large Language Models
Boyi Zeng
Cheng Zhou
Yuncong Hu
Yi Xu
Chenghu Zhou
Xiang Wang
Yu Yu
Zhouhan Lin
145
12
0
08 Jan 2025
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Clinical Insights: A Comprehensive Review of Language Models in Medicine
Nikita Neveditsin
Pawan Lingras
V. Mago
LM&MA
132
5
0
08 Jan 2025
IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment
IIMedGPT: Promoting Large Language Model Capabilities of Medical Tasks by Efficient Human Preference Alignment
Yiming Zhang
Zheng Chang
Wentao Cai
MengXing Ren
Kang Yuan
Yining Sun
Zenghui Ding
LM&MA
103
3
0
06 Jan 2025
Adversarial Negotiation Dynamics in Generative Language Models
Arinbjörn Kolbeinsson
Benedikt Kolbeinsson
AAMLELM
142
0
0
03 Jan 2025
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Out-of-distribution generalization via composition: a lens through induction heads in Transformers
Jiajun Song
Zhuoyan Xu
Yiqiao Zhong
162
10
0
31 Dec 2024
Beyond Partisan Leaning: A Comparative Analysis of Political Bias in Large Language Models
Beyond Partisan Leaning: A Comparative Analysis of Political Bias in Large Language Models
Kaiqi Yang
Hang Li
Yucheng Chu
Hang Li
Tai-Quan Peng
Yuping Lin
Hui Liu
159
1
0
21 Dec 2024
Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs
Fearful Falcons and Angry Llamas: Emotion Category Annotations of Arguments by Humans and LLMs
Lynn Greschner
Roman Klinger
182
2
0
20 Dec 2024
Deploying Foundation Model Powered Agent Services: A Survey
Deploying Foundation Model Powered Agent Services: A Survey
Wenchao Xu
Jinyu Chen
Peirong Zheng
Xiaoquan Yi
Tianyi Tian
...
Quan Wan
Yining Qi
Yunfeng Fan
Qinliang Su
Xuemin Shen
AI4CE
183
2
0
18 Dec 2024
Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental
  Health
Unlocking LLMs: Addressing Scarce Data and Bias Challenges in Mental Health
Vivek Kumar
Eirini Ntoutsi
Pushpraj Singh Rajawat
Giacomo Medda
Diego Reforgiato Recupero
AI4MH
112
1
0
17 Dec 2024
Empowering LLMs to Understand and Generate Complex Vector Graphics
Empowering LLMs to Understand and Generate Complex Vector Graphics
Ximing Xing
Juncheng Hu
Guotao Liang
Jing Zhang
Dong Xu
Qian Yu
195
12
0
15 Dec 2024
SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability
  Report Generation
SusGen-GPT: A Data-Centric LLM for Financial NLP and Sustainability Report Generation
Qilong Wu
Xiaoneng Xiang
Hejia Huang
Xuan Wang
Yeo Wei Jie
Ranjan Satapathy
Ricardo Shirota Filho
Bharadwaj Veeravalli
147
3
0
14 Dec 2024
MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary
  Negative Samples
MPPO: Multi Pair-wise Preference Optimization for LLMs with Arbitrary Negative Samples
Shuo Xie
Fangzhi Zhu
Jiahui Wang
Lulu Wen
Wei Dai
Xiaowei Chen
Junxiong Zhu
Kai Zhou
Bo Zheng
91
0
0
13 Dec 2024
Code LLMs: A Taxonomy-based Survey
Code LLMs: A Taxonomy-based Survey
Nishat Raihan
Christian D. Newman
Marcos Zampieri
148
1
0
11 Dec 2024
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
HalluCana: Fixing LLM Hallucination with A Canary Lookahead
Tianyi Li
Erenay Dayanik
Shubhi Tyagi
Andrea Pierleoni
HILM
124
0
0
10 Dec 2024
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning
Ruben Ohana
Michael McCabe
Lucas Meyer
Rudy Morel
Fruzsina J. Agocs
...
François Rozet
Liam Parker
M. Cranmer
S. Ho
Shirley Ho
PINNAI4CE
195
23
1
30 Nov 2024
Training Bilingual LMs with Data Constraints in the Targeted Language
Training Bilingual LMs with Data Constraints in the Targeted Language
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
201
1
0
20 Nov 2024
Empowering Meta-Analysis: Leveraging Large Language Models for Scientific Synthesis
Jawad Ibn Ahad
Rafeed Mohammad Sultan
Abraham Kaikobad
Fuad Rahman
M. R. Amin
Nabeel Mohammed
Shafin Rahman
97
1
0
16 Nov 2024
New Emerged Security and Privacy of Pre-trained Model: a Survey and
  Outlook
New Emerged Security and Privacy of Pre-trained Model: a Survey and Outlook
Meng Yang
Tianqing Zhu
Chi Liu
Wanlei Zhou
Shui Yu
Philip S. Yu
AAMLELMPILM
112
1
0
12 Nov 2024
FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training
FRUGAL: Memory-Efficient Optimization by Reducing State Overhead for Scalable Training
Philip Zmushko
Aleksandr Beznosikov
Martin Takáč
Samuel Horváth
80
2
0
12 Nov 2024
SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers
SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers
Shruti Singh
Nandan Sarkar
Arman Cohan
108
1
0
08 Nov 2024
Robust and Efficient Fine-tuning of LLMs with Bayesian
  Reparameterization of Low-Rank Adaptation
Robust and Efficient Fine-tuning of LLMs with Bayesian Reparameterization of Low-Rank Adaptation
Ayan Sengupta
Vaibhav Seth
Arinjay Pathak
Natraj Raman
Sriram Gopalakrishnan
Tanmoy Chakraborty
BDL
85
2
0
07 Nov 2024
The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units
The LLM Language Network: A Neuroscientific Approach for Identifying Causally Task-Relevant Units
Badr AlKhamissi
Greta Tuckute
Antoine Bosselut
Martin Schrimpf
MILM
121
12
0
04 Nov 2024
Data movement limits to frontier model training
Data movement limits to frontier model training
Ege Erdil
David Schneider-Joseph
92
1
0
02 Nov 2024
CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for
  Parallel Programming
CodeRosetta: Pushing the Boundaries of Unsupervised Code Translation for Parallel Programming
Ali TehraniJamsaz
Arijit Bhattacharjee
Le Chen
Nesreen Ahmed
Amir Yazdanbakhsh
Ali Jannesari
78
7
0
27 Oct 2024
Adversarial Attacks on Large Language Models Using Regularized
  Relaxation
Adversarial Attacks on Large Language Models Using Regularized Relaxation
Samuel Jacob Chacko
Sajib Biswas
Chashi Mahiul Islam
Fatema Tabassum Liza
Xiuwen Liu
AAML
87
3
0
24 Oct 2024
Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for
  Russian Scientific Keyphrases
Key Algorithms for Keyphrase Generation: Instruction-Based LLMs for Russian Scientific Keyphrases
Anna Glazkova
Dmitry A. Morozov
Timur Garipov
90
0
0
23 Oct 2024
Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Do Robot Snakes Dream like Electric Sheep? Investigating the Effects of Architectural Inductive Biases on Hallucination
Jerry Huang
Prasanna Parthasarathi
Mehdi Rezagholizadeh
Boxing Chen
Sarath Chandar
173
0
0
22 Oct 2024
Self-calibration for Language Model Quantization and Pruning
Self-calibration for Language Model Quantization and Pruning
Miles Williams
G. Chrysostomou
Nikolaos Aletras
MQ
497
0
0
22 Oct 2024
Previous
1234567
Next