ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.16867
  4. Cited By
The Falcon Series of Open Language Models
v1v2 (latest)

The Falcon Series of Open Language Models

28 November 2023
Ebtesam Almazrouei
Hamza Alobeidli
Abdulaziz Alshamsi
Alessandro Cappelli
Ruxandra-Aimée Cojocaru
Mérouane Debbah
Étienne Goffinet
Daniel Hesslow
Julien Launay
Quentin Malartic
Daniele Mazzotta
Badreddine Noune
B. Pannier
Guilherme Penedo
    AI4TSALM
ArXiv (abs)PDFHTML

Papers citing "The Falcon Series of Open Language Models"

50 / 306 papers shown
Title
Mechanisms vs. Outcomes: Probing for Syntax Fails to Explain Performance on Targeted Syntactic Evaluations
Mechanisms vs. Outcomes: Probing for Syntax Fails to Explain Performance on Targeted Syntactic Evaluations
Ananth Agarwal
Jasper Jian
Christopher D. Manning
Shikhar Murty
24
0
0
20 Jun 2025
Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models
Just Go Parallel: Improving the Multilingual Capabilities of Large Language Models
Muhammad Reza Qorib
Junyi Li
Hwee Tou Ng
LRM
35
0
0
16 Jun 2025
Detecting Hard-Coded Credentials in Software Repositories via LLMs
Detecting Hard-Coded Credentials in Software Repositories via LLMs
Chidera Biringa
Gökhan Kul
43
0
0
16 Jun 2025
QiMeng-Attention: SOTA Attention Operator is generated by SOTA Attention Algorithm
QiMeng-Attention: SOTA Attention Operator is generated by SOTA Attention Algorithm
Qirui Zhou
Shaohui Peng
Weiqiang Xiong
Haixin Chen
Yuanbo Wen
...
Ke Gao
Ruizhi Chen
Yanjun Wu
Chen Zhao
Y. Chen
LRM
37
0
0
14 Jun 2025
Exploring Cultural Variations in Moral Judgments with Large Language Models
Exploring Cultural Variations in Moral Judgments with Large Language Models
Hadi Mohammadi
Efthymia Papadopoulou
Yasmeen F.S.S. Meijer
Ayoub Bagheri
45
0
0
14 Jun 2025
MALM: A Multi-Information Adapter for Large Language Models to Mitigate Hallucination
MALM: A Multi-Information Adapter for Large Language Models to Mitigate Hallucination
Ao Jia
Haiming Wu
Guohui Yao
D. Song
Songkun Ji
Yazhou Zhang
34
0
0
14 Jun 2025
Plug-in and Fine-tuning: Bridging the Gap between Small Language Models and Large Language Models
Plug-in and Fine-tuning: Bridging the Gap between Small Language Models and Large Language Models
Kyeonghyun Kim
Jinhee Jang
Juhwan Choi
Yoonji Lee
Kyohoon Jin
Youngbin Kim
39
0
0
09 Jun 2025
Spark Transformer: Reactivating Sparsity in FFN and Attention
Spark Transformer: Reactivating Sparsity in FFN and Attention
Chong You
Kan Wu
Zhipeng Jia
Lin Chen
Srinadh Bhojanapalli
...
Felix X. Yu
Prateek Jain
David Culler
Henry M. Levy
Sanjiv Kumar
28
0
0
07 Jun 2025
Elementary Math Word Problem Generation using Large Language Models
Elementary Math Word Problem Generation using Large Language Models
Nimesh Ariyarathne
Harshani Bandara
Yasith Heshan
Omega Gamage
Surangika Ranathunga
...
Gayathri Lihinikaduarachchi
Tharoosha Vihidun
Meenambika Chandirakumar
Sanujen Premakumar
Sanjula Gathsara
AI4Ed
78
0
0
06 Jun 2025
SoK: Are Watermarks in LLMs Ready for Deployment?
SoK: Are Watermarks in LLMs Ready for Deployment?
Kieu Dang
Phung Lai
Nhathai Phan
Yelong Shen
Ruoming Jin
Abdallah Khreishah
My T. Thai
43
0
0
05 Jun 2025
Beyond Text Compression: Evaluating Tokenizers Across Scales
Beyond Text Compression: Evaluating Tokenizers Across Scales
Jonas F. Lotz
António V. Lopes
Stephan Peitz
Hendra Setiawan
Leonardo Emili
66
0
0
03 Jun 2025
Beware! The AI Act Can Also Apply to Your AI Research Practices
Beware! The AI Act Can Also Apply to Your AI Research Practices
Alina Wernick
Kristof Meding
23
0
0
03 Jun 2025
Comparing LLM-generated and human-authored news text using formal syntactic theory
Comparing LLM-generated and human-authored news text using formal syntactic theory
Olga Zamaraeva
Dan Flickinger
Francis Bond
Carlos Gómez-Rodríguez
71
0
0
02 Jun 2025
Stepsize anything: A unified learning rate schedule for budgeted-iteration training
Stepsize anything: A unified learning rate schedule for budgeted-iteration training
Anda Tang
Yiming Dong
Yutao Zeng
zhou Xun
Zhouchen Lin
380
0
0
30 May 2025
Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation
Leveraging Knowledge Graphs and LLMs for Structured Generation of Misinformation
Sania Nayab
Marco Simoni
Giulio Rossolini
38
0
0
30 May 2025
Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs
Is Your Model Fairly Certain? Uncertainty-Aware Fairness Evaluation for LLMs
Yinong Oliver Wang
N. Sivakumar
Falaah Arif Khan
Rin Metcalf Susa
Adam Goliñski
Natalie Mackraz
B. Theobald
Luca Zappella
N. Apostoloff
45
0
0
29 May 2025
Self-Critique and Refinement for Faithful Natural Language Explanations
Self-Critique and Refinement for Faithful Natural Language Explanations
Yingming Wang
Pepa Atanasova
LRM
134
0
0
28 May 2025
Ratas framework: A comprehensive genai-based approach to rubric-based marking of real-world textual exams
Ratas framework: A comprehensive genai-based approach to rubric-based marking of real-world textual exams
Masoud Safilian
Amin Beheshti
Stephen Elbourn
39
0
0
27 May 2025
A Position Paper on the Automatic Generation of Machine Learning Leaderboards
A Position Paper on the Automatic Generation of Machine Learning Leaderboards
Roelien C Timmer
Yufang Hou
Stephen Wan
229
0
0
23 May 2025
Shadows in the Attention: Contextual Perturbation and Representation Drift in the Dynamics of Hallucination in LLMs
Shadows in the Attention: Contextual Perturbation and Representation Drift in the Dynamics of Hallucination in LLMs
Zeyu Wei
Shuo Wang
Xiaohui Rong
Xuemin Liu
He Li
HILM
53
0
0
22 May 2025
Shadow-FT: Tuning Instruct via Base
Shadow-FT: Tuning Instruct via Base
Taiqiang Wu
Runming Yang
Jiayi Li
Pengfei Hu
Ngai Wong
Yujiu Yang
254
0
0
19 May 2025
Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
Power Lines: Scaling Laws for Weight Decay and Batch Size in LLM Pre-training
Shane Bergsma
Nolan Dey
Gurpreet Gosal
Gavia Gray
Daria Soboleva
Joel Hestness
80
2
0
19 May 2025
Investigating the Vulnerability of LLM-as-a-Judge Architectures to Prompt-Injection Attacks
Investigating the Vulnerability of LLM-as-a-Judge Architectures to Prompt-Injection Attacks
Narek Maloyan
Bislan Ashinov
Dmitry Namiot
AAMLELM
90
0
0
19 May 2025
NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization
NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization
Yanbo Dai
Zhenlan Ji
Zongjie Li
Shuai Wang
KELM
64
0
0
17 May 2025
Empirically evaluating commonsense intelligence in large language models with large-scale human judgments
Empirically evaluating commonsense intelligence in large language models with large-scale human judgments
Tuan Dung Nguyen
Duncan J. Watts
Mark E. Whiting
ELM
137
1
0
15 May 2025
Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Comet: Accelerating Private Inference for Large Language Model by Predicting Activation Sparsity
Guang Yan
Yuhui Zhang
Zimu Guo
Lutan Zhao
Xiaojun Chen
Chen Wang
Wenhao Wang
Dan Meng
Rui Hou
80
0
0
12 May 2025
Towards AI-Driven Human-Machine Co-Teaming for Adaptive and Agile Cyber Security Operation Centers
Towards AI-Driven Human-Machine Co-Teaming for Adaptive and Agile Cyber Security Operation Centers
Massimiliano Albanese
Xinming Ou
Kevin Lybarger
Daniel Lende
Dmitry Goldgof
49
0
0
09 May 2025
ViClaim: A Multilingual Multilabel Dataset for Automatic Claim Detection in Videos
ViClaim: A Multilingual Multilabel Dataset for Automatic Claim Detection in Videos
Patrick Giedemann
Pius von Daniken
Jan Deriu
Álvaro Rodrigo
Anselmo Peñas
Mark Cieliebak
90
1
0
17 Apr 2025
NNTile: a machine learning framework capable of training extremely large GPT language models on a single node
NNTile: a machine learning framework capable of training extremely large GPT language models on a single node
A. Mikhalev
Aleksandr Katrutsa
Konstantin Sozykin
Ivan Oseledets
42
0
0
17 Apr 2025
Video Summarization with Large Language Models
Video Summarization with Large Language Models
Min Jung Lee
Dayoung Gong
Minsu Cho
91
0
0
15 Apr 2025
Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models?
Can LLMs Revolutionize the Design of Explainable and Efficient TinyML Models?
Christophe El Zeinaty
W. Hamidouche
Glenn Herrou
D. Ménard
Merouane Debbah
85
0
0
13 Apr 2025
Multi-view autoencoders for Fake News Detection
Multi-view autoencoders for Fake News Detection
Ingryd V. S. T. Pereira
George D. C. Cavalcanti
Rafael M. O. Cruz
ViT
49
0
0
10 Apr 2025
Layer-Aware Embedding Fusion for LLMs in Text Classifications
Layer-Aware Embedding Fusion for LLMs in Text Classifications
Jiho Gwak
Yuchul Jung
141
0
0
08 Apr 2025
The H-Elena Trojan Virus to Infect Model Weights: A Wake-Up Call on the Security Risks of Malicious Fine-Tuning
The H-Elena Trojan Virus to Infect Model Weights: A Wake-Up Call on the Security Risks of Malicious Fine-Tuning
Virilo Tejedor
Cristina Zuheros
Carlos Peláez-González
David Herrera-Poyatos
Andrés Herrera-Poyatos
F. Herrera
65
0
0
04 Apr 2025
MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs)
MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs)
Bikash Saha
Nanda Rani
Sandeep K. Shukla
67
2
0
01 Apr 2025
HERA: Hybrid Edge-cloud Resource Allocation for Cost-Efficient AI Agents
HERA: Hybrid Edge-cloud Resource Allocation for Cost-Efficient AI Agents
Shiyi Liu
Haiying Shen
Shuai Che
Mahdi Ghandi
Mingqin Li
LLMAG
180
0
0
01 Apr 2025
OntoAligner: A Comprehensive Modular and Robust Python Toolkit for Ontology Alignment
OntoAligner: A Comprehensive Modular and Robust Python Toolkit for Ontology Alignment
Hamed Babaei Giglou
Jennifer D'Souza
Oliver Karras
Sören Auer
51
3
0
27 Mar 2025
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
3MDBench: Medical Multimodal Multi-agent Dialogue Benchmark
Ivan Sviridov
Amina Miftakhova
Artemiy Tereshchenko
Galina Zubkova
Pavel Blinov
Andrey Savchenko
LM&MA
100
0
0
26 Mar 2025
Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
Zhanda Zhu
Christina Giannoula
Muralidhar Andoorveedu
Qidong Su
Karttikeya Mangalam
Bojian Zheng
Gennady Pekhimenko
VLMMoE
96
0
0
24 Mar 2025
Payload-Aware Intrusion Detection with CMAE and Large Language Models
Payload-Aware Intrusion Detection with CMAE and Large Language Models
Yongcheol Kim
Chanjae Lee
Young Yoon
77
0
0
23 Mar 2025
Think Before Refusal : Triggering Safety Reflection in LLMs to Mitigate False Refusal Behavior
Think Before Refusal : Triggering Safety Reflection in LLMs to Mitigate False Refusal Behavior
Siyang Song
Xinpeng Wang
Guangyao Zhai
Nassir Navab
Yun Xue
LLMAG
101
0
0
22 Mar 2025
Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models
Only a Little to the Left: A Theory-grounded Measure of Political Bias in Large Language Models
Mats Faulborn
Indira Sen
Max Pellert
Andreas Spitz
David Garcia
ELM
77
0
0
20 Mar 2025
Context-aware Biases for Length Extrapolation
Context-aware Biases for Length Extrapolation
Ali Veisi
Hamidreza Amirzadeh
Amir Mansourian
172
1
0
11 Mar 2025
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation
Yingfeng Luo
Tong Zheng
Yongyu Mu
Yangqiu Song
Qinghong Zhang
...
Ziqiang Xu
Peinan Feng
Xiaoqian Liu
Tong Xiao
Jingbo Zhu
AI4CE
514
3
0
09 Mar 2025
SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs
Samir Abdaljalil
Hasan Kurban
Parichit Sharma
Erchin Serpedin
Rachad Atat
HILM
101
3
0
07 Mar 2025
Biases in Large Language Model-Elicited Text: A Case Study in Natural Language Inference
Grace Proebsting
Adam Poliak
101
0
0
06 Mar 2025
EchoQA: A Large Collection of Instruction Tuning Data for Echocardiogram Reports
L. Moukheiber
Mira Moukheiber
Dana Moukheiiber
Jae-Woo Ju
Hyung-Chul Lee
LM&MA
146
0
0
04 Mar 2025
Rethinking Data: Towards Better Performing Domain-Specific Small Language Models
Boris Nazarov
Darya Frolova
Yackov Lubarsky
Alexei Gaissinski
Pavel Kisilev
ALM
111
1
0
03 Mar 2025
Your Model is Overconfident, and Other Lies We Tell Ourselves
Timothee Mickus
Aman Sinha
Raúl Vázquez
85
0
0
03 Mar 2025
Monte Carlo Temperature: a robust sampling strategy for LLM's uncertainty quantification methods
Monte Carlo Temperature: a robust sampling strategy for LLM's uncertainty quantification methods
Nicola Cecere
Andrea Bacciu
Ignacio Fernández Tobías
Amin Mantrach
155
1
0
25 Feb 2025
1234567
Next