ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2311.16867
  4. Cited By
The Falcon Series of Open Language Models
v1v2 (latest)

The Falcon Series of Open Language Models

28 November 2023
Ebtesam Almazrouei
Hamza Alobeidli
Abdulaziz Alshamsi
Alessandro Cappelli
Ruxandra-Aimée Cojocaru
Mérouane Debbah
Étienne Goffinet
Daniel Hesslow
Julien Launay
Quentin Malartic
Daniele Mazzotta
Badreddine Noune
B. Pannier
Guilherme Penedo
    AI4TSALM
ArXiv (abs)PDFHTML

Papers citing "The Falcon Series of Open Language Models"

50 / 306 papers shown
Title
Kernel Language Entropy: Fine-grained Uncertainty Quantification for
  LLMs from Semantic Similarities
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
Alexander Nikitin
Jannik Kossen
Yarin Gal
Pekka Marttinen
UQCV
140
45
0
30 May 2024
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT
  Even in Low-Resource Settings
Laboratory-Scale AI: Open-Weight Models are Competitive with ChatGPT Even in Low-Resource Settings
Robert Wolfe
Isaac Slaughter
Bin Han
Bingbing Wen
Yiwei Yang
...
Bernease Herman
E. Brown
Zening Qu
Nicholas Weber
Bill Howe
107
8
0
27 May 2024
Comparative Analysis of Open-Source Language Models in Summarizing
  Medical Text Data
Comparative Analysis of Open-Source Language Models in Summarizing Medical Text Data
Yuhao Chen
Zhimu Wang
Bo Wen
F. Zulkernine
ELMLM&MAAI4MH
25
4
0
25 May 2024
RE-Adapt: Reverse Engineered Adaptation of Large Language Models
RE-Adapt: Reverse Engineered Adaptation of Large Language Models
William Fleshman
Benjamin Van Durme
VLM
93
4
0
23 May 2024
Mitigating Quantization Errors Due to Activation Spikes in GLU-Based
  LLMs
Mitigating Quantization Errors Due to Activation Spikes in GLU-Based LLMs
Jaewoo Yang
Hayun Kim
Younghoon Kim
95
15
0
23 May 2024
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation
  Models
CrossCheckGPT: Universal Hallucination Ranking for Multimodal Foundation Models
Guangzhi Sun
Potsawee Manakul
Adian Liusie
Kunat Pipatanakul
Chao Zhang
P. Woodland
Mark Gales
HILMMLLM
86
9
0
22 May 2024
ReALLM: A general framework for LLM compression and fine-tuning
ReALLM: A general framework for LLM compression and fine-tuning
Louis Leconte
Lisa Bedin
Van Minh Nguyen
Eric Moulines
MQ
129
1
0
21 May 2024
Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in
  Fine-tuning LLMs for Simultaneous Translation
Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation
Matthew Raffel
Victor Agostinelli
Lizhong Chen
97
7
0
16 May 2024
Falcon 7b for Software Mention Detection in Scholarly Documents
Falcon 7b for Software Mention Detection in Scholarly Documents
AmeerAli Khan
Qusai Ramadan
Cong Yang
Zeyd Boukhers
58
0
0
14 May 2024
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text
  Detectors
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Liam Dugan
Alyssa Hwang
Filip Trhlik
Josh Magnus Ludan
Andrew Zhu
Hainiu Xu
Daphne Ippolito
Christopher Callison-Burch
DeLMOAAML
122
52
0
13 May 2024
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and
  Composition of Experts
SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts
R. Prabhakar
R. Sivaramakrishnan
Darshan Gandhi
Yun Du
Mingran Wang
...
Urmish Thakker
Dawei Huang
Sumti Jairath
Kevin J. Brown
K. Olukotun
MoE
77
15
0
13 May 2024
Understanding the Capabilities and Limitations of Large Language Models
  for Cultural Commonsense
Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense
Siqi Shen
Lajanugen Logeswaran
Moontae Lee
Honglak Lee
Soujanya Poria
Rada Mihalcea
AI4MHLRMELM
115
33
0
07 May 2024
When LLMs Meet Cybersecurity: A Systematic Literature Review
When LLMs Meet Cybersecurity: A Systematic Literature Review
Jie Zhang
Haoyu Bu
Hui Wen
Yu Chen
Lun Li
Hongsong Zhu
145
47
0
06 May 2024
Large Language Models (LLMs) as Agents for Augmented Democracy
Large Language Models (LLMs) as Agents for Augmented Democracy
Jairo Gudiño-Rosero
Umberto Grandi
César A. Hidalgo
LLMAG
101
128
0
06 May 2024
CACTUS: Chemistry Agent Connecting Tool-Usage to Science
CACTUS: Chemistry Agent Connecting Tool-Usage to Science
Andrew D. McNaughton
Gautham Ramalaxmi
Agustin Kruel
C. Knutson
R. Varikoti
Neeraj Kumar
118
11
0
02 May 2024
General Purpose Verification for Chain of Thought Prompting
General Purpose Verification for Chain of Thought Prompting
Robert Vacareanu
Anurag Pratik
Evangelia Spiliopoulou
Zheng Qi
Giovanni Paolini
Neha Ann John
Jie Ma
Yassine Benajiba
Miguel Ballesteros
LRM
44
10
0
30 Apr 2024
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text
  Streaming Services
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services
Jiachen Liu
Zhiyu Wu
Jae-Won Chung
Fan Lai
Myungjin Lee
Mosharaf Chowdhury
96
29
0
25 Apr 2024
A Survey on Efficient Inference for Large Language Models
A Survey on Efficient Inference for Large Language Models
Zixuan Zhou
Xuefei Ning
Ke Hong
Tianyu Fu
Jiaming Xu
...
Shengen Yan
Guohao Dai
Xiao-Ping Zhang
Yuhan Dong
Yu Wang
176
99
0
22 Apr 2024
UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty
  and Response Time for Multiple-Choice Questions
UnibucLLM: Harnessing LLMs for Automated Prediction of Item Difficulty and Response Time for Multiple-Choice Questions
Ana-Cristina Rogoz
Radu Tudor Ionescu
35
3
0
20 Apr 2024
Stronger Random Baselines for In-Context Learning
Stronger Random Baselines for In-Context Learning
Gregory Yauney
David M. Mimno
80
2
0
19 Apr 2024
Sampling-based Pseudo-Likelihood for Membership Inference Attacks
Sampling-based Pseudo-Likelihood for Membership Inference Attacks
Masahiro Kaneko
Youmi Ma
Yuki Wata
Naoaki Okazaki
82
9
0
17 Apr 2024
HLAT: High-quality Large Language Model Pre-trained on AWS Trainium
HLAT: High-quality Large Language Model Pre-trained on AWS Trainium
Haozheng Fan
Hao Zhou
Guangtai Huang
Parameswaran Raman
Xinwei Fu
Gaurav Gupta
Dhananjay Ram
Yida Wang
Jun Huan
89
6
0
16 Apr 2024
LLMs4OM: Matching Ontologies with Large Language Models
LLMs4OM: Matching Ontologies with Large Language Models
Hamed Babaei Giglou
Jennifer D'Souza
Felix Engel
Sören Auer
85
11
0
16 Apr 2024
Compression Represents Intelligence Linearly
Compression Represents Intelligence Linearly
Yuzhen Huang
Jinghan Zhang
Zifei Shan
Junxian He
82
29
0
15 Apr 2024
Quantization of Large Language Models with an Overdetermined Basis
Quantization of Large Language Models with an Overdetermined Basis
D. Merkulov
Daria Cherniuk
Alexander Rudikov
Ivan Oseledets
Ekaterina Muravleva
A. Mikhalev
Boris Kashin
MQ
66
0
0
15 Apr 2024
Integrating Physiological Data with Large Language Models for Empathic
  Human-AI Interaction
Integrating Physiological Data with Large Language Models for Empathic Human-AI Interaction
Poorvesh Dongre
Majid Behravan
Kunal Gupta
Mark Billinghurst
Denis Gračanin
AI4MHLM&MA
92
6
0
14 Apr 2024
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
Junchi Wang
Lei Ke
MLLMLRMVLM
85
29
0
12 Apr 2024
From Words to Numbers: Your Large Language Model Is Secretly A Capable
  Regressor When Given In-Context Examples
From Words to Numbers: Your Large Language Model Is Secretly A Capable Regressor When Given In-Context Examples
Robert Vacareanu
Vlad-Andrei Negru
Vasile Suciu
Mihai Surdeanu
74
35
0
11 Apr 2024
DyKnow:Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs
DyKnow:Dynamically Verifying Time-Sensitive Factual Knowledge in LLMs
Seyed Mahed Mousavi
Simone Alghisi
Giuseppe Riccardi
KELM
92
7
0
10 Apr 2024
MiniCPM: Unveiling the Potential of Small Language Models with Scalable
  Training Strategies
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
Shengding Hu
Yuge Tu
Xu Han
Chaoqun He
Ganqu Cui
...
Chaochao Jia
Guoyang Zeng
Dahai Li
Zhiyuan Liu
Maosong Sun
MoE
131
347
0
09 Apr 2024
The Hallucinations Leaderboard -- An Open Effort to Measure
  Hallucinations in Large Language Models
The Hallucinations Leaderboard -- An Open Effort to Measure Hallucinations in Large Language Models
Giwon Hong
Aryo Pradipta Gema
Rohit Saxena
Xiaotang Du
Ping Nie
...
Laura Perez-Beltrachini
Max Ryabinin
Xuanli He
Clémentine Fourrier
Pasquale Minervini
LRMHILM
85
12
0
08 Apr 2024
SambaLingo: Teaching Large Language Models New Languages
SambaLingo: Teaching Large Language Models New Languages
Zoltan Csaki
Bo Li
Jonathan Li
Qiantong Xu
Pian Pawakapan
Leon Zhang
Yun Du
Hengyu Zhao
Changran Hu
Urmish Thakker
94
6
0
08 Apr 2024
ALERT: A Comprehensive Benchmark for Assessing Large Language Models'
  Safety through Red Teaming
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming
Simone Tedeschi
Felix Friedrich
P. Schramowski
Kristian Kersting
Roberto Navigli
Huu Nguyen
Bo Li
ELM
120
52
0
06 Apr 2024
Deciphering Political Entity Sentiment in News with Large Language
  Models: Zero-Shot and Few-Shot Strategies
Deciphering Political Entity Sentiment in News with Large Language Models: Zero-Shot and Few-Shot Strategies
Alapan Kuila
Sudeshna Sarkar
62
8
0
05 Apr 2024
Lossless and Near-Lossless Compression for Foundation Models
Lossless and Near-Lossless Compression for Foundation Models
Moshik Hershcovitch
Leshem Choshen
Andrew Wood
Ilias Enmouri
Peter Chin
S. Sundararaman
Danny Harnik
92
6
0
05 Apr 2024
Probing Large Language Models for Scalar Adjective Lexical Semantics and
  Scalar Diversity Pragmatics
Probing Large Language Models for Scalar Adjective Lexical Semantics and Scalar Diversity Pragmatics
Fangru Lin
Daniel Altshuler
J. Pierrehumbert
123
1
0
04 Apr 2024
HyperCLOVA X Technical Report
HyperCLOVA X Technical Report
Kang Min Yoo
Jaegeun Han
Sookyo In
Heewon Jeon
Jisu Jeong
...
Hyunkyung Noh
Se-Eun Choi
Sang-Woo Lee
Jung Hwa Lim
Nako Sung
VLM
88
9
0
02 Apr 2024
Poro 34B and the Blessing of Multilinguality
Poro 34B and the Blessing of Multilinguality
Risto Luukkonen
Jonathan Burdge
Elaine Zosa
Aarne Talman
Ville Komulainen
Vaino Hatanpaa
Peter Sarlin
S. Pyysalo
AI4CE
104
14
0
02 Apr 2024
The Impact of Prompts on Zero-Shot Detection of AI-Generated Text
The Impact of Prompts on Zero-Shot Detection of AI-Generated Text
Kaito Taguchi
Yujie Gu
Kouichi Sakurai
AAMLDeLMO
53
7
0
29 Mar 2024
A Review of Multi-Modal Large Language and Vision Models
A Review of Multi-Modal Large Language and Vision Models
Kilian Carolan
Laura Fennelly
Alan F. Smeaton
VLM
186
28
0
28 Mar 2024
FACTOID: FACtual enTailment fOr hallucInation Detection
FACTOID: FACtual enTailment fOr hallucInation Detection
Vipula Rawte
S. M. Towhidul
Krishnav Rajbangshi
Shravani Nag
Aman Chadha
Amit P. Sheth
Amitava Das
HILM
97
4
0
28 Mar 2024
"Sorry, Come Again?" Prompting -- Enhancing Comprehension and
  Diminishing Hallucination with [PAUSE]-injected Optimal Paraphrasing
"Sorry, Come Again?" Prompting -- Enhancing Comprehension and Diminishing Hallucination with [PAUSE]-injected Optimal Paraphrasing
Vipula Rawte
Islam Tonmoy
M. M. Zaman
Prachi Priya
Marcin Kardas
Alan Schelten
Ruan Silva
LRM
65
1
0
27 Mar 2024
DORE: A Dataset For Portuguese Definition Generation
DORE: A Dataset For Portuguese Definition Generation
Anna Beatriz Dimas Furtado
Tharindu Ranasinghe
Frédéric Blain
R. Mitkov
54
1
0
26 Mar 2024
A Little Leak Will Sink a Great Ship: Survey of Transparency for Large
  Language Models from Start to Finish
A Little Leak Will Sink a Great Ship: Survey of Transparency for Large Language Models from Start to Finish
Masahiro Kaneko
Timothy Baldwin
PILM
76
4
0
24 Mar 2024
Chain-of-Interaction: Enhancing Large Language Models for Psychiatric
  Behavior Understanding by Dyadic Contexts
Chain-of-Interaction: Enhancing Large Language Models for Psychiatric Behavior Understanding by Dyadic Contexts
Guangzeng Han
Weisi Liu
Xiaolei Huang
Brian Borsari
79
22
0
20 Mar 2024
Inserting Faces inside Captions: Image Captioning with Attention Guided
  Merging
Inserting Faces inside Captions: Image Captioning with Attention Guided Merging
Yannis Tevissen
Khalil Guetari
Marine Tassel
Erwan Kerleroux
Frédéric Petitpont
77
0
0
20 Mar 2024
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models
Yaowei Zheng
Richong Zhang
Junhao Zhang
Yanhan Ye
Zheyan Luo
Zhangchi Feng
Yongqiang Ma
184
560
0
20 Mar 2024
Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Dated Data: Tracing Knowledge Cutoffs in Large Language Models
Jeffrey Cheng
Marc Marone
Orion Weller
Dawn J Lawrie
Daniel Khashabi
Benjamin Van Durme
117
19
0
19 Mar 2024
Loops On Retrieval Augmented Generation (LoRAG)
Loops On Retrieval Augmented Generation (LoRAG)
Ayush Thakur
Rashmi Vashisth
60
1
0
18 Mar 2024
Simple and Scalable Strategies to Continually Pre-train Large Language
  Models
Simple and Scalable Strategies to Continually Pre-train Large Language Models
Adam Ibrahim
Benjamin Thérien
Kshitij Gupta
Mats L. Richter
Quentin Anthony
Timothée Lesort
Eugene Belilovsky
Irina Rish
KELMCLL
111
63
0
13 Mar 2024
Previous
1234567
Next