ResearchTrend.AI
The Falcon Series of Open Language Models

28 November 2023
Ebtesam Almazrouei
Hamza Alobeidli
Abdulaziz Alshamsi
Alessandro Cappelli
Ruxandra-Aimée Cojocaru
Mérouane Debbah
Étienne Goffinet
Daniel Hesslow
Julien Launay
Quentin Malartic
Daniele Mazzotta
Badreddine Noune
B. Pannier
Guilherme Penedo
AI4TS, ALM

Papers citing "The Falcon Series of Open Language Models"

50 / 306 papers shown
Title
SMART: Submodular Data Mixture Strategy for Instruction Tuning
Kowndinya Renduchintala
S. Bhatia
Ganesh Ramakrishnan
96
5
0
13 Mar 2024
Rethinking Generative Large Language Model Evaluation for Semantic Comprehension
Fangyun Wei
Xi Chen
Linzi Luo
ELM, ALM, LRM
63
8
0
12 Mar 2024
CuentosIE: can a chatbot about "tales with a message" help to teach emotional intelligence?
Antonio Ferrández Rodríguez
Rocío Lavigne-Cerván
Jesús Peral Cortés
Ignasi Navarro-Soria
Ángel Lloret
David Gil
Carmen Rocamora
62
1
0
11 Mar 2024
CURATRON: Complete Robust Preference Data for Robust Alignment of Large Language Models
S. Nguyen
Uma-Naresh Niranjan
Theja Tulabandhula
88
0
0
05 Mar 2024
Large language models surpass human experts in predicting neuroscience results
Xiaoliang Luo
Akilles Rechardt
Guangzhi Sun
Kevin K. Nejad
Felipe Yáñez
...
Anna Behler
Chloe M. Hall
J. Dafflon
Sherry Dongqi Bao
Bradley C. Love
91
58
0
04 Mar 2024
LM4OPT: Unveiling the Potential of Large Language Models in Formulating Mathematical Optimization Problems
Tasnim Ahmed
Salimur Choudhury
73
12
0
02 Mar 2024
A Survey of AI-generated Text Forensic Systems: Detection, Attribution, and Characterization
Tharindu Kumarage
Garima Agrawal
Paras Sheth
Raha Moraffah
Amanat Chadha
Joshua Garland
Huan Liu
DeLMO
72
13
0
02 Mar 2024
Massive Activations in Large Language Models
Mingjie Sun
Xinlei Chen
J. Zico Kolter
Zhuang Liu
129
81
0
27 Feb 2024
PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails
Neal Mangaokar
Ashish Hooda
Jihye Choi
Shreyas Chandrashekaran
Kassem Fawaz
Somesh Jha
Atul Prakash
AAML
94
37
0
24 Feb 2024
Linguistic Intelligence in Large Language Models for Telecommunications
Tasnim Ahmed
Nicola Piovesan
Antonio De Domenico
Salimur Choudhury
83
10
0
24 Feb 2024
Chimera: A Lossless Decoding Method for Accelerating Large Language Models Inference by Fusing all Tokens
Huiping Zhuang
Jiahong Yu
Qianshi Pang
Zihao Wang
Huiping Zhuang
Cen Chen
Xiaofeng Zou
81
4
0
24 Feb 2024
Fast Adversarial Attacks on Language Models In One GPU Minute
Vinu Sankar Sadasivan
Shoumik Saha
Gaurang Sriramanan
Priyatham Kattakinda
Atoosa Malemir Chegini
Soheil Feizi
MIALM
106
42
0
23 Feb 2024
API-BLEND: A Comprehensive Corpora for Training and Benchmarking API LLMs
Kinjal Basu
Ibrahim Abdelaziz
Subhajit Chaudhury
Soham Dan
Mayank Agarwal
Asim Munawar
Yara Rizk
Vinod Muthusamy
Pavan Kapanipathi
Luis A. Lastras
144
20
0
23 Feb 2024
PALO: A Polyglot Large Multimodal Model for 5B People
Muhammad Maaz
H. Rasheed
Abdelrahman M. Shaker
Salman Khan
Hisham Cholakal
Rao M. Anwer
Timothy Baldwin
Michael Felsberg
Fahad S. Khan
VLM, LRM
151
15
0
22 Feb 2024
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Zechun Liu
Changsheng Zhao
Forrest N. Iandola
Chen Lai
Yuandong Tian
...
Ernie Chang
Yangyang Shi
Raghuraman Krishnamoorthi
Liangzhen Lai
Vikas Chandra
ALM
141
103
0
22 Feb 2024
Efficient and Effective Vocabulary Expansion Towards Multilingual Large Language Models
Seungduk Kim
Seungtaek Choi
Myeongho Jeong
85
7
0
22 Feb 2024
On the Tip of the Tongue: Analyzing Conceptual Representation in Large Language Models with Reverse-Dictionary Probe
Ningyu Xu
Qi Zhang
Menghan Zhang
Peng Qian
Xuanjing Huang
LRM
131
3
0
22 Feb 2024
ProSparse: Introducing and Enhancing Intrinsic Activation Sparsity within Large Language Models
Chenyang Song
Xu Han
Zhengyan Zhang
Shengding Hu
Xiyu Shi
...
Chen Chen
Zhiyuan Liu
Guanglin Li
Tao Yang
Maosong Sun
167
32
0
21 Feb 2024
FinBen: A Holistic Financial Benchmark for Large Language Models
Qianqian Xie
Weiguang Han
Zhengyu Chen
Ruoyu Xiang
Xiao Zhang
...
Yanzhao Lai
Hao Wang
Min Peng
Sophia Ananiadou
Jimin Huang
AIFin
130
48
0
20 Feb 2024
GenAudit: Fixing Factual Errors in Language Model Outputs with Evidence
Kundan Krishna
S. Ramprasad
Prakhar Gupta
Byron C. Wallace
Zachary Chase Lipton
Jeffrey P. Bigham
HILM, KELM, SyDa
144
9
0
19 Feb 2024
Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs
Jiejun Tan
Zhicheng Dou
Yutao Zhu
Peidong Guo
Kun Fang
Ji-Rong Wen
131
30
0
19 Feb 2024
Direct Large Language Model Alignment Through Self-Rewarding Contrastive Prompt Distillation
Aiwei Liu
Haoping Bai
Zhiyun Lu
Xiang Kong
Simon Wang
Jiulong Shan
Mengsi Cao
Lijie Wen
ALM
74
13
0
19 Feb 2024
Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations
M. Sultan
Jatin Ganhotra
Ramón Fernández Astudillo
LRM
71
3
0
19 Feb 2024
MARS: Meaning-Aware Response Scoring for Uncertainty Estimation in Generative LLMs
Yavuz Faruk Bakman
D. Yaldiz
Baturalp Buyukates
Chenyang Tao
Dimitrios Dimitriadis
A. Avestimehr
109
25
0
19 Feb 2024
Controlled Text Generation for Large Language Model with Dynamic Attribute Graphs
Xun Liang
Hanyu Wang
Shichao Song
Mengting Hu
Xunzhi Wang
Zhiyu Li
Feiyu Xiong
Simin Niu
67
11
0
17 Feb 2024
Disclosure and Mitigation of Gender Bias in LLMs
Xiangjue Dong
Yibo Wang
Philip S. Yu
James Caverlee
69
39
0
17 Feb 2024
BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains
Yanis Labrak
Adrien Bazoge
Emmanuel Morin
P. Gourraud
Mickael Rouvier
Richard Dufour
227
228
0
15 Feb 2024
Attacking Large Language Models with Projected Gradient Descent
Simon Geisler
Tom Wollschlager
M. H. I. Abdalla
Johannes Gasteiger
Stephan Günnemann
AAML, SILM
135
62
0
14 Feb 2024
Anchor-based Large Language Models
Jianhui Pang
Fanghua Ye
Derek F. Wong
Xin He
Wanshun Chen
Longyue Wang
KELM
160
10
0
12 Feb 2024
On the Efficacy of Eviction Policy for Key-Value Constrained Generative Language Model Inference
Siyu Ren
Kenny Q. Zhu
79
30
0
09 Feb 2024
Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia
Guangyu Shen
Shuyang Cheng
Kai-xian Zhang
Guanhong Tao
Shengwei An
Lu Yan
Zhuo Zhang
Shiqing Ma
Xiangyu Zhang
78
15
0
08 Feb 2024
ReLU$^2$ Wins: Discovering Efficient Activation Functions for Sparse LLMs
Zhengyan Zhang
Yixin Song
Guanghui Yu
Xu Han
Yankai Lin
Chaojun Xiao
Chenyang Song
Zhiyuan Liu
Zeyu Mi
Maosong Sun
82
36
0
06 Feb 2024
Data Poisoning for In-context Learning
Pengfei He
Han Xu
Yue Xing
Hui Liu
Makoto Yamada
Jiliang Tang
SILM, AAML
102
13
0
03 Feb 2024
OLMo: Accelerating the Science of Language Models
Dirk Groeneveld
Iz Beltagy
Pete Walsh
Akshita Bhagia
Rodney Michael Kinney
...
Jesse Dodge
Kyle Lo
Luca Soldaini
Noah A. Smith
Hanna Hajishirzi
OSLM
219
413
0
01 Feb 2024
Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain
Gavin Mischler
Yinghao Aaron Li
Stephan Bickel
A. Mehta
N. Mesgarani
86
31
0
31 Jan 2024
Provably Robust Multi-bit Watermarking for AI-generated Text
Wenjie Qu
Dong Yin
Zixin He
Wei Zou
Tianyang Tao
Jinyuan Jia
Jiaheng Zhang
WaLM
251
2
0
30 Jan 2024
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese
N. Corrêa
Sophia Falk
Shiza Fatimah
Aniket Sen
N. D. Oliveira
93
9
0
30 Jan 2024
Large Multi-Modal Models (LMMs) as Universal Foundation Models for AI-Native Wireless Systems
Shengzhe Xu
Christo Kurisummoottil Thomas
Omar Hashash
Nikhil Muralidhar
Walid Saad
Naren Ramakrishnan
92
26
0
30 Jan 2024
CognitiveOS: Large Multimodal Model based System to Endow Any Type of Robot with Generative AI
Artem Lykov
Mikhail Konenkov
Koffivi Fidele Gbagbe
Mikhail Litvinov
D. Davletshin
A. Fedoseev
Miguel Altamirano Cabrera
Robinroy Peter
Dzmitry Tsetserukou
LM&Ro
88
5
0
29 Jan 2024
VIALM: A Survey and Benchmark of Visually Impaired Assistance with Large Models
Yi Zhao
Yilin Zhang
Rong Xiang
Jing Li
Hillming Li
77
16
0
29 Jan 2024
Supporting Sensemaking of Large Language Model Outputs at Scale
Katy Ilonka Gero
Chelse Swoopes
Ziwei Gu
Jonathan K. Kummerfeld
Elena L. Glassman
60
38
0
24 Jan 2024
BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models
Feng-Huei Lin
Hanling Yi
Hongbin Li
Yifan Yang
Xiaotian Yu
Guangming Lu
Rong Xiao
88
4
0
23 Jan 2024
CognitiveDog: Large Multimodal Model Based System to Translate Vision and Language into Action of Quadruped Robot
Artem Lykov
Mikhail Litvinov
Mikhail Konenkov
Rinat Prochii
Nikita Burtsev
Ali Alridha Abdulkarim
Artem Bazhenov
Vladimir Berman
Dzmitry Tsetserukou
VLM, LM&Ro
39
19
0
17 Jan 2024
The What, Why, and How of Context Length Extension Techniques in Large Language Models -- A Detailed Survey
Saurav Pawar
S.M. Towhidul Islam Tonmoy
S. M. M. Zaman
Vinija Jain
Aman Chadha
Amitava Das
68
29
0
15 Jan 2024
Authorship Obfuscation in Multilingual Machine-Generated Text Detection
Dominik Macko
Robert Moro
Adaku Uchendu
Ivan Srba
Jason Samuel Lucas
Michiharu Yamashita
Nafis Irtiza Tripto
Dongwon Lee
Jakub Simko
Maria Bielikova
DeLMO
98
21
0
15 Jan 2024
OOP: Object-Oriented Programming Evaluation Benchmark for Large Language Models
Shuai Wang
Liang Ding
Li Shen
Yong Luo
Bo Du
Dacheng Tao
ELM, ALM
83
3
0
12 Jan 2024
LEGOBench: Scientific Leaderboard Generation Benchmark
Shruti Singh
Shoaib Alam
Husain Malwat
Mayank Singh
ELM
72
1
0
11 Jan 2024
A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism
Brian Thompson
Mehak Preet Dhaliwal
Peter Frisch
Tobias Domhan
Marcello Federico
88
17
0
11 Jan 2024
RoleEval: A Bilingual Role Evaluation Benchmark for Large Language Models
Tianhao Shen
Sun Li
Quan Tu
Deyi Xiong
LLMAG, ELM
63
9
0
26 Dec 2023
VinaLLaMA: LLaMA-based Vietnamese Foundation Model
Quan Van Nguyen
Huy Quang Pham
Dung Dao
ALM
64
8
0
18 Dec 2023