ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2310.16944
  4. Cited By
Zephyr: Direct Distillation of LM Alignment

Zephyr: Direct Distillation of LM Alignment

25 October 2023
Lewis Tunstall
E. Beeching
Nathan Lambert
Nazneen Rajani
Kashif Rasul
Younes Belkada
Shengyi Huang
Leandro von Werra
Clémentine Fourrier
Nathan Habib
Nathan Sarrazin
Omar Sanseviero
Alexander M. Rush
Thomas Wolf
    ALM
ArXivPDFHTML

Papers citing "Zephyr: Direct Distillation of LM Alignment"

50 / 260 papers shown
Title
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate
  Controllable Controversial Statements
Can LLMs Speak For Diverse People? Tuning LLMs via Debate to Generate Controllable Controversial Statements
Ming Li
Jiuhai Chen
Lichang Chen
Dinesh Manocha
71
17
0
16 Feb 2024
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language
  Models
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Gagan Bhatia
El Moatez Billah Nagoudi
Hasan Cavusoglu
Muhammad Abdul-Mageed
AIFin
32
18
0
16 Feb 2024
BioMistral: A Collection of Open-Source Pretrained Large Language Models
  for Medical Domains
BioMistral: A Collection of Open-Source Pretrained Large Language Models for Medical Domains
Yanis Labrak
Adrien Bazoge
Emmanuel Morin
P. Gourraud
Mickael Rouvier
Richard Dufour
105
191
0
15 Feb 2024
Recovering the Pre-Fine-Tuning Weights of Generative Models
Recovering the Pre-Fine-Tuning Weights of Generative Models
Eliahu Horwitz
Jonathan Kahana
Yedid Hoshen
50
9
0
15 Feb 2024
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM
  Instruction-Tuning
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning
Ming Li
Lichang Chen
Jiuhai Chen
Shwai He
Jiuxiang Gu
Dinesh Manocha
29
51
0
15 Feb 2024
AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe
  Approach
AuditLLM: A Tool for Auditing Large Language Models Using Multiprobe Approach
Maryam Amirizaniani
Elias Martin
Tanya Roosta
Aman Chadha
Chirag Shah
26
2
0
14 Feb 2024
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
A Dense Reward View on Aligning Text-to-Image Diffusion with Preference
Shentao Yang
Tianqi Chen
Mingyuan Zhou
EGVM
34
22
0
13 Feb 2024
Refined Direct Preference Optimization with Synthetic Data for
  Behavioral Alignment of LLMs
Refined Direct Preference Optimization with Synthetic Data for Behavioral Alignment of LLMs
Víctor Gallego
SyDa
35
6
0
12 Feb 2024
Gemini Goes to Med School: Exploring the Capabilities of Multimodal
  Large Language Models on Medical Challenge Problems & Hallucinations
Gemini Goes to Med School: Exploring the Capabilities of Multimodal Large Language Models on Medical Challenge Problems & Hallucinations
Ankit Pal
Malaikannan Sankarasubbu
LM&MA
78
35
0
10 Feb 2024
OpenFedLLM: Training Large Language Models on Decentralized Private Data
  via Federated Learning
OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning
Rui Ye
Wenhao Wang
Jingyi Chai
Dihan Li
Zexi Li
Yinda Xu
Yaxin Du
Yanfeng Wang
Siheng Chen
ALM
FedML
AIFin
11
76
0
10 Feb 2024
Large Language Models: A Survey
Large Language Models: A Survey
Shervin Minaee
Tomáš Mikolov
Narjes Nikzad
M. Asgari-Chenaghlu
R. Socher
Xavier Amatriain
Jianfeng Gao
ALM
LM&MA
ELM
134
371
0
09 Feb 2024
Noise Contrastive Alignment of Language Models with Explicit Rewards
Noise Contrastive Alignment of Language Models with Explicit Rewards
Huayu Chen
Guande He
Lifan Yuan
Ganqu Cui
Hang Su
Jun Zhu
60
43
0
08 Feb 2024
Pedagogical Alignment of Large Language Models
Pedagogical Alignment of Large Language Models
Shashank Sonkar
Kangqi Ni
Sapana Chaudhary
Richard G. Baraniuk
AI4Ed
10
6
0
07 Feb 2024
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming
  and Robust Refusal
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Mantas Mazeika
Long Phan
Xuwang Yin
Andy Zou
Zifan Wang
...
Nathaniel Li
Steven Basart
Bo Li
David A. Forsyth
Dan Hendrycks
AAML
23
320
0
06 Feb 2024
Psychological Assessments with Large Language Models: A Privacy-Focused
  and Cost-Effective Approach
Psychological Assessments with Large Language Models: A Privacy-Focused and Cost-Effective Approach
Sergi Blanco-Cuaresma
34
1
0
05 Feb 2024
Automatic Combination of Sample Selection Strategies for Few-Shot
  Learning
Automatic Combination of Sample Selection Strategies for Few-Shot Learning
Branislav Pecher
Ivan Srba
M. Bieliková
Joaquin Vanschoren
34
1
0
05 Feb 2024
Decoding-time Realignment of Language Models
Decoding-time Realignment of Language Models
Tianlin Liu
Shangmin Guo
Leonardo Bianco
Daniele Calandriello
Quentin Berthet
Felipe Llinares-López
Jessica Hoffmann
Lucas Dixon
Michal Valko
Mathieu Blondel
AI4CE
54
35
0
05 Feb 2024
GIRT-Model: Automated Generation of Issue Report Templates
GIRT-Model: Automated Generation of Issue Report Templates
Nafiseh Nikeghbal
Amir Hossein Kargaran
Abbas Heydarnoori
20
2
0
04 Feb 2024
BRAIn: Bayesian Reward-conditioned Amortized Inference for natural
  language generation from feedback
BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback
Gaurav Pandey
Yatin Nandwani
Tahira Naseem
Mayank Mishra
Guangxuan Xu
Dinesh Raghu
Sachindra Joshi
Asim Munawar
Ramón Fernández Astudillo
BDL
44
3
0
04 Feb 2024
LongAlign: A Recipe for Long Context Alignment of Large Language Models
LongAlign: A Recipe for Long Context Alignment of Large Language Models
Yushi Bai
Xin Lv
Jiajie Zhang
Yuze He
Ji Qi
Lei Hou
Jie Tang
Yuxiao Dong
Juanzi Li
ALM
42
45
0
31 Jan 2024
RE-GAINS & EnChAnT: Intelligent Tool Manipulation Systems For Enhanced
  Query Responses
RE-GAINS & EnChAnT: Intelligent Tool Manipulation Systems For Enhanced Query Responses
Sahil Girhepuje
Siva Sankar Sajeev
Purvam Jain
Arya Sikder
Adithya Rama Varma
Ryan George
Akshay Govind Srinivasan
Mahendra Kurup
Ashmit Sinha
Sudip Mondal
RALM
37
0
0
28 Jan 2024
Airavata: Introducing Hindi Instruction-tuned LLM
Airavata: Introducing Hindi Instruction-tuned LLM
Jay Gala
Thanmay Jayakumar
Jaavid Aktar Husain
M. AswanthKumar
Mohammed Safi Ur Rahman Khan
...
Ratish Puduppully
Mitesh M. Khapra
Raj Dabre
Rudra Murthy
Anoop Kunchukuttan
42
23
0
26 Jan 2024
Spatial Transcriptomics Analysis of Zero-shot Gene Expression Prediction
Spatial Transcriptomics Analysis of Zero-shot Gene Expression Prediction
Yan Yang
Md. Zakir Hossain
Xuesong Li
Shafin Rahman
Eric A. Stone
17
4
0
26 Jan 2024
LongHealth: A Question Answering Benchmark with Long Clinical Documents
LongHealth: A Question Answering Benchmark with Long Clinical Documents
Lisa Christine Adams
Felix Busch
T. Han
Jean-Baptiste Excoffier
Matthieu Ortala
Alexander Loser
Hugo J. W. L. Aerts
Jakob Nikolas Kather
Daniel Truhn
Keno Bressem
ELM
LM&MA
AI4MH
39
10
0
25 Jan 2024
GRATH: Gradual Self-Truthifying for Large Language Models
GRATH: Gradual Self-Truthifying for Large Language Models
Weixin Chen
D. Song
Bo-wen Li
HILM
SyDa
33
5
0
22 Jan 2024
PHOENIX: Open-Source Language Adaption for Direct Preference
  Optimization
PHOENIX: Open-Source Language Adaption for Direct Preference Optimization
Matthias Uhlig
Sigurd Schacht
Sudarshan Kamath Barkur
ALM
14
1
0
19 Jan 2024
Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on
  Data-to-Text Generation
Beyond Traditional Benchmarks: Analyzing Behaviors of Open LLMs on Data-to-Text Generation
Zdeněk Kasner
Ondrej Dusek
33
8
0
18 Jan 2024
Carrying over algorithm in transformers
Carrying over algorithm in transformers
J. Kruthoff
24
0
0
15 Jan 2024
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Generating Diverse and High-Quality Texts by Minimum Bayes Risk Decoding
Yuu Jinnai
Ukyo Honda
Tetsuro Morimura
Peinan Zhang
31
6
0
10 Jan 2024
Computational Argumentation-based Chatbots: a Survey
Computational Argumentation-based Chatbots: a Survey
Federico Castagna
Nadin Kökciyan
I. Sassoon
Simon Parsons
Elizabeth I. Sklar
28
6
0
07 Jan 2024
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language
  Models
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Zixiang Chen
Yihe Deng
Huizhuo Yuan
Kaixuan Ji
Quanquan Gu
SyDa
41
275
0
02 Jan 2024
Building Efficient Universal Classifiers with Natural Language Inference
Building Efficient Universal Classifiers with Natural Language Inference
Moritz Laurer
W. Atteveldt
Andreu Casas
Kasper Welbers
33
8
0
29 Dec 2023
Some things are more CRINGE than others: Iterative Preference
  Optimization with the Pairwise Cringe Loss
Some things are more CRINGE than others: Iterative Preference Optimization with the Pairwise Cringe Loss
Jing Xu
Andrew Lee
Sainbayar Sukhbaatar
Jason Weston
15
86
0
27 Dec 2023
Zero-Shot Cross-Lingual Reranking with Large Language Models for
  Low-Resource Languages
Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages
Mofetoluwa Adeyemi
Akintunde Oladipo
Ronak Pradeep
Jimmy J. Lin
27
1
0
26 Dec 2023
SecQA: A Concise Question-Answering Dataset for Evaluating Large
  Language Models in Computer Security
SecQA: A Concise Question-Answering Dataset for Evaluating Large Language Models in Computer Security
Zefang Liu
ELM
6
24
0
26 Dec 2023
MetaAID 2.5: A Secure Framework for Developing Metaverse Applications
  via Large Language Models
MetaAID 2.5: A Secure Framework for Developing Metaverse Applications via Large Language Models
Hongyin Zhu
36
6
0
22 Dec 2023
Experimenting with Large Language Models and vector embeddings in NASA
  SciX
Experimenting with Large Language Models and vector embeddings in NASA SciX
Sergi Blanco-Cuaresma
I. Ciucă
Alberto Accomazzi
Michael J. Kurtz
E. Henneken
...
Fernanda de Macedo Alves
Jean-Claude Paquin
Jennifer Bartlett
Mugdha S. Polimera
S. Jarmak
34
1
0
21 Dec 2023
Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's
  LLM with Open Source SLMs in Production
Scaling Down to Scale Up: A Cost-Benefit Analysis of Replacing OpenAI's LLM with Open Source SLMs in Production
Chandra Irugalbandara
Ashish Mahendra
Roland Daynauth
T. Arachchige
Jayanaka L. Dantanarayana
K. Flautner
Lingjia Tang
Yiping Kang
Jason Mars
ELM
28
14
0
20 Dec 2023
Language Resources for Dutch Large Language Modelling
Language Resources for Dutch Large Language Modelling
Bram Vanroy
MoE
ALM
23
7
0
20 Dec 2023
Climate Change from Large Language Models
Climate Change from Large Language Models
Hongyin Zhu
Prayag Tiwari
ELM
35
7
0
19 Dec 2023
Iterative Preference Learning from Human Feedback: Bridging Theory and
  Practice for RLHF under KL-Constraint
Iterative Preference Learning from Human Feedback: Bridging Theory and Practice for RLHF under KL-Constraint
Wei Xiong
Hanze Dong
Chen Ye
Ziqi Wang
Han Zhong
Heng Ji
Nan Jiang
Tong Zhang
OffRL
38
161
0
18 Dec 2023
Marathon: A Race Through the Realm of Long Context with Large Language
  Models
Marathon: A Race Through the Realm of Long Context with Large Language Models
Lei Zhang
Yunshui Li
Ziqiang Liu
Jiaxi Yang
Junhao Liu
Longze Chen
Run Luo
Min Yang
OffRL
LRM
45
5
0
15 Dec 2023
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs
O. Ovadia
Menachem Brief
Moshik Mishaeli
Oren Elisha
RALM
34
132
0
10 Dec 2023
SeaLLMs -- Large Language Models for Southeast Asia
SeaLLMs -- Large Language Models for Southeast Asia
Xuan-Phi Nguyen
Wenxuan Zhang
Xin Li
Mahani Aljunied
Zhiqiang Hu
...
Yue Deng
Sen Yang
Chaoqun Liu
Hang Zhang
Li Bing
LRM
29
73
0
01 Dec 2023
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models
  Catching up?
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?
Hailin Chen
Fangkai Jiao
Xingxuan Li
Chengwei Qin
Mathieu Ravaut
Ruochen Zhao
Caiming Xiong
Shafiq R. Joty
ELM
CLL
AI4MH
LRM
ALM
85
27
0
28 Nov 2023
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models
Samuele Poppi
Tobia Poppi
Federico Cocchi
Marcella Cornia
Lorenzo Baraldi
Rita Cucchiara
VLM
24
8
0
27 Nov 2023
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2
Hamish Ivison
Yizhong Wang
Valentina Pyatkin
Nathan Lambert
Matthew E. Peters
...
Joel Jang
David Wadden
Noah A. Smith
Iz Beltagy
Hanna Hajishirzi
ALM
ELM
26
180
0
17 Nov 2023
TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in
  Event Extraction
TextEE: Benchmark, Reevaluation, Reflections, and Future Challenges in Event Extraction
Kuan-Hao Huang
I-Hung Hsu
Tanmay Parekh
Zhiyu Xie
Zixuan Zhang
Premkumar Natarajan
Kai-Wei Chang
Nanyun Peng
Heng Ji
29
16
0
16 Nov 2023
Grounding Gaps in Language Model Generations
Grounding Gaps in Language Model Generations
Omar Shaikh
Kristina Gligorić
Ashna Khetan
Matthias Gerstgrasser
Diyi Yang
Dan Jurafsky
18
21
0
15 Nov 2023
How Well Do Large Language Models Truly Ground?
How Well Do Large Language Models Truly Ground?
Hyunji Lee
Se June Joo
Chaeeun Kim
Joel Jang
Doyoung Kim
Kyoung-Woon On
Minjoon Seo
HILM
33
6
0
15 Nov 2023
Previous
123456
Next