Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2308.07317
Cited By
Platypus: Quick, Cheap, and Powerful Refinement of LLMs
14 August 2023
Ariel N. Lee
Cole J. Hunter
Nataniel Ruiz
ALM
ObjD
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Platypus: Quick, Cheap, and Powerful Refinement of LLMs"
50 / 109 papers shown
Title
LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction
Bo Zou
Chao Yang
Yu Qiao
Chengbin Quan
Youjian Zhao
47
6
0
01 Apr 2024
The Unreasonable Ineffectiveness of the Deeper Layers
Andrey Gromov
Kushal Tirumala
Hassan Shapourian
Paolo Glorioso
Daniel A. Roberts
46
79
0
26 Mar 2024
Cross-lingual Contextualized Phrase Retrieval
Huayang Li
Deng Cai
Zhi Qu
Qu Cui
Hidetaka Kamigaito
Lemao Liu
Taro Watanabe
34
0
0
25 Mar 2024
KIT-19: A Comprehensive Korean Instruction Toolkit on 19 Tasks for Fine-Tuning Korean Large Language Models
Dongjun Jang
Sungjoo Byun
Hyemi Jo
Hyopil Shin
ALM
21
0
0
25 Mar 2024
WangchanLion and WangchanX MRC Eval
Wannaphong Phatthiyaphaibun
Surapon Nonesung
Patomporn Payoungkhamdee
Peerat Limkonchotiwat
Can Udomcharoenchaikit
Jitkapat Sawatphol
Chompakorn Chaksangchaichot
E. Chuangsuwanich
Sarana Nutanong
50
0
0
24 Mar 2024
Reinforcement Learning from Reflective Feedback (RLRF): Aligning and Improving LLMs via Fine-Grained Self-Reflection
Kyungjae Lee
Dasol Hwang
Sunghyun Park
Youngsoo Jang
Moontae Lee
46
8
0
21 Mar 2024
Do Large Language Models Solve ARC Visual Analogies Like People Do?
Gustaw Opielka
Hannes Rosenbusch
Veerle Vijverberg
Claire E. Stevenson
LRM
32
6
0
13 Mar 2024
InjecAgent: Benchmarking Indirect Prompt Injections in Tool-Integrated Large Language Model Agents
Qiusi Zhan
Zhixiang Liang
Zifan Ying
Daniel Kang
LLMAG
46
73
0
05 Mar 2024
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
Yiming Huang
Xiao Liu
Yeyun Gong
Zhibin Gou
Yelong Shen
Nan Duan
Weizhu Chen
AIMat
LRM
58
36
0
04 Mar 2024
Birbal: An efficient 7B instruct-model fine-tuned with curated datasets
Ashvini Jindal
P. Rajpoot
Ankur P. Parikh
35
6
0
04 Mar 2024
Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks
Fakhraddin Alwajih
El Moatez Billah Nagoudi
Gagan Bhatia
Abdelrahman Mohamed
Muhammad Abdul-Mageed
VLM
LRM
35
11
0
01 Mar 2024
AutoRD: An Automatic and End-to-End System for Rare Disease Knowledge Graph Construction Based on Ontologies-enhanced Large Language Models
Lang Cao
Jimeng Sun
Adam Cross
30
3
0
01 Mar 2024
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems
Zhenting Qi
Hanlin Zhang
Eric Xing
Sham Kakade
Hima Lakkaraju
SILM
44
18
0
27 Feb 2024
HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs
Cem Uluoglakci
T. Taşkaya-Temizel
HILM
35
2
0
25 Feb 2024
EHRNoteQA: An LLM Benchmark for Real-World Clinical Practice Using Discharge Summaries
Sunjun Kweon
Jiyoun Kim
Heeyoung Kwak
Dongchul Cha
Hangyul Yoon
Kwanghyun Kim
Jeewon Yang
Seunghyun Won
Edward Choi
LM&MA
32
4
0
25 Feb 2024
Take the Bull by the Horns: Hard Sample-Reweighted Continual Training Improves LLM Generalization
Xuxi Chen
Zhendong Wang
Daouda Sow
Junjie Yang
Tianlong Chen
Yingbin Liang
Mingyuan Zhou
Zhangyang Wang
34
5
0
22 Feb 2024
Reformatted Alignment
Run-Ze Fan
Xuefeng Li
Haoyang Zou
Junlong Li
Shwai He
Ethan Chern
Jiewen Hu
Pengfei Liu
62
8
0
19 Feb 2024
Generation Meets Verification: Accelerating Large Language Model Inference with Smart Parallel Auto-Correct Decoding
Hanling Yi
Feng-Huei Lin
Hongbin Li
Peiyang Ning
Xiaotian Yu
Rong Xiao
LRM
21
10
0
19 Feb 2024
eCeLLM: Generalizing Large Language Models for E-commerce from Large-scale, High-quality Instruction Data
B. Peng
Xinyi Ling
Ziru Chen
Huan Sun
Xia Ning
ELM
37
16
0
13 Feb 2024
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements
Alex Havrilla
Sharath Raparthy
Christoforus Nalmpantis
Jane Dwivedi-Yu
Maksym Zhuravinskyi
Eric Hambro
Roberta Railneau
ReLM
LRM
41
50
0
13 Feb 2024
LLaVA-Docent: Instruction Tuning with Multimodal Large Language Model to Support Art Appreciation Education
Unggi Lee
Minji Jeon
Yunseo Lee
Gyuri Byun
Yoorim Son
Jaeyoon Shin
Hongkyu Ko
Hyeoncheol Kim
14
8
0
09 Feb 2024
Evading Data Contamination Detection for Language Models is (too) Easy
Jasper Dekoninck
Mark Niklas Muller
Maximilian Baader
Marc Fischer
Martin Vechev
96
18
0
05 Feb 2024
BiTA: Bi-Directional Tuning for Lossless Acceleration in Large Language Models
Feng-Huei Lin
Hanling Yi
Hongbin Li
Yifan Yang
Xiaotian Yu
Guangming Lu
Rong Xiao
39
3
0
23 Jan 2024
Distilling Mathematical Reasoning Capabilities into Small Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
LRM
34
9
0
22 Jan 2024
Knowledge Fusion of Large Language Models
Fanqi Wan
Xinting Huang
Deng Cai
Xiaojun Quan
Wei Bi
Shuming Shi
MoMe
34
61
0
19 Jan 2024
Effects of diversity incentives on sample diversity and downstream model performance in LLM-based text augmentation
Ján Cegin
Branislav Pecher
Jakub Simko
Ivan Srba
M. Bieliková
Peter Brusilovsky
33
10
0
12 Jan 2024
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation
Mahdi Nikdan
Soroush Tabesh
Elvir Crnčević
Dan Alistarh
8
27
0
09 Jan 2024
Reasons to Reject? Aligning Language Models with Judgments
Weiwen Xu
Deng Cai
Zhisong Zhang
Wai Lam
Shuming Shi
ALM
21
14
0
22 Dec 2023
Urban Generative Intelligence (UGI): A Foundational Platform for Agents in Embodied City Environment
Fengli Xu
Jun Zhang
Chen Gao
J. Feng
Yong Li
AI4CE
LLMAG
24
29
0
19 Dec 2023
VinaLLaMA: LLaMA-based Vietnamese Foundation Model
Quan Van Nguyen
Huy Quang Pham
Dung Dao
ALM
21
8
0
18 Dec 2023
Rethinking the Instruction Quality: LIFT is What You Need
Yang Xu
Yongqiang Yao
Yufan Huang
Mengnan Qi
Maoquan Wang
Bin Gu
Neel Sundaresan
ALM
21
35
0
12 Dec 2023
Batched Low-Rank Adaptation of Foundation Models
Yeming Wen
Swarat Chaudhuri
OffRL
21
19
0
09 Dec 2023
E4SRec: An Elegant Effective Efficient Extensible Solution of Large Language Models for Sequential Recommendation
Xinhang Li
Chong Chen
Xiangyu Zhao
Yong Zhang
Chunxiao Xing
89
41
0
05 Dec 2023
Jellyfish: A Large Language Model for Data Preprocessing
Haochen Zhang
Yuyang Dong
Chuan Xiao
M. Oyamada
29
26
0
04 Dec 2023
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Bill Yuchen Lin
Abhilasha Ravichander
Ximing Lu
Nouha Dziri
Melanie Sclar
Khyathi Raghavi Chandu
Chandra Bhagavatula
Yejin Choi
22
164
0
04 Dec 2023
SymNoise: Advancing Language Model Fine-tuning with Symmetric Noise
A. Yadav
Arjun Singh
43
2
0
03 Dec 2023
SeaLLMs -- Large Language Models for Southeast Asia
Xuan-Phi Nguyen
Wenxuan Zhang
Xin Li
Mahani Aljunied
Zhiqiang Hu
...
Yue Deng
Sen Yang
Chaoqun Liu
Hang Zhang
Li Bing
LRM
29
73
0
01 Dec 2023
RefinedFields: Radiance Fields Refinement for Unconstrained Scenes
Karim Kassab
Antoine Schnepf
Jean-Yves Franceschi
Laurent Caraffa
Jeremie Mary
Valérie Gouet-Brunet
VGen
30
7
0
01 Dec 2023
ChatGPT's One-year Anniversary: Are Open-Source Large Language Models Catching up?
Hailin Chen
Fangkai Jiao
Xingxuan Li
Chengwei Qin
Mathieu Ravaut
Ruochen Zhao
Caiming Xiong
Shafiq R. Joty
ELM
CLL
AI4MH
LRM
ALM
85
27
0
28 Nov 2023
Large Language Models Meet Computer Vision: A Brief Survey
Raby Hamadi
LM&MA
26
4
0
28 Nov 2023
Comprehensive Benchmarking of Entropy and Margin Based Scoring Metrics for Data Selection
Anusha Sabbineni
Nikhil Anand
Maria Minakova
27
0
0
27 Nov 2023
How Far Have We Gone in Vulnerability Detection Using Large Language Models
Zeyu Gao
Hao Wang
Yuchen Zhou
Wenyu Zhu
Chao Zhang
24
17
0
21 Nov 2023
Which is better? Exploring Prompting Strategy For LLM-based Metrics
Joonghoon Kim
Saeran Park
Kiyoon Jeong
Sangmin Lee
S. Han
Jiyoon Lee
Pilsung Kang
6
15
0
07 Nov 2023
Correction with Backtracking Reduces Hallucination in Summarization
Zhenzhen Liu
Chao-gang Wan
Varsha Kishore
Jin Peng Zhou
Minmin Chen
Kilian Q. Weinberger
HILM
26
3
0
24 Oct 2023
NEFTune: Noisy Embeddings Improve Instruction Finetuning
Neel Jain
Ping Yeh-Chiang
Yuxin Wen
John Kirchenbauer
Hong-Min Chu
...
Avi Schwarzschild
Aniruddha Saha
Micah Goldblum
Jonas Geiping
Tom Goldstein
23
75
0
09 Oct 2023
Guiding Language Model Math Reasoning with Planning Tokens
Xinyi Wang
Lucas Caccia
O. Ostapenko
Xingdi Yuan
William Yang Wang
Alessandro Sordoni
LRM
33
19
0
09 Oct 2023
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning
Chengpeng Li
Zheng Yuan
Hongyi Yuan
Guanting Dong
Keming Lu
Jiancan Wu
Chuanqi Tan
Xiang Wang
Chang Zhou
LRM
20
21
0
09 Oct 2023
EMO: Earth Mover Distance Optimization for Auto-Regressive Language Modeling
Siyu Ren
Zhiyong Wu
Kenny Q. Zhu
26
3
0
07 Oct 2023
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Zhibin Gou
Zhihong Shao
Yeyun Gong
Yelong Shen
Yujiu Yang
Minlie Huang
Nan Duan
Weizhu Chen
LRM
AI4CE
LLMAG
49
145
0
29 Sep 2023
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
L. Yu
Weisen Jiang
Han Shi
Jincheng Yu
Zhengying Liu
Yu Zhang
James T. Kwok
Zheng Li
Adrian Weller
Weiyang Liu
OSLM
LRM
41
329
0
21 Sep 2023
Previous
1
2
3
Next