ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.05463
  4. Cited By
Textbooks Are All You Need II: phi-1.5 technical report

Textbooks Are All You Need II: phi-1.5 technical report

11 September 2023
Yuan-Fang Li
Sébastien Bubeck
Ronen Eldan
Allison Del Giorno
Suriya Gunasekar
Yin Tat Lee
    ALM
    LRM
ArXivPDFHTML

Papers citing "Textbooks Are All You Need II: phi-1.5 technical report"

50 / 338 papers shown
Title
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion
Guanchu Wang
Yu-Neng Chuang
Ruixiang Tang
Shaochen Zhong
Jiayi Yuan
...
Zirui Liu
V. Chaudhary
Shuai Xu
James Caverlee
Xia Hu
PILM
84
1
0
06 Oct 2024
Self-Powered LLM Modality Expansion for Large Speech-Text Models
Self-Powered LLM Modality Expansion for Large Speech-Text Models
Tengfei Yu
Xuebo Liu
Zhiyi Hou
Liang Ding
Dacheng Tao
Min Zhang
32
0
0
04 Oct 2024
A Probabilistic Perspective on Unlearning and Alignment for Large Language Models
A Probabilistic Perspective on Unlearning and Alignment for Large Language Models
Yan Scholten
Stephan Günnemann
Leo Schwinn
MU
63
6
0
04 Oct 2024
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models
Junfeng Fang
Houcheng Jiang
Kun Wang
Yunshan Ma
Shi Jie
Xiangnan He
Tat-Seng Chua
Tat-seng Chua
KELM
44
34
0
03 Oct 2024
CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving
  Long-Range Reasoning Problems using LLMs
CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs
Kangsheng Wang
Xiao Zhang
Hao Liu
Songde Han
Huimin Ma
Tianyu Hu
LRM
59
5
0
02 Oct 2024
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
U-shaped and Inverted-U Scaling behind Emergent Abilities of Large Language Models
Tung-Yu Wu
Pei-Yu Lo
ReLM
LRM
46
2
0
02 Oct 2024
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge
  Distillation for Large Language Models in Code Generation
AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation
Ziyang Luo
Xin Li
Hongzhan Lin
Jing Ma
Lidong Bing
VLM
29
0
0
01 Oct 2024
Scrambled text: training Language Models to correct OCR errors using
  synthetic data
Scrambled text: training Language Models to correct OCR errors using synthetic data
Jonathan Bourne
SyDa
38
2
0
29 Sep 2024
Unified Gradient-Based Machine Unlearning with Remain Geometry
  Enhancement
Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement
Zhehao Huang
Xinwen Cheng
JingHao Zheng
Haoran Wang
Zhengbao He
Tao Li
X. Huang
MU
47
6
0
29 Sep 2024
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining
  for Clinical LLMs
Beyond Fine-tuning: Unleashing the Potential of Continuous Pretraining for Clinical LLMs
Clément Christophe
Tathagata Raha
Svetlana Maslenkova
Muhammad Umar Salman
Praveen K Kanithi
Marco AF Pimentel
Shadab Khan
LM&MA
41
2
0
23 Sep 2024
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time
Time Awareness in Large Language Models: Benchmarking Fact Recall Across Time
David Herel
Vojtech Bartek
Jiri Jirak
Tomáš Mikolov
50
3
0
20 Sep 2024
CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Causal Significance and Consistency
CSCE: Boosting LLM Reasoning by Simultaneous Enhancing of Causal Significance and Consistency
Kangsheng Wang
Xiao Zhang
Zizheng Guo
Tianyu Hu
Huimin Ma
LRM
48
7
0
20 Sep 2024
MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts
MEOW: MEMOry Supervised LLM Unlearning Via Inverted Facts
Tianle Gu
Kexin Huang
Ruilin Luo
Yuanqi Yao
Yujiu Yang
Yan Teng
Yingchun Wang
MU
42
5
0
18 Sep 2024
Synthetic continued pretraining
Synthetic continued pretraining
Zitong Yang
Neil Band
Shuangping Li
Emmanuel Candès
Tatsunori Hashimoto
CLL
SyDa
41
11
0
11 Sep 2024
An Investigation of Warning Erroneous Chat Translations in Cross-lingual
  Communication
An Investigation of Warning Erroneous Chat Translations in Cross-lingual Communication
Yunmeng Li
Jun Suzuki
Makoto Morishita
Kaori Abe
Kentaro Inui
65
1
0
28 Aug 2024
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and
  Deduplication by Introducing a Competitive Large Language Model Baseline
BaichuanSEED: Sharing the Potential of ExtensivE Data Collection and Deduplication by Introducing a Competitive Large Language Model Baseline
Guosheng Dong
Zhuoran Zhang
Yiding Sun
Da Pan
Zheng Liang
...
Bingning Wang
Wentao Zhang
Jiaxin Mao
Zenan Zhou
Weipeng Chen
ALM
48
2
0
27 Aug 2024
Assessing Contamination in Large Language Models: Introducing the
  LogProber method
Assessing Contamination in Large Language Models: Introducing the LogProber method
Nicolas Yax
Pierre-Yves Oudeyer
Stefano Palminteri
40
4
0
26 Aug 2024
On-Device Language Models: A Comprehensive Review
On-Device Language Models: A Comprehensive Review
Jiajun Xu
Zhiyuan Li
Wei Chen
Qun Wang
Xin Gao
Qi Cai
Ziyuan Ling
50
27
0
26 Aug 2024
Enhancing SQL Query Generation with Neurosymbolic Reasoning
Enhancing SQL Query Generation with Neurosymbolic Reasoning
Henrijs Princis
Cristina David
Alan Mycroft
42
2
0
25 Aug 2024
A Law of Next-Token Prediction in Large Language Models
A Law of Next-Token Prediction in Large Language Models
Hangfeng He
Weijie J. Su
35
5
0
24 Aug 2024
CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
CRUXEval-X: A Benchmark for Multilingual Code Reasoning, Understanding and Execution
Ruiyang Xu
Jialun Cao
Yaojie Lu
Ming Wen
Hongyu Lin
Xianpei Han
Ben He
Shing-Chi Cheung
Le Sun
LRM
ELM
39
3
0
23 Aug 2024
Show-o: One Single Transformer to Unify Multimodal Understanding and
  Generation
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
Jinheng Xie
Weijia Mao
Zechen Bai
David Junhao Zhang
Weihao Wang
Kevin Qinghong Lin
Yuchao Gu
Zhijie Chen
Zhenheng Yang
Mike Zheng Shou
57
165
0
22 Aug 2024
To Code, or Not To Code? Exploring Impact of Code in Pre-training
To Code, or Not To Code? Exploring Impact of Code in Pre-training
Viraat Aryabumi
Yixuan Su
Raymond Ma
Adrien Morisot
Ivan Zhang
Acyr Locatelli
Marzieh Fadaee
Ahmet Üstün
Sara Hooker
SyDa
AI4CE
48
19
0
20 Aug 2024
CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing
  Hallucinations in LVLMs
CLIP-DPO: Vision-Language Models as a Source of Preference for Fixing Hallucinations in LVLMs
Yassine Ouali
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
VLM
MLLM
40
18
0
19 Aug 2024
Leveraging Web-Crawled Data for High-Quality Fine-Tuning
Leveraging Web-Crawled Data for High-Quality Fine-Tuning
Jing Zhou
Chenglin Jiang
Wei Shen
Xiao Zhou
Xiaonan He
ALM
50
3
0
15 Aug 2024
Investigating Instruction Tuning Large Language Models on Graphs
Investigating Instruction Tuning Large Language Models on Graphs
Kerui Zhu
Bo-Wei Huang
Bowen Jin
Yizhu Jiao
Ming Zhong
Kevin Chang
Shou-De Lin
Jiawei Han
69
2
0
10 Aug 2024
Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective
  Alignment with Contrastive Prompts
Unlocking Decoding-time Controllability: Gradient-Free Multi-Objective Alignment with Contrastive Prompts
Tingchen Fu
Yupeng Hou
Julian McAuley
Rui Yan
38
3
0
09 Aug 2024
In2Core: Leveraging Influence Functions for Coreset Selection in
  Instruction Finetuning of Large Language Models
In2Core: Leveraging Influence Functions for Coreset Selection in Instruction Finetuning of Large Language Models
Ayrton San Joaquin
Bin Wang
Zhengyuan Liu
Nicholas Asher
Brian Lim
Philippe Muller
Nancy Chen
42
0
0
07 Aug 2024
ULLME: A Unified Framework for Large Language Model Embeddings with
  Generation-Augmented Learning
ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning
Hieu Man
Nghia Trung Ngo
Franck Dernoncourt
Thien Huu Nguyen
AI4TS
48
4
0
06 Aug 2024
Large Language Model Aided QoS Prediction for Service Recommendation
Large Language Model Aided QoS Prediction for Service Recommendation
Huiying Liu
Zekun Zhang
Honghao Li
Qilin Wu
Yiwen Zhang
20
1
0
05 Aug 2024
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
Why Are My Prompts Leaked? Unraveling Prompt Extraction Threats in Customized Large Language Models
Zi Liang
Haibo Hu
Qingqing Ye
Yaxin Xiao
Haoyang Li
AAML
ELM
SILM
56
6
0
05 Aug 2024
The Impact of Hyperparameters on Large Language Model Inference
  Performance: An Evaluation of vLLM and HuggingFace Pipelines
The Impact of Hyperparameters on Large Language Model Inference Performance: An Evaluation of vLLM and HuggingFace Pipelines
Agathe Balayn
53
2
0
02 Aug 2024
PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized
  Language Prompting
PERSOMA: PERsonalized SOft ProMpt Adapter Architecture for Personalized Language Prompting
Liam Hebert
Krishna Sayana
Ambarish Jash
Alexandros Karatzoglou
Geordie Williamson
Sumanth Doddapaneni
Yanli Cai
Dima Kuzmin
36
3
0
02 Aug 2024
Improving Text Embeddings for Smaller Language Models Using Contrastive
  Fine-tuning
Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning
Trapoom Ukarapol
Zhicheng Lee
Amy Xin
20
1
0
01 Aug 2024
Finch: Prompt-guided Key-Value Cache Compression
Finch: Prompt-guided Key-Value Cache Compression
Giulio Corallo
Paolo Papotti
38
3
0
31 Jul 2024
ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2
ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2
Wenjun Huang
Jiakai Pan
Jiahao Tang
Yanyu Ding
Yifei Xing
Yuhe Wang
Zhengzhuo Wang
Jianguo Hu
Mamba
47
5
0
29 Jul 2024
Understanding Memorisation in LLMs: Dynamics, Influencing Factors, and
  Implications
Understanding Memorisation in LLMs: Dynamics, Influencing Factors, and Implications
Till Speicher
Mohammad Aflah Khan
Qinyuan Wu
Vedant Nanda
Soumi Das
Bishwamittra Ghosh
Krishna P. Gummadi
Evimaria Terzi
49
3
0
27 Jul 2024
Self-Directed Synthetic Dialogues and Revisions Technical Report
Self-Directed Synthetic Dialogues and Revisions Technical Report
Nathan Lambert
Hailey Schoelkopf
Aaron Gokaslan
Luca Soldaini
Valentina Pyatkin
Louis Castricato
SyDa
45
2
0
25 Jul 2024
Exploring Description-Augmented Dataless Intent Classification
Exploring Description-Augmented Dataless Intent Classification
Ruoyu Hu
Foaad Khosmood
Abbas Edalat
AI4TS
45
0
0
25 Jul 2024
DDK: Distilling Domain Knowledge for Efficient Large Language Models
DDK: Distilling Domain Knowledge for Efficient Large Language Models
Jiaheng Liu
Chenchen Zhang
Jinyang Guo
Yuanxing Zhang
Haoran Que
...
Congnan Liu
Wenbo Su
Jiamang Wang
Lin Qu
Bo Zheng
48
3
0
23 Jul 2024
SynCPKL: Harnessing LLMs to Generate Synthetic Data for Commonsense
  Persona Knowledge Linking
SynCPKL: Harnessing LLMs to Generate Synthetic Data for Commonsense Persona Knowledge Linking
Kuan-Yen Lin
50
0
0
21 Jul 2024
Open Artificial Knowledge
Open Artificial Knowledge
Vadim Borisov
Richard H. Schreiber
53
0
0
19 Jul 2024
Agent-E: From Autonomous Web Navigation to Foundational Design
  Principles in Agentic Systems
Agent-E: From Autonomous Web Navigation to Foundational Design Principles in Agentic Systems
Tamer Abuelsaad
Deepak Akkil
Prasenjit Dey
Ashish Jagmohan
Aditya Vempaty
Ravi Kokku
46
23
0
17 Jul 2024
From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive
  Language for Bias Reduction in LLMs
From 'Showgirls' to 'Performers': Fine-tuning with Gender-inclusive Language for Bias Reduction in LLMs
Marion Bartl
Susan Leavy
43
8
0
05 Jul 2024
LLM Roleplay: Simulating Human-Chatbot Interaction
LLM Roleplay: Simulating Human-Chatbot Interaction
Hovhannes Tamoyan
Hendrik Schuff
Iryna Gurevych
52
8
0
04 Jul 2024
Text2TimeSeries: Enhancing Financial Forecasting through Time Series
  Prediction Updates with Event-Driven Insights from Large Language Models
Text2TimeSeries: Enhancing Financial Forecasting through Time Series Prediction Updates with Event-Driven Insights from Large Language Models
Litton J. Kurisinkel
Pruthwik Mishra
Yue Zhang
45
4
0
04 Jul 2024
A Survey of Data Synthesis Approaches
A Survey of Data Synthesis Approaches
Hsin-Yu Chang
Pei-Yu Chen
Tun-Hsiang Chou
Chang-Sheng Kao
Hsuan-Yun Yu
Yen-Ting Lin
Yun-Nung Chen
44
6
0
04 Jul 2024
Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through
  Self-Correction in Language Models
Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Models
Haritz Puerto
Tilek Chubakov
Xiaodan Zhu
Harish Tayyar Madabushi
Iryna Gurevych
ReLM
LRM
52
9
1
03 Jul 2024
CFinBench: A Comprehensive Chinese Financial Benchmark for Large
  Language Models
CFinBench: A Comprehensive Chinese Financial Benchmark for Large Language Models
Ying Nie
Binwei Yan
Tianyu Guo
Hao Liu
Haoyu Wang
...
Weihao Wang
Qiang Li
Weijian Sun
Yunhe Wang
Dacheng Tao
ELM
53
2
0
02 Jul 2024
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?
Can Small Language Models Learn, Unlearn, and Retain Noise Patterns?
Nicy Scaria
Silvester John Joseph Kennedy
Deepak N. Subramani
MU
19
2
0
01 Jul 2024
Previous
1234567
Next