ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2309.05463
  4. Cited By
Textbooks Are All You Need II: phi-1.5 technical report

Textbooks Are All You Need II: phi-1.5 technical report

11 September 2023
Yuan-Fang Li
Sébastien Bubeck
Ronen Eldan
Allison Del Giorno
Suriya Gunasekar
Yin Tat Lee
    ALM
    LRM
ArXivPDFHTML

Papers citing "Textbooks Are All You Need II: phi-1.5 technical report"

50 / 338 papers shown
Title
Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding
Advancing General Multimodal Capability of Vision-language Models with Pyramid-descent Visual Position Encoding
Ziyang Chen
Mingxiao Li
Z. Chen
Nan Du
Xiaolong Li
Yuexian Zou
53
0
0
19 Jan 2025
PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms
PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms
Yilong Li
Jingyu Liu
Hao Zhang
M Badri Narayanan
Utkarsh Sharma
Shuai Zhang
Pan Hu
Yijing Zeng
Jayaram Raghuram
Suman Banerjee
MQ
44
2
0
10 Jan 2025
Efficient Architectures for High Resolution Vision-Language Models
Miguel Carvalho
Bruno Martins
MLLM
VLM
45
0
0
05 Jan 2025
LLMzSz{\L}: a comprehensive LLM benchmark for Polish
LLMzSz{\L}: a comprehensive LLM benchmark for Polish
Krzysztof Jassem
Michał Ciesiółka
Filip Graliñski
Piotr Jabłoński
Jakub Pokrywka
Marek Kubis
Monika Jabłońska
Ryszard Staruch
41
1
0
04 Jan 2025
General Information Metrics for Improving AI Model Training Efficiency
Jianfeng Xu
Congcong Liu
Xiaoying Tan
Xiaojie Zhu
Anpeng Wu
...
Weijun Kong
Chun Li
Hu Xu
Kun Kuang
Fei Wu
77
0
0
02 Jan 2025
FED: Fast and Efficient Dataset Deduplication Framework with GPU Acceleration
FED: Fast and Efficient Dataset Deduplication Framework with GPU Acceleration
Youngjun Son
Chaewon Kim
Jaejin Lee
50
0
0
02 Jan 2025
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Wenqi Zhang
Hang Zhang
Xin Li
Jiashuo Sun
Yongliang Shen
Weiming Lu
Deli Zhao
Yueting Zhuang
Lidong Bing
VLM
43
2
0
01 Jan 2025
Hansel: Output Length Controlling Framework for Large Language Models
Hansel: Output Length Controlling Framework for Large Language Models
Seoha Song
Junhyun Lee
Hyeonmok Ko
75
0
0
18 Dec 2024
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for
  Fast, Memory Efficient, and Long Context Finetuning and Inference
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference
Benjamin Warner
Antoine Chaffin
Benjamin Clavié
Orion Weller
Oskar Hallström
...
Tom Aarsen
Nathan Cooper
Griffin Adams
Jeremy Howard
Iacopo Poli
93
82
0
18 Dec 2024
Phi-4 Technical Report
Phi-4 Technical Report
Marah Abdin
J. Aneja
Harkirat Singh Behl
Sébastien Bubeck
Ronen Eldan
...
Rachel A. Ward
Yue Wu
Dingli Yu
Cyril Zhang
Yi Zhang
ALM
SyDa
104
88
0
12 Dec 2024
Learning to Reason via Self-Iterative Process Feedback for Small
  Language Models
Learning to Reason via Self-Iterative Process Feedback for Small Language Models
Kaiyuan Chen
Jin Wang
Xuejie Zhang
LRM
ReLM
85
2
0
11 Dec 2024
Code LLMs: A Taxonomy-based Survey
Code LLMs: A Taxonomy-based Survey
Nishat Raihan
Christian D. Newman
Marcos Zampieri
97
1
0
11 Dec 2024
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning
The Well: a Large-Scale Collection of Diverse Physics Simulations for Machine Learning
Ruben Ohana
Michael McCabe
Lucas Meyer
Rudy Morel
Fruzsina J. Agocs
...
François Rozet
Liam Parker
M. Cranmer
S. Ho
Shirley Ho
PINN
AI4CE
74
8
1
30 Nov 2024
Is Oracle Pruning the True Oracle?
Is Oracle Pruning the True Oracle?
Sicheng Feng
Keda Tao
Haoyu Wang
VLM
70
0
0
28 Nov 2024
Efficient Learning Content Retrieval with Knowledge Injection
Batuhan Sariturk
Rabia Bayraktar
Merve Elmas Erdem
83
0
0
28 Nov 2024
Towards Robust Evaluation of Unlearning in LLMs via Data Transformations
Towards Robust Evaluation of Unlearning in LLMs via Data Transformations
Abhinav Joshi
Shaswati Saha
Divyaksh Shukla
Sriram Vema
Harsh Jhamtani
Manas Gaur
Ashutosh Modi
MU
91
3
0
23 Nov 2024
Hymba: A Hybrid-head Architecture for Small Language Models
Hymba: A Hybrid-head Architecture for Small Language Models
Xin Dong
Y. Fu
Shizhe Diao
Wonmin Byeon
Zijia Chen
...
Min-Hung Chen
Yoshi Suhara
Y. Lin
Jan Kautz
Pavlo Molchanov
Mamba
102
21
0
20 Nov 2024
Training Bilingual LMs with Data Constraints in the Targeted Language
Training Bilingual LMs with Data Constraints in the Targeted Language
Skyler Seto
Maartje ter Hoeve
He Bai
Natalie Schluter
David Grangier
86
0
0
20 Nov 2024
Does Prompt Formatting Have Any Impact on LLM Performance?
Does Prompt Formatting Have Any Impact on LLM Performance?
Jia He
Mukund Rungta
David Koleczek
Arshdeep Sekhon
Franklin X Wang
Sadid Hasan
LLMAG
LRM
29
42
0
15 Nov 2024
SparrowVQE: Visual Question Explanation for Course Content Understanding
SparrowVQE: Visual Question Explanation for Course Content Understanding
Jialu Li
Manish Kumar Thota
Ruslan Gokhman
Radek Holik
Youshan Zhang
41
1
0
12 Nov 2024
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models
VTechAGP: An Academic-to-General-Audience Text Paraphrase Dataset and Benchmark Models
Ming Cheng
Jiaying Gong
Chenhan Yuan
William A. Ingram
Edward A. Fox
Hoda Eldardiry
47
0
0
07 Nov 2024
Crystal: Illuminating LLM Abilities on Language and Code
Crystal: Illuminating LLM Abilities on Language and Code
Tianhua Tao
Junbo Li
Bowen Tan
Hongyi Wang
William Marshall
...
Joel Hestness
Natalia Vassilieva
Zhiqiang Shen
Eric P. Xing
Zhengzhong Liu
52
4
0
06 Nov 2024
Extracting Unlearned Information from LLMs with Activation Steering
Extracting Unlearned Information from LLMs with Activation Steering
Atakan Seyitoğlu
A. Kuvshinov
Leo Schwinn
Stephan Günnemann
MU
LLMSV
43
3
0
04 Nov 2024
SLED: Self Logits Evolution Decoding for Improving Factuality in Large
  Language Models
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models
Jianyi Zhang
Da-Cheng Juan
Cyrus Rashtchian
Chun-Sung Ferng
Heinrich Jiang
Yiran Chen
43
4
0
01 Nov 2024
GigaCheck: Detecting LLM-generated Content
GigaCheck: Detecting LLM-generated Content
Irina Tolstykh
Aleksandra Tsybina
Sergey Yakubson
Aleksandr Gordeev
Vladimir Dokholyan
Maksim Kuprashevich
DeLMO
48
1
0
31 Oct 2024
BENCHAGENTS: Automated Benchmark Creation with Agent Interaction
BENCHAGENTS: Automated Benchmark Creation with Agent Interaction
Natasha Butt
Varun Chandrasekaran
Neel Joshi
Besmira Nushi
Vidhisha Balachandran
42
6
0
29 Oct 2024
Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate
Unlearning as multi-task optimization: A normalized gradient difference approach with an adaptive learning rate
Zhiqi Bu
Xiaomeng Jin
Bhanukiran Vinzamuri
Anil Ramakrishna
Kai-Wei Chang
V. Cevher
Mingyi Hong
MU
91
7
0
29 Oct 2024
Improving Multimodal Large Language Models Using Continual Learning
Improving Multimodal Large Language Models Using Continual Learning
Shikhar Srivastava
Md Yousuf Harun
Robik Shrestha
Christopher Kanan
KELM
VLM
CLL
33
1
0
25 Oct 2024
Process Supervision-Guided Policy Optimization for Code Generation
Process Supervision-Guided Policy Optimization for Code Generation
Ning Dai
Zheng Wu
Renjie Zheng
Ziyun Wei
Wenlei Shi
Xing Jin
Guanlin Liu
Chen Dun
Liang Huang
Lin Yan
56
8
0
23 Oct 2024
Self-calibration for Language Model Quantization and Pruning
Self-calibration for Language Model Quantization and Pruning
Miles Williams
G. Chrysostomou
Nikolaos Aletras
MQ
186
0
0
22 Oct 2024
Frontiers in Intelligent Colonoscopy
Frontiers in Intelligent Colonoscopy
Ge-Peng Ji
Jingyi Liu
Peng Xu
Nick Barnes
Fahad Shahbaz Khan
Salman Khan
Deng-Ping Fan
49
4
0
22 Oct 2024
KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication
Sahil Kumar
Deepa Paikar
Kiran Sai Vutukuri
Haider Ali
Shashidhar Reddy Ainala
Aditya Murli Krishnan
Youshan Zhang
26
1
0
21 Oct 2024
Enabling Energy-Efficient Deployment of Large Language Models on
  Memristor Crossbar: A Synergy of Large and Small
Enabling Energy-Efficient Deployment of Large Language Models on Memristor Crossbar: A Synergy of Large and Small
Zhehui Wang
Tao Luo
Cheng Liu
Weichen Liu
Rick Siow Mong Goh
Weng-Fai Wong
31
1
0
21 Oct 2024
How to Build a Pre-trained Multimodal model for Simultaneously Chatting
  and Decision-making?
How to Build a Pre-trained Multimodal model for Simultaneously Chatting and Decision-making?
Zuojin Tang
Bin-Bin Hu
Chenyang Zhao
De Ma
Gang Pan
Bin Liu
23
0
0
21 Oct 2024
MedLogic-AQA: Enhancing Medical Question Answering with Abstractive
  Models Focusing on Logical Structures
MedLogic-AQA: Enhancing Medical Question Answering with Abstractive Models Focusing on Logical Structures
Aizan Zafar
Kshitij Mishra
Asif Ekbal
28
0
0
20 Oct 2024
A Survey on Data Synthesis and Augmentation for Large Language Models
A Survey on Data Synthesis and Augmentation for Large Language Models
Ke Wang
Jiahui Zhu
Minjie Ren
Ziqiang Liu
Shiwei Li
...
Yiming Lei
Xiaoyu Wu
Qiqi Zhan
Qingjie Liu
Yunhong Wang
SyDa
42
18
0
16 Oct 2024
Table-LLM-Specialist: Language Model Specialists for Tables using
  Iterative Generator-Validator Fine-tuning
Table-LLM-Specialist: Language Model Specialists for Tables using Iterative Generator-Validator Fine-tuning
Junjie Xing
Yeye He
Mengyu Zhou
Haoyu Dong
Shi Han
Dongmei Zhang
S. Chaudhuri
LMTD
54
1
0
16 Oct 2024
Mastering the Craft of Data Synthesis for CodeLLMs
Mastering the Craft of Data Synthesis for CodeLLMs
Meng Chen
Philip Arthur
Qianyu Feng
Cong Duy Vu Hoang
Yu-Heng Hong
...
Mark Johnson
Kemal Kurniawan
Don Dharmasiri
Long Duong
Yuan-Fang Li
SyDa
60
1
0
16 Oct 2024
DISP-LLM: Dimension-Independent Structural Pruning for Large Language
  Models
DISP-LLM: Dimension-Independent Structural Pruning for Large Language Models
Shangqian Gao
Chi-Heng Lin
Ting Hua
Tang Zheng
Yilin Shen
Hongxia Jin
Yen-Chang Hsu
30
3
0
15 Oct 2024
LargePiG: Your Large Language Model is Secretly a Pointer Generator
LargePiG: Your Large Language Model is Secretly a Pointer Generator
Zhongxiang Sun
Zihua Si
Xiaoxue Zang
Kai Zheng
Yang Song
Xiao Zhang
Jun Xu
HILM
RALM
42
0
0
15 Oct 2024
LLM Unlearning via Loss Adjustment with Only Forget Data
LLM Unlearning via Loss Adjustment with Only Forget Data
Yaxuan Wang
Jiaheng Wei
Chris Liu
Jinlong Pang
Qiang Liu
A. Shah
Yujia Bao
Yang Liu
Wei Wei
KELM
MU
43
8
0
14 Oct 2024
Parameter-Efficient Fine-Tuning of Large Language Models using Semantic
  Knowledge Tuning
Parameter-Efficient Fine-Tuning of Large Language Models using Semantic Knowledge Tuning
Nusrat Jahan Prottasha
Asif Mahmud
Md. Shohanur Islam Sobuj
Prakash Bhat
Md. Kowsher
Niloofar Yousefi
O. Garibay
35
4
0
11 Oct 2024
Scalable Representation Learning for Multimodal Tabular Transactions
Scalable Representation Learning for Multimodal Tabular Transactions
Natraj Raman
Sumitra Ganesh
Manuela Veloso
LMTD
33
0
0
10 Oct 2024
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act
Philipp Guldimann
Alexander Spiridonov
Robin Staab
Nikola Jovanović
Mark Vero
...
Mislav Balunović
Nikola Konstantinov
Pavol Bielik
Petar Tsankov
Martin Vechev
ELM
53
4
0
10 Oct 2024
Exploring the Readiness of Prominent Small Language Models for the
  Democratization of Financial Literacy
Exploring the Readiness of Prominent Small Language Models for the Democratization of Financial Literacy
Tagore Rao Kosireddy
Jeffrey D. Wall
Evan Lucas
34
1
0
09 Oct 2024
Self-Boosting Large Language Models with Synthetic Preference Data
Self-Boosting Large Language Models with Synthetic Preference Data
Qingxiu Dong
Li Dong
Xingxing Zhang
Zhifang Sui
Furu Wei
SyDa
48
7
0
09 Oct 2024
Generative Model for Less-Resourced Language with 1 billion parameters
Generative Model for Less-Resourced Language with 1 billion parameters
Domen Vreš
Martin Božič
Aljaž Potočnik
Tomaž Martinčič
Marko Robnik-Šikonja
26
1
0
09 Oct 2024
CoBa: Convergence Balancer for Multitask Finetuning of Large Language
  Models
CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models
Zi Gong
Hang Yu
Cong Liao
Bingchang Liu
Chaoyu Chen
Jianguo Li
MoMe
29
4
0
09 Oct 2024
Probing Language Models on Their Knowledge Source
Probing Language Models on Their Knowledge Source
Zineddine Tighidet
Andrea Mogini
Jiali Mei
Benjamin Piwowarski
Patrick Gallinari
KELM
40
1
0
08 Oct 2024
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing
  with Language Models
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models
Ranchi Zhao
Zhen Leng Thai
Yifan Zhang
Shengding Hu
Yunqi Ba
Jie Zhou
Jie Cai
Zhiyuan Liu
Maosong Sun
41
1
0
08 Oct 2024
Previous
1234567
Next