ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Communities
  3. ...

Neighbor communities

0 / 0 papers shown
Title
Top Contributors
Name# Papers# Citations
Social Events
DateLocationEvent
  1. Home
  2. Communities
  3. ALM

Alignment for Language Models

ALM
More data

Focuses on research that actively explores methods and strategies to ensure language models' outputs align with human values, ethics, and intentions, constituting a significant portion of the paper's content.

Neighbor communities

51015

Featured Papers

0 / 0 papers shown
Title

All papers

50 / 1,717 papers shown
Title
CodeSimpleQA: Scaling Factuality in Code Large Language Models
CodeSimpleQA: Scaling Factuality in Code Large Language Models
Jian Yang
Wei Zhang
Yizhi Li
Shawn Guo
Haowen Wang
...
Ge Zhang
Zili Wang
Zhoujun Li
Xianglong Liu
Weifeng Lv
HILMALMELM
88
0
0
22 Dec 2025
SiamGPT: Quality-First Fine-Tuning for Stable Thai Text Generation
SiamGPT: Quality-First Fine-Tuning for Stable Thai Text Generation
Thittipat Pairatsuppawat
Abhibhu Tachaapornchai
Paweekorn Kusolsomboon
Chutikan Chaiwong
Thodsaporn Chay-intr
Kobkrit Viriyayudhakorn
Nongnuch Ketui
Aslan B. Wong
ALMLRM
0
0
0
22 Dec 2025
CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning
CORE: Concept-Oriented Reinforcement for Bridging the Definition-Application Gap in Mathematical Reasoning
Zijun Gao
Zhikun Xu
Xiao Ye
Ben Zhou
OffRLALMLRM
0
0
0
21 Dec 2025
DEER: A Comprehensive and Reliable Benchmark for Deep-Research Expert Reports
DEER: A Comprehensive and Reliable Benchmark for Deep-Research Expert Reports
Janghoon Han
Heegyu Kim
Changho Lee
Dahm Lee
Min Hyung Park
Hosung Song
Stanley Jungkyu Choi
Moontae Lee
Honglak Lee
ALMHILM
108
0
0
19 Dec 2025
CIFE: Code Instruction-Following Evaluation
CIFE: Code Instruction-Following Evaluation
Sravani Gunnu
Shanmukha Guttula
Hima Patel
ALMELM
76
0
0
19 Dec 2025
Are We on the Right Way to Assessing LLM-as-a-Judge?
Are We on the Right Way to Assessing LLM-as-a-Judge?
Yuanning Feng
Sinan Wang
Zhengxiang Cheng
Yao Wan
Dongping Chen
ALMELM
100
0
0
17 Dec 2025
On Assessing the Relevance of Code Reviews Authored by Generative Models
On Assessing the Relevance of Code Reviews Authored by Generative Models
Robert Heumüller
Frank Ortmeier
ALMELM
56
0
0
17 Dec 2025
Agreement Between Large Language Models and Human Raters in Essay Scoring: A Research Synthesis
Agreement Between Large Language Models and Human Raters in Essay Scoring: A Research Synthesis
Hongli Li
Che Han Chen
Kevin Fan
Chiho Young-Johnson
Soyoung Lim
Yali Feng
ALM
16
0
0
16 Dec 2025
Revisiting the Reliability of Language Models in Instruction-Following
Revisiting the Reliability of Language Models in Instruction-Following
Jianshuo Dong
Yutong Zhang
Yan Liu
Zhenyu Zhong
Tao Wei
Chao Zhang
Han Qiu
ALM
120
0
0
15 Dec 2025
The Data Efficiency Frontier of Financial Foundation Models: Scaling Laws from Continued Pretraining
The Data Efficiency Frontier of Financial Foundation Models: Scaling Laws from Continued Pretraining
Jesse Ponnock
ALM
8
0
0
13 Dec 2025
The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality
The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality
Aileen Cheng
Alon Jacovi
Amir Globerson
Ben Golan
Charles Kwong
...
Srinivasan Venkatachary
Tulsee Doshi
Yossi Matias
Sasha Goldshtein
Dipanjan Das
HILMALMKELM
192
0
0
11 Dec 2025
PACIFIC: a framework for generating benchmarks to check Precise Automatically Checked Instruction Following In Code
PACIFIC: a framework for generating benchmarks to check Precise Automatically Checked Instruction Following In Code
Itay Dreyfuss
Antonio Abu Nassar
Samuel Ackerman
Axel Ben David
Eitan Farchi
Rami Katan
Orna Raz
Marcel Zalmanovici
ALM
149
0
0
11 Dec 2025
PCMind-2.1-Kaiyuan-2B Technical Report
PCMind-2.1-Kaiyuan-2B Technical Report
Kairong Luo
Zhenbo Sun
Xinyu Shi
Shengqi Chen
Bowen Yu
...
Hengtao Tao
Hui Wang
Fangming Liu
Kaifeng Lyu
Wenguang Chen
ALMMoEVLM
156
0
0
08 Dec 2025
Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models
Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models
Chen Yang
Guangyue Peng
Jiaying Zhu
Ran Le
Ruixiang Feng
...
Yuntao Wen
Zekai Wang
Zhenwei An
Zhicong Sun
Zongchao Chen
ALM
152
0
0
06 Dec 2025
Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity
Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity
Gregory Bolet
Giorgis Georgakoudis
Konstantinos Parasyris
Harshitha Menon
Niranjan Hasabnis
Kirk W. Cameron
Gal Oren
ALMLRM
142
0
0
04 Dec 2025
SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment
SR-GRPO: Stable Rank as an Intrinsic Geometric Reward for Large Language Model Alignment
Yixuan Tang
Yi Yang
ALM
84
0
0
02 Dec 2025
PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models
PEFT-Factory: Unified Parameter-Efficient Fine-Tuning of Autoregressive Large Language Models
Robert Belanec
Ivan Srba
Maria Bielikova
ALM
292
0
0
02 Dec 2025
Financial Instruction Following Evaluation (FIFE)
Financial Instruction Following Evaluation (FIFE)
Glenn Matlin
Siddharth
Anirudh JM
Aditya Shukla
Yahya Hassan
Sudheer Chava
ALM
92
0
0
01 Dec 2025
Learned-Rule-Augmented Large Language Model Evaluators
Learned-Rule-Augmented Large Language Model Evaluators
Jie Meng
Jin Mao
ALMELMLRM
92
0
0
01 Dec 2025
RL-Struct: A Lightweight Reinforcement Learning Framework for Reliable Structured Output in LLMs
RL-Struct: A Lightweight Reinforcement Learning Framework for Reliable Structured Output in LLMs
Ruike Hu
Shulei Wu
OffRLALM
100
0
0
29 Nov 2025
Instruction Tuning of Large Language Models for Tabular Data Generation-in One Day
Instruction Tuning of Large Language Models for Tabular Data Generation-in One Day
Milad Abdollahzadeh
Abdul Raheem
Zilong Zhao
Uzair Javaid
Kevin Yee
Nalam Venkata Abhishek
Tram Truong-Huu
Biplab Sikdar
LMTDALM
155
0
0
28 Nov 2025
BRIDGE: Building Representations In Domain Guided Program Verification
BRIDGE: Building Representations In Domain Guided Program Verification
Robert Joseph George
Carson Eisenach
Udaya Ghai
Dominique C. Perrault-Joncas
A. Anandkumar
Dean Phillips Foster
ALMLRM
345
0
0
26 Nov 2025
Orthographic Constraint Satisfaction and Human Difficulty Alignment in Large Language Models
Orthographic Constraint Satisfaction and Human Difficulty Alignment in Large Language Models
Bryan Edward Tuck
Rakesh M. Verma
ALM
117
0
0
26 Nov 2025
Can Finetuing LLMs on Small Human Samples Increase Heterogeneity, Alignment, and Belief-Action Coherence?
Can Finetuing LLMs on Small Human Samples Increase Heterogeneity, Alignment, and Belief-Action Coherence?
Steven Wang
Kyle Hunt
Shaojie Tang
Kenneth Joseph
ALM
132
0
0
26 Nov 2025
Mortgage Language Model: Domain-Adaptive Pretraining with Residual Instruction, Alignment Tuning, and Task-Specific Routing
Mortgage Language Model: Domain-Adaptive Pretraining with Residual Instruction, Alignment Tuning, and Task-Specific Routing
Manish Jain
Satheesh Kumar Ponnambalam
Salman Faroz
Chandrakanth Lns
Vinay Sharma
ALM
567
0
0
26 Nov 2025
A Set of Rules for Model Validation
A Set of Rules for Model Validation
José Camacho
ALMAI4CE
301
0
0
24 Nov 2025
Reproducibility Study of Large Language Model Bayesian Optimization
Reproducibility Study of Large Language Model Bayesian Optimization
Adam Rychert
Gasper Spagnolo
Evgenii Posashkov
ALM
101
0
0
24 Nov 2025
Building Domain-Specific Small Language Models via Guided Data Generation
Building Domain-Specific Small Language Models via Guided Data Generation
Aman Kumar
Ekant Muljibhai Amin
Xian Yeow Lee
Lasitha Vidyaratne
Ahmed K. Farahat
Dipanjan Ghosh
Yuta Koreeda
Chetan Gupta
ALM
100
0
0
23 Nov 2025
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence
J. Yang
Wei Emma Zhang
Shark Liu
J. Wu
Shawn Guo
...
Zizheng Zhan
Jiajun Zhang
Jie Zhang
Zhaoxiang Zhang
Bo Zheng
LLMAGALMELM
586
0
0
23 Nov 2025
MindEval: Benchmarking Language Models on Multi-turn Mental Health Support
MindEval: Benchmarking Language Models on Multi-turn Mental Health Support
José P. Pombal
Maya DÉon
Nuno M. Guerreiro
Pedro Henrique Martins
António Farinhas
Ricardo Rei
AI4MHALMELM
428
0
0
23 Nov 2025
Evaluating Large Language Models on the 2026 Korean CSAT Mathematics Exam: Measuring Mathematical Ability in a Zero-Data-Leakage Setting
Evaluating Large Language Models on the 2026 Korean CSAT Mathematics Exam: Measuring Mathematical Ability in a Zero-Data-Leakage Setting
Goun Pyeon
Inbum Heo
Jeesu Jung
Taewook Hwang
Hyuk Namgoong
H. Seo
Yerim Han
Eunbin Kim
Hyeonseok Kang
Sangkeun Jung
ELMALMLRM
140
0
0
23 Nov 2025
Efficient Inference Using Large Language Models with Limited Human Data: Fine-Tuning then Rectification
Efficient Inference Using Large Language Models with Limited Human Data: Fine-Tuning then Rectification
Lei Wang
Zikun Ye
Jinglong Zhao
ALM
136
0
0
23 Nov 2025
SDA: Steering-Driven Distribution Alignment for Open LLMs without Fine-Tuning
Wei Xia
Zhi-Hong Deng
ALM
207
0
0
20 Nov 2025
PromptTailor: Multi-turn Intent-Aligned Prompt Synthesis for Lightweight LLMs
PromptTailor: Multi-turn Intent-Aligned Prompt Synthesis for Lightweight LLMs
Yizhou Xu
Janet Davis
ALM
97
0
0
20 Nov 2025
ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning
ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning
Hongwei Liu
J. Liu
Shudong Liu
Haodong Duan
Yuqiang Li
...
Conghui He
Qi Zhang
Songyang Zhang
Lei Bai
Kai Chen
LRMALMELM
367
0
0
18 Nov 2025
Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection
Scaling Generative Verifiers For Natural Language Mathematical Proof Verification And Selection
Sadegh Mahdavi
Branislav Kisacanin
Shubham Toshniwal
Wei Du
Ivan Moshkov
George Armstrong
Renjie Liao
Christos Thrampoulidis
Igor Gitman
ALMLRM
201
2
0
17 Nov 2025
Spark-Prover-X1: Formal Theorem Proving Through Diverse Data Training
Spark-Prover-X1: Formal Theorem Proving Through Diverse Data Training
Xinyuan Zhou
Yi Lei
Xiaoyu Zhou
Jingyi Sun
Yu Zhu
Zhongyi Ye
Weitai Zhang
Quan Liu
Si Wei
Cong Liu
ALMLRM
239
0
0
17 Nov 2025
Group-Aware Reinforcement Learning for Output Diversity in Large Language Models
Group-Aware Reinforcement Learning for Output Diversity in Large Language Models
Oron Anschel
Alon Shoshan
Adam Botach
Shunit Haviv Hakimi
Asaf Gendler
Emanuel Ben-Baruch
Nadav Bhonker
Igor Kviatkovsky
Manoj Aggarwal
Gérard Medioni
ALM
348
1
0
16 Nov 2025
LaoBench: A Large-Scale Multidimensional Lao Benchmark for Large Language Models
LaoBench: A Large-Scale Multidimensional Lao Benchmark for Large Language Models
Jian Gao
Richeng Xuan
Zhaolu Kang
Dingshi Liao
Wenxin Huang
...
Yangdi Xu
Bowen Qin
Zheqi He
Xi Yang
Changjin Li
ALMELM
383
0
0
14 Nov 2025
MACEval: A Multi-Agent Continual Evaluation Network for Large Models
MACEval: A Multi-Agent Continual Evaluation Network for Large Models
Z. Chen
Yuze Sun
Yuan Tian
Wenjun Zhang
Guangtao Zhai
ALMELM
169
0
0
12 Nov 2025
Design, Results and Industry Implications of the World's First Insurance Large Language Model Evaluation Benchmark
Design, Results and Industry Implications of the World's First Insurance Large Language Model Evaluation Benchmark
Hua Zhou
Bing Ma
Yufei Zhang
Yi Zhao
ALMELM
246
0
0
11 Nov 2025
AlignSurvey: A Comprehensive Benchmark for Human Preferences Alignment in Social Surveys
AlignSurvey: A Comprehensive Benchmark for Human Preferences Alignment in Social Surveys
Chenxi Lin
Weikang Yuan
Zhuoren Jiang
Biao Huang
Ruitao Zhang
Jianan Ge
Yueqian Xu
Jianxing Yu
ALM
461
0
0
11 Nov 2025
RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services
RedOne 2.0: Rethinking Domain-specific LLM Post-Training in Social Networking Services
Fei Zhao
Chonggang Lu
Haofu Qian
Fangcheng Shi
Zijie Meng
...
Zheyong Xie
Zheyu Ye
Zhe Xu
Yao Hu
Shaosheng Cao
ALM
159
0
0
10 Nov 2025
LPFQA: A Long-Tail Professional Forum-based Benchmark for LLM Evaluation
LPFQA: A Long-Tail Professional Forum-based Benchmark for LLM Evaluation
Liya Zhu
Peizhuang Cong
Aowei Ji
Wenya Wu
Jiani Hou
...
Jingzhe Ding
Tong Yang
Z. Wang
Ge Zhang
Wenhao Huang
ALMELM
445
0
0
09 Nov 2025
Where Do LLMs Still Struggle? An In-Depth Analysis of Code Generation Benchmarks
Where Do LLMs Still Struggle? An In-Depth Analysis of Code Generation Benchmarks
Amir Molzam Sharifloo
Maedeh Heydari
Parsa Kazerooni
Daniel Maninger
Mira Mezini
ALM
200
0
0
06 Nov 2025
One Battle After Another: Probing LLMs' Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework
One Battle After Another: Probing LLMs' Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework
Qi Jia
Kaiwei Zhang
Xiujie Song
Ye Shen
Xiangyang Zhu
Guangtao Zhai
Xiangyang Zhu
Guangtao Zhai
ALM
196
0
0
05 Nov 2025
Large language models require a new form of oversight: capability-based monitoring
Large language models require a new form of oversight: capability-based monitoring
Katherine C. Kellogg
Bingyang Ye
Yifan Hu
G. Savova
Byron Wallace
Danielle S. Bitterman
ALMELMAI4MH
229
1
0
05 Nov 2025
GRDD+: An Extended Greek Dialectal Dataset with Cross-Architecture Fine-tuning Evaluation
GRDD+: An Extended Greek Dialectal Dataset with Cross-Architecture Fine-tuning Evaluation
S. Chatzikyriakidis
Dimitris Papadakis
Sevasti-Ioanna Papaioannou
Erofili Psaltaki
ALM
185
0
0
05 Nov 2025
Targeted Error Correction in Knowledge Distillation: Small Language Models Surpass GPT
Targeted Error Correction in Knowledge Distillation: Small Language Models Surpass GPT
Hee-Jin Lee
Zhen Guo
Luchao Jin
Morteza Moazami Goudarzi
KELMALM
196
0
0
04 Nov 2025
The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models
The ORCA Benchmark: Evaluating Real-World Calculation Accuracy in Large Language Models
Claudia Herambourg
Dawid Siuda
Julia Kopczyńska
Joao R. L. Santos
Wojciech Sas
Joanna Śmietańska-Nowak
ELMALMLRM
310
0
0
04 Nov 2025
Loading #Papers per Month with "ALM"
Past speakers
Name (-)
Top Contributors
Name (-)
Top Organizations at ResearchTrend.AI
Name (-)
Social Events
DateLocationEvent
No social events available