All Papers
50 / 644,079 papers shown
From Data Leak to Secret Misses: The Impact of Data Leakage on Secret Detection Models
Farnaz Soltaniani
Mohammad Ghafari
PILM
ELM
10
0
0
30 Jan 2026
Visual Personalization Turing Test
Rameen Abdal
James Burgess
Sergey Tulyakov
Kuan-Chieh Jackson Wang
EGVM
ViT
11
0
0
30 Jan 2026
Make Anything Match Your Target: Universal Adversarial Perturbations against Closed-Source MLLMs via Multi-Crop Routed Meta Optimization
Hui Lu
Yi Yu
Yiming Yang
Chenyu Yi
Xueyi Ke
Qixing Zhang
Bingquan Shen
Alex Kot
Xudong Jiang
AAML
0
0
0
30 Jan 2026
THINKSAFE: Self-Generated Safety Alignment for Reasoning Models
Seanie Lee
Sangwoo Park
Yumin Choi
Gyeongman Kim
Minki Kang
Jihun Yun
Dongmin Park
Jongho Park
Sung Ju Hwang
OffRL
LRM
0
0
0
30 Jan 2026
DAVIS: OOD Detection via Dominant Activations and Variance for Increased Separation
Abid Hassan
Tuan Ngo
Saad Shafiq
Nenad Medvidovic
OODD
11
0
0
30 Jan 2026
EvoClinician: A Self-Evolving Agent for Multi-Turn Medical Diagnosis via Test-Time Evolutionary Learning
Yufei He
Juncheng Liu
Zhiyuan Hu
Yulin Chen
Yue Liu
...
Nuo Chen
Jun Hu
Bryan Hooi
Xinxing Xu
Jiang Bian
0
0
0
30 Jan 2026
Conditional Performance Guarantee for Large Reasoning Models
Jianguo Huang
Hao Zeng
Bingyi Jing
Hongxin Wei
Bo An
LRM
0
0
0
30 Jan 2026
AutoRefine: From Trajectories to Reusable Expertise for Continual LLM Agent Refinement
Libin Qiu
Zhirong Gao
Junfu Chen
Yuhang Ye
Weizhi Huang
Xiaobo Xue
Wenkai Qiu
Shuo Tang
0
0
0
30 Jan 2026
Real-Time Aligned Reward Model beyond Semantics
Zixuan Huang
Xin Xia
Yuxi Ren
Jianbin Zheng
Xuefeng Xiao
...
Zhongxiang Dai
Fuzhen Zhuang
Jianxin Li
Yikun Ban
Deqing Wang
0
0
0
30 Jan 2026
Test-Time Mixture of World Models for Embodied Agents in Dynamic Environments
Jinwoo Jang
Minjong Yoo
Sihyung Yoon
Honguk Woo
MoE
TTA
11
0
0
30 Jan 2026
Full-Graph vs. Mini-Batch Training: Comprehensive Analysis from a Batch Size and Fan-Out Size Perspective
Mengfan Liu
Da Zheng
Junwei Su
Chuan Wu
GNN
11
0
0
30 Jan 2026
ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought
Fanmeng Wang
Haotian Liu
Guojiang Zhao
Hongteng Xu
Zhifeng Gao
BDL
OffRL
LRM
0
0
0
30 Jan 2026
Probing the Trajectories of Reasoning Traces in Large Language Models
Marthe Ballon
Brecht Verbeken
Vincent Ginis
Andres Algaba
LRM
0
0
0
30 Jan 2026
RAudit: A Blind Auditing Protocol for Large Language Model Reasoning
Edward Y. Chang
Longling Geng
LRM
0
0
0
30 Jan 2026
Chain-of-thought obfuscation learned from output supervision can generalise to unseen tasks
Nathaniel Mitrani Hadida
Sassan Bhanji
Cameron Tice
Puria Radmard
LRM
0
0
0
30 Jan 2026
Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning
Siyu Gong
Linan Yue
Weibo Gao
Fangzhou Yao
Shimin Di
Lei Feng
Min-Ling Zhang
LRM
0
0
0
30 Jan 2026
DiffuSpeech: Silent Thought, Spoken Answer via Unified Speech-Text Diffusion
Yuxuan Lou
Ziming Wu
Yaochen Wang
Yong Liu
Yingxuan Ren
Fuming Lai
Shaobing Lian
Jie Tang
Yang You
DiffM
LRM
0
0
0
30 Jan 2026
MTDrive: Multi-turn Interactive Reinforcement Learning for Autonomous Driving
Xidong Li
Mingyu Guo
Chenchao Xu
Bailin Li
Wenjing Zhu
Yangang Zou
Rui Chen
Zehuan Wang
LRM
0
0
0
30 Jan 2026
Do Transformers Have the Ability for Periodicity Generalization?
Huanyu Liu
Ge Li
Yihong Dong
Sihan Wu
Peixu Wang
Sihao Cheng
Taozhi Chen
Kechi Zhang
Hao Zhu
Tongxuan Liu
LRM
0
0
0
30 Jan 2026
Rethinking LLM-as-a-Judge: Representation-as-a-Judge with Small Language Models via Semantic Capacity Asymmetry
Zhuochun Li
Yong Zhang
Ming Li
Yuelyu Ji
Yiming Zeng
...
Yun Zhu
Yanmeng Wang
Shaojun Wang
Jing Xiao
Daqing He
ELM
LRM
0
0
0
30 Jan 2026
EntroCut: Entropy-Guided Adaptive Truncation for Efficient Chain-of-Thought Reasoning in Small-scale Large Reasoning Models
Hongxi Yan
Qingjie Liu
Yunhong Wang
LRM
0
0
0
30 Jan 2026
Transform-Augmented GRPO Improves Pass@k
Khiem Le
Youssef Mroueh
Phuc Nguyen
Chi-Heng Lin
Shangqian Gao
Ting Hua
Nitesh V. Chawla
LRM
0
0
0
30 Jan 2026
PlatoLTL: Learning to Generalize Across Symbols in LTL Instructions for Multi-Task RL
Jacques Cloete
Mathias Jackermeier
Ioannis Havoutis
Alessandro Abate
OffRL
LRM
0
0
0
30 Jan 2026
HeaPA: Difficulty-Aware Heap Sampling and On-Policy Query Augmentation for LLM Reinforcement Learning
Weiqi Wang
Xin Liu
Binxuan Huang
Hejie Cui
Rongzhi Zhang
...
Yifan Gao
Priyanka Nigam
Bing Yin
Lihong Li
Yangqiu Song
OffRL
LRM
0
0
0
30 Jan 2026
Deep Search with Hierarchical Meta-Cognitive Monitoring Inspired by Cognitive Neuroscience
Zhongxiang Sun
Qipeng Wang
Weijie Yu
Jingxuan Yang
Haolang Lu
Jun Xu
LRM
0
0
0
30 Jan 2026
VideoGPA: Distilling Geometry Priors for 3D-Consistent Video Generation
Hongyang Du
Junjie Ye
Xiaoyan Cong
Runhao Li
Jingcheng Ni
Aman Agarwal
Zeqi Zhou
Zekun Li
Randall Balestriero
Yue Wang
DiffM
VGen
0
0
0
30 Jan 2026
Decoupled Diffusion Sampling for Inverse Problems on Function Spaces
Thomas Y.L. Lin
Jiachen Yao
Lufang Chiang
Julius Berner
Anima Anandkumar
DiffM
0
0
0
30 Jan 2026
Denoising the Deep Sky: Physics-Based CCD Noise Formation for Astronomical Imaging
Shuhong Liu
Xining Ge
Ziying Gu
Lin Gu
Ziteng Cui
Xuangeng Chu
Jun Liu
Dong Li
Tatsuya Harada
DiffM
0
0
0
30 Jan 2026
FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation
Siyang He
Qiqi Wang
Xiaoran Liu
Hongnan Ma
Yiwei Shi
...
Ying Zhu
Tianyi Liang
Zengfeng Huang
Ziwei He
Xipeng Qiu
DiffM
0
0
0
30 Jan 2026
Relaxing Positional Alignment in Masked Diffusion Language Models
Mengyu Ye
Ryosuke Takahashi
Keito Kudo
Jun Suzuki
DiffM
0
0
0
30 Jan 2026
Learn More with Less: Uncertainty Consistency Guided Query Selection for RLVR
Hao Yi
Yulan Hu
Xin Li
Sheng Ouyang
Lizhong Ding
Yong Liu
OffRL
0
0
0
30 Jan 2026
Decoding in Geometry: Alleviating Embedding-Space Crowding for Complex Reasoning
Yixin Yang
Qingxiu Dong
Zhifang Sui
LRM
0
0
0
30 Jan 2026
Anytime Safe PAC Efficient Reasoning
Chengyao Yu
Hao Zeng
Youxin Zhu
Jianguo Huang
Huajun Zeng
Bingyi Jing
LRM
0
0
0
30 Jan 2026
Neural Clothing Tryer: Customized Virtual Try-On via Semantic Enhancement and Controlling Diffusion Model
Zhijing Yang
Weiwei Zhang
Mingliang Yang
Siyuan Peng
Yukai Shi
Junpeng Tan
Tianshui Chen
Liruo Zhong
DiffM
0
0
0
30 Jan 2026
Time-Annealed Perturbation Sampling: Diverse Generation for Diffusion Language Models
Jingxuan Wu
Zhenglin Wan
Xingrui Yu
Yuzhe Yang
Yiqiao Huang
Ivor Tsang
Yang You
DiffM
0
0
0
30 Jan 2026
MIRRORTALK: Forging Personalized Avatars Via Disentangled Style and Hierarchical Motion Control
Renjie Lu
Xulong Zhang
Xiaoyang Qu
Jianzong Wang
Shangfei Wang
DiffM
0
0
0
30 Jan 2026
ScribbleSense: Generative Scribble-Based Texture Editing with Intent Prediction
Yudi Zhang
Yeming Geng
Lei Zhang
DiffM
0
0
0
30 Jan 2026
Sequence Diffusion Model for Temporal Link Prediction in Continuous-Time Dynamic Graph
Nguyen Minh Duc
Viet Cuong Ta
DiffM
0
0
0
30 Jan 2026
Weak Diffusion Priors Can Still Achieve Strong Inverse-Problem Performance
Jing Jia
Wei Yuan
Sifan Liu
Liyue Shen
Guanyang Wang
DiffM
0
0
0
30 Jan 2026
Beauty and the Beast: Imperceptible Perturbations Against Diffusion-Based Face Swapping via Directional Attribute Editing
Yilong Huang
Songze Li
DiffM
AAML
0
0
0
30 Jan 2026
Safer Policy Compliance with Dynamic Epistemic Fallback
Joseph Marvin Imperial
Harish Tayyar Madabushi
AAML
0
0
0
30 Jan 2026
Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models
Ye Yu
Haibo Jin
Yaoning Yu
Jun Zhuang
Haohan Wang
AuLLM
AAML
0
0
0
30 Jan 2026
ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search
Tao Yu
Haopeng Jin
Hao Wang
Shenghua Chai
Yujia Yang
...
Cheng Zhong
Xiao Ma
Zhang Zhang
Yan Huang
Liang Wang
DiffM
VGen
11
0
0
30 Jan 2026
AEGIS: White-Box Attack Path Generation using LLMs and Training Effectiveness Evaluation for Large-Scale Cyber Defence Exercises
Ivan K. Tung
Yu Xiang Shi
Alex Chien
Wenkai Liu
Lawrence Zheng
AAML
0
0
0
30 Jan 2026
Statistical Estimation of Adversarial Risk in Large Language Models under Best-of-N Sampling
Mingqian Feng
Xiaodong Liu
Weiwei Yang
Chenliang Xu
Christopher White
Jianfeng Gao
AAML
0
0
0
30 Jan 2026
Is Training Necessary for Anomaly Detection?
Xingwu Zhang
Guanxuan Li
Paul Henderson
Gerardo Aragon-Camarasa
Zijun Long
AAML
0
0
0
30 Jan 2026
Whispers of Wealth: Red-Teaming Google's Agent Payments Protocol via Prompt Injection
Tanusree Debi
Wentian Zhu
AAML
0
0
0
30 Jan 2026
FedDis: A Causal Disentanglement Framework for Federated Traffic Prediction
Chengyang Zhou
Zijian Zhang
Chunxu Zhang
Hao Miao
Yulin Zhang
Kedi Lyu
Juncheng Hu
FedML
11
0
0
30 Jan 2026
On Safer Reinforcement Learning Policies for Sedation and Analgesia in Intensive Care
Joel Romero-Hernandez
Oscar Camara
OffRL
0
0
0
30 Jan 2026
Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning
Xinchen Han
Qiuyang Fang
Hossam Afifi
Michel Marot
OffRL
0
0
0
30 Jan 2026