Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
All Papers
0 / 0 papers shown
Home
Papers
All Papers
50 / 647,316 papers shown
All Types
Date Range
Most recent
Dataset Distillation via Relative Distribution Matching and Cognitive Heritage
Qianxin Xia
Jiawei Du
Yuhan Zhang
Jielei Wang
Guoming Lu
DD
7
0
0
05 Feb 2026
Codified Finite-state Machines for Role-playing
Letian Peng
Yupeng Hou
Kun Zhou
Jingbo Shang
AI4CE
7
0
0
05 Feb 2026
Among Us: Measuring and Mitigating Malicious Contributions in Model Collaboration Systems
Ziyuan Yang
Wenxuan Ding
Shangbin Feng
Yulia Tsvetkov
AAML
7
0
0
05 Feb 2026
Advancing Opinion Dynamics Modeling with Neural Diffusion-Convection-Reaction Equation
Chenghua Gong
Yihang Jiang
Hao Li
Rui Sun
Juyuan Zhang
Tianjun Gu
Liming Pan
Linyuan Lü
DiffM
AI4CE
11
0
0
05 Feb 2026
Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration
Chuangtao Ma
Zeyu Zhang
Arijit Khan
Sebastian Schelter
Paul Groth
3DV
14
0
0
05 Feb 2026
ALIVE: Awakening LLM Reasoning via Adversarial Learning and Instructive Verbal Evaluation
Yiwen Duan
Jing Ye
Xinpei Zhao
OffRL
LRM
10
0
0
05 Feb 2026
DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training
Dingwei Zhu
Zhiheng Xi
Shihan Dou
Jiahan Li
Chenhao Huang
...
Yuran Wang
Tao Gui
Xipeng Qiu
Qi Zhang
Xuanjing Huang
OffRL
10
0
0
05 Feb 2026
ProAct: Agentic Lookahead in Interactive Environments
Yangbin Yu
Mingyu Yang
Junyou Li
Yiming Gao
Feiyu Liu
...
Jiafei Lyu
Yicheng Liu
Zhicong Lu
Deheng Ye
Jie Jiang
VLM
LRM
6
0
0
05 Feb 2026
Causal Inference on Stopped Random Walks in Online Advertising
Jia Yuan Yu
CML
15
0
0
05 Feb 2026
How Controlling the Variance can Improve Training Stability of Sparsely Activated DNNs and CNNs
Emily Dent
Jared Tanner
BDL
13
0
0
05 Feb 2026
Monte Carlo Rendering to Diffusion Curves with Differential BEM
Ryusuke Sugimoto
Christopher Batty
Siddhartha Chaudhuri
Iliyan Georgiev
Toshiya Hachisuka
Kevin Wampler
Michal Lukáč
DiffM
10
0
0
05 Feb 2026
Explainable Pathomics Feature Visualization via Correlation-aware Conditional Feature Editing
Yuechen Yang
Junlin Guo
Ruining Deng
Junchao Zhu
Zhengyi Lu
...
Xingyi Guo
Yu Wang
Shilin Zhao
Haichun Yang
Yuankai Huo
MedIm
6
0
0
05 Feb 2026
Consistency-Preserving Concept Erasure via Unsafe-Safe Pairing and Directional Fisher-weighted Adaptation
Yongwoo Kim
Sungmin Cha
Hyunsoo Kim
Jaewon Lee
Donghyun Kim
DiffM
7
0
0
05 Feb 2026
Rewards as Labels: Revisiting RLVR from a Classification Perspective
Zepeng Zhai
Meilin Chen
Jiaxuan Zhao
Junlang Qian
Lei Shen
Yuan Lu
OffRL
7
0
0
05 Feb 2026
Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models
Shuo Nie
Hexuan Deng
Chao Wang
Ruiyu Fang
Xuebo Liu
Shuangyong Song
Yu Li
Min Zhang
Xuelong Li
ReLM
HILM
LRM
13
0
0
05 Feb 2026
Focus-Scan-Refine: From Human Visual Perception to Efficient Visual Token Pruning
Enwei Tong
Yuanchao Bai
Yao Zhu
Junjun Jiang
Xianming Liu
VLM
10
0
0
05 Feb 2026
Price of universality in vector quantization is at most 0.11 bit
Alina Harbuzova
Or Ordentlich
Yury Polyanskiy
MQ
2
0
0
05 Feb 2026
Projected Boosting with Fairness Constraints: Quantifying the Cost of Fair Training Distributions
Amir Asiaee
Kaveh Aryan
FedML
6
0
0
05 Feb 2026
Wave-Trainer-Fit: Neural Vocoder with Trainable Prior and Fixed-Point Iteration towards High-Quality Speech Generation from SSL features
Hien Ohnaka
Yuma Shirahata
Masaya Kawamura
DiffM
10
0
0
05 Feb 2026
SSG: Scaled Spatial Guidance for Multi-Scale Visual Autoregressive Generation
Youngwoo Shin
Jiwan Hur
Junmo Kim
DiffM
10
0
0
05 Feb 2026
Dual-Representation Image Compression at Ultra-Low Bitrates via Explicit Semantics and Implicit Textures
Chuqin Zhou
Xiaoyue Ling
Yunuo Chen
Jincheng Dai
Guo Lu
Wenjun Zhang
DiffM
7
0
0
05 Feb 2026
Robust Federated Learning via Byzantine Filtering over Encrypted Updates
Adda Akram Bendoukha
Aymen Boudguiga
Nesrine Kaaniche
Renaud Sirdey
Didem Demirag
Sébastien Gambs
FedML
9
0
0
05 Feb 2026
Cross-Domain Offline Policy Adaptation via Selective Transition Correction
Mengbei Yan
Jiafei Lyu
Shengjie Sun
Zhongjian Qiao
Jingwen Yang
Zichuan Lin
Deheng Ye
Xiu Li
OffRL
6
0
0
05 Feb 2026
Private Prediction via Shrinkage
Chao Yan
FedML
8
0
0
05 Feb 2026
Mapper-GIN: Lightweight Structural Graph Abstraction for Corrupted 3D Point Cloud Classification
Jeongbin You
Donggun Kim
Sejun Park
Seungsang Oh
3DPC
13
0
0
05 Feb 2026
FedRandom: Sampling Consistent and Accurate Contribution Values in Federated Learning
Arno Geimer
Beltran Fiz Pontiveros
Radu State
FedML
15
0
0
05 Feb 2026
UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents
Han Xiao
Guozhi Wang
Hao Wang
Shilong Liu
Yuxiang Chai
Yue Pan
Yufeng Zhou
Xiaoxin Chen
Yafei Wen
Hongsheng Li
OffRL
OnRL
14
0
0
05 Feb 2026
AgenticTagger: Structured Item Representation for Recommendation with LLM Agents
Zhouhang Xie
Bo Peng
Zhankui He
Ziqi Chen
Alice Han
...
Julian McAuley
Wang-Cheng Kang
Derek Zhiyuan Cheng
Beidou Wang
Randolph Brown
3DV
9
0
0
05 Feb 2026
Bayesian Neighborhood Adaptation for Graph Neural Networks
Paribesh Regmi
Rui Li
Kishan K C
BDL
GNN
18
0
0
05 Feb 2026
CFRecs: Counterfactual Recommendations on Real Estate User Listing Interaction Graphs
Seyedmasoud Mousavi
Ruomeng Xu
Xiaojing Zhu
CML
9
0
0
05 Feb 2026
Joint Embedding Variational Bayes
Amin Oji
Paul Fieguth
BDL
DRL
18
0
0
05 Feb 2026
Emulating Aggregate Human Choice Behavior and Biases with GPT Conversational Agents
Stephen Pilli
Vivek Nallur
AI4CE
9
0
0
05 Feb 2026
RFM-Pose:Reinforcement-Guided Flow Matching for Fast Category-Level 6D Pose Estimation
Diya He
Qingchen Liu
Cong Zhang
Jiahu Qin
DiffM
7
0
0
05 Feb 2026
Visualizing the loss landscapes of physics-informed neural networks
Conor Rowan
Finn Murphy-Blanchard
AI4CE
6
0
0
05 Feb 2026
CLIP-Map: Structured Matrix Mapping for Parameter-Efficient CLIP Compression
Kangjie Zhang
Wenxuan Huang
Xin Zhou
Boxiang Zhou
Dejia Song
...
Lizhuang Ma
Nemo Chen
Xu Tang
Yao Hu
Shaohui Lin
CLIP
VLM
6
0
0
05 Feb 2026
RL-VLA
3
^3
3
: Reinforcement Learning VLA Accelerating via Full Asynchronism
Zhong Guan
Haoran Sun
Yongjian Guo
Shuai Di
Xiaodong Bai
...
Xiaotie Deng
Xi Xiao
Sheng Wen
Yicheng Gong
Junwu Xiong
OffRL
6
0
0
05 Feb 2026
Contour Refinement using Discrete Diffusion in Low Data Regime
Fei Yu Guan
Ian Keefe
Sophie Wilkinson
Daniel D.B. Perrakis
Steven Waslander
MedIm
7
0
0
05 Feb 2026
Principled Confidence Estimation for Deep Computed Tomography
Matteo Gätzner
Johannes Kirschner
MedIm
9
0
0
05 Feb 2026
Back to Basics: Revisiting Exploration in Reinforcement Learning for LLM Reasoning via Generative Probabilities
Pengyi Li
Elizaveta Goncharova
Andrey Kuznetsov
Ivan Oseledets
OffRL
6
0
0
05 Feb 2026
PMT Waveform Simulation and Reconstruction with Conditional Diffusion Network
Kainan Liu
Jingyu Huang
Guihong Huang
Jianyi Luo
DiffM
6
0
0
05 Feb 2026
Once Correct, Still Wrong: Counterfactual Hallucination in Multilingual Vision-Language Models
Basel Mousi
Fahim Dalvi
Shammur Chowdhury
Firoj Alam
Nadir Durrani
VLM
6
0
0
05 Feb 2026
How to Achieve the Intended Aim of Deep Clustering Now, without Deep Learning
Kai Ming Ting
Wei-Jie Xu
Hang Zhang
OOD
5
0
0
05 Feb 2026
EBPO: Empirical Bayes Shrinkage for Stabilizing Group-Relative Policy Optimization
Kevin Han
Yuhang Zhou
Mingze Gao
Gedi Zhou
Serena Li
Abhishek Kumar
Xiangjun Fan
Weiwei Li
Lizhu Zhang
OffRL
6
0
0
05 Feb 2026
A Guide to Large Language Models in Modeling and Simulation: From Core Techniques to Critical Challenges
Philippe J. Giabbanelli
OffRL
6
0
0
05 Feb 2026
ARCHI-TTS: A flow-matching-based Text-to-Speech Model with Self-supervised Semantic Aligner and Accelerated Inference
Chunyat Wu
Jiajun Deng
Zhengxi Liu
Zheqi Dai
Haolin He
Qiuqiang Kong
DiffM
6
0
0
05 Feb 2026
LOBSTgER-enhance: an underwater image enhancement pipeline
Andreas Mentzelopoulos
Keith Ellenbogen
DiffM
6
0
0
05 Feb 2026
Variance Reduction Based Experience Replay for Policy Optimization
Hua Zheng
Wei Xie
M. Ben Feng
Keilung Choy
OffRL
6
0
0
05 Feb 2026
Disentangled Representation Learning via Flow Matching
Jinjin Chi
Taoping Liu
Mengtao Yin
Ximing Li
Yongcheng Jing
Dacheng Tao
DRL
DiffM
OOD
CoGe
13
0
0
05 Feb 2026
Distributional Reinforcement Learning with Diffusion Bridge Critics
Shutong Ding
Yimiao Zhou
Ke Hu
Mokai Pan
Shan Zhong
Yanwei Fu
Jingya Wang
Ye Shi
OffRL
6
0
0
05 Feb 2026
Weaver: End-to-End Agentic System Training for Video Interleaved Reasoning
Yudi Shi
Shangzhe Di
Qirui Chen
Qinian Wang
Jiayin Cai
Xiaolong Jiang
Yao Hu
Weidi Xie
LRM
11
0
0
05 Feb 2026
1
2
3
4
...
12945
12946
12947
Next
Page 1 of 12947
Page
of 12947
Go