arXiv cs.LG papers June 2 2026

50 items · 1 source · updated 15d ago · trend 0

This feed collects 50 papers posted to arXiv's cs.LG category on June 2, 2026. Topics include quantization and compression of mixture-of-experts models, concept bottleneck models for explainability, speculative decoding optimization, a survey of world models, reinforcement learning theory, and practical agentic system design. Key technical contributions include BitsMoE for efficient MoE quantization, DAStatFormer for distributed acoustic sensing, GEM for concept erasure in Rectified Flows, and theoretical work on transformer-based tree search in RL.

BitsMoE: Efficient Spectral Energy-Guided Bit Allocation for MoE LLM Quantization
arXiv cs.LG · Jiayu Zhao, Zihan Teng, Minhao Fan, Tianrui Ma, Wentao Ren, Song Chen, Weichen Liu · 15d
DAStatFormer: A Hybrid Multibranch Transformer with Statistical Feature Integration for DAS-Based Pattern Recognitions
arXiv cs.LG · Michel Dione (CERI SN - IMT Nord Europe), Jerry Lonlac (CERI SN - IMT Nord Europe), Hélène Louis (CERI SN - IMT Nord Europe), Anthony Fleury (CERI SN - IMT Nord Europe), Stephane Lecoeuche · 15d
Hoeffding Concept Bottleneck Models with Applications to Overhead Images
arXiv cs.LG · Clément Bénard, Manon Arfib, Christophe Labreuche, Victor Quétu · 15d
From Demonstrations to Rewards: Test-Time Prompt Optimization for VLM Reward Models
arXiv cs.LG · Christian Gumbsch, Leonardo Barcellona, Lennard Schünemann, Platon Karageorgis, Andrii Zadaianchuk, Zehao Wang, Sergey Zakharov, Fabien Despinoy, Rahaf Aljundi, Efstratios Gavves · 15d
A Shared Valence Axis Across Modern LLMs and Human EEG: The Saturation Regularity
arXiv cs.LG · Yousef A. Radwan, Xuhui Liu, Kilichbek Haydarov, Yuqian Fu, Mohamed Elhoseiny · 15d
Automatically Differentiable Nonlinear Tensor Networks (ADNTNs) for Exponential Compression of Deep Neural Networks
arXiv cs.LG · Andrzej Cichocki, Michal Wietczak · 15d
Foundation-Preserving Adaptation via Generalized Rayleigh-Quotient Optimization
arXiv cs.LG · Dongjun Kim, Adrian de Wynter, Huancheng Chen, Heasung Kim, Haris Vikalo · 15d
World Models: A Comprehensive Survey of Architectures, Methodologies, Reasoning Paradigms, and Applications
arXiv cs.LG · Arif Hassan Zidan, Yi Pan, Hanqi Jiang, Ruiyu Yan, Wei Ruan, Zihao Wu, Lifeng Chen, Weihang You, Xinliang Li, Bowen Chen, Huawen Hu, Peilong Wang, Sizhuang Liu, Jing Zhang, Siyuan Li, Zhengliang Liu, Yu Bao, Lin Zhao, Lichao Sun, Dajiang Zhu, Xiang Li, Jinglei Lv, Quanzheng Li, Wei Liu, Tianming Liu, Wei Zhang · 15d
On Effectiveness and Efficiency of Agentic Tool-calling and RL Training
arXiv cs.LG · Tong Liu, Cheng Qian, Matej Cief, Yuan He, Daniele Dan, Nikolaos Aletras, Gabriella Kazai · 15d
Generative AI and Digital Ecosystem Resilience: A Proactive Lifecycle-Based Survey
arXiv cs.LG · Jonghyun Chung, Rishabh Chaddha, Sanket Badhe, Debanshu Das, Nathan Huang, Amanpreet Kaur · 15d
Geometric Erasure by Contrastive Velocity Matching in Rectified Flows
arXiv cs.LG · Jonas Henry Grebe, Tobias Braun, Anna Rohrbach, Marcus Rohrbach · 15d
Adaptive data selection improves wearable prediction under low baseline performance
arXiv cs.LG · Ali Kargarandehkordi · 15d
BudgetDraft: Acceptance-Aware Multi-View Training for Sparse-KV Speculative Decoding
arXiv cs.LG · Liang He, Jingbo Wen, Qishi Zhan, Yixiong Chen, Kangning Cui, Qizhen Lan, Xilu Wang · 15d
RAFT: Data Refinement and Adaptive Distillation for Domain Fine-Tuning with Alleviated Forgetting
arXiv cs.LG · Yuduo Li, Xiaofeng Shi, Qian Kou, Longbin Yu, Hua Zhou · 15d
Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
arXiv cs.LG · Soichiro Nishimori, Paavo Parmas, Sotetsu Koyamada, Tadashi Kozuno, Toshinori Kitamura, Shin Ishii, Yutaka Matsuo · 15d
ChurnNet: A Optimized Modern AI for Churn Prediction
arXiv cs.LG · Syed Saad Saif, Giulio Maggiore, Paolo Russo, Damiano Distante · 15d
Beyond Augmentation: Score-Guided Pathological Prior for EEG-based Depression Detection
arXiv cs.LG · Xiaojing Chen, Jingqi Cheng, Xu Zhao, Wan Jiang, Jingjing Wu · 15d
Agentic Transformers Provably Learn to Search via Reinforcement Learning
arXiv cs.LG · Tong Yang, Yu Huang, Yingbin Liang, Yuejie Chi · 15d
AI-Guided Design and Optimization of Graphite-Based Anodes via Iterative Experimental Feedback
arXiv cs.LG · Qian Du, Mark M. Sullivan, James E. Saal, Florian Huber · 15d
Learning to Construct Practical Agentic Systems
arXiv cs.LG · Aditya Kumar, Zhihan Lei, Jerry Yan, Joshua W. Momo, Lauhitya Reddy, Rafael Enrique Cabrera Jimenez, Cassandra A. Cohen, Arthur Kajiyama, William W. Cohen · 15d
BAGEN: Are LLM Agents Budget-Aware?
arXiv cs.LG · Yuxiang Lin, Zihan Wang, Mengyang Liu, Yuxuan Shan, Longju Bai, Junyao Zhang, Xing Jin, Boshan Chen, Jinyan Su, Xingyao Wang, Jiaxin Pei, Manling Li · 15d
From Rashomon Theory to PRAXIS: Efficient Decision Tree Rashomon Sets
arXiv cs.LG · Zakk Heile, Hayden McTavish, Varun Babbar, Margo Seltzer, Cynthia Rudin · 15d
Quantized Reasoning Models Think They Need to Think Longer, but They Do Not
arXiv cs.LG · Sanae Lotfi, Polina Kirichenko, Steven Li, Zechun Liu · 15d
LithoGRPO: Fast Inverse Lithography via GRPO Reinforced Flow Matching
arXiv cs.LG · Yao Lai, Xuyuan Xiong, Zeyue Xue, Guojin Chen, Jing Wang, Xihui Liu, Rui Zhang, Robert Mullins, Bei Yu, Ping Luo · 15d
A Pre-Training Analogue of Grokking in Language Models: Tracing Delayed Grammatical Generalization
arXiv cs.LG · Sherin Muckatira, Namrata Shivagunde, Vijeta Deshpande, Anna Rumshisky · 15d
InfoAtlas: A Foundation Model for Zero-Shot Statistical Dependence Estimate
arXiv cs.LG · Zhengyang Hu, Yanzhi Chen, Hanxiang Ren, Qunsong Zeng, Youyi Zheng, Adrian Weller, Kaibin Huang, Yanchao Yang · 15d
ARCA: Adapter-Residual Credit Assignment When Token Signals Degenerate
arXiv cs.LG · Rodney Lafuente-Mercado · 15d
When Softmax Fails at the Top: Extreme Value Corrections for InfoNCE
arXiv cs.LG · Melihcan Erol, Suat Evren, Oktay Ozel, Alexander Morgan, Jongha Jon Ryu, Lizhong Zheng · 15d
Inner Product Aware Quantization: Provably Fast, Accurate, and Adaptive Algorithms
arXiv cs.LG · Nathan White, Krish Singal · 15d
Accurate Large-sample Uncertainty Quantification using Stochastic Gradient Markov Chain Monte Carlo
arXiv cs.LG · Yu Wang, Jie Ding, Jonathan H. Huggins · 15d
Adaptive Order Policies for Masked Diffusion
arXiv cs.LG · Jama Hussein Mohamud, Mohsin Hasan, Mirco Ravanelli, Yoshua Bengio · 15d
FLaG: Fine-Grained Latent Grouping for Hallucination Detection
arXiv cs.LG · Wentao Ye, Liyao Li, Zhiqing Xiao, Muzhi Zhu, Jiaqi Hu, Zhanming Shen, Xiaomeng Hu, Sean Du, Haobo Wang · 15d
Modeling Spectral Energy Shifts in Spatio-Temporal Graph Anomaly Detection
arXiv cs.LG · Yilin Liu, Hongchao Zhang, Taylor T. Johnson, Ahmad F. Taha, Meiyi Ma · 15d
Rethinking the Role of Temperature in Large Language Model Distillation
arXiv cs.LG · Hoang-Chau Luong, Lingwei Chen · 15d
Large-scale Uncertainty Quantification for Latent Variable Models Using Subsampling Markov Chain Monte Carlo
arXiv cs.LG · Xiaoyu Wang, Jonathan H. Huggins · 15d
Adversarially Robust Control of Conditional Value-at-Risk via Rockafellar-Uryasev Conformal Inference
arXiv cs.LG · Catherine Chen, Jingyan Shen, Zhun Deng, Lihua Lei · 15d
Perturbative methods for non-parametric instrumental variable
arXiv cs.LG · Wei Bu, Arthur Gretton · 15d
KG-Guard: Graph-Based Hallucination Detection for Knowledge Base Question Answering
arXiv cs.LG · Albert Sawczyn, Piotr Bielak, Tomasz Kajdanowicz · 15d
CHAM-net: A Contrastive Hierarchical Adaptive Meta-network for Robust Global Methane Flux Prediction
arXiv cs.LG · Rongchao Dong, Yiming Sun, Shuo Chen, Youmi Oh, Licheng Liu, Yiqun Xie, Xiaowei Jia · 15d
Balancing Learning Rates Across Layers: Exact Two-Step Dynamics and Optimal Scaling in Linear Neural Networks
arXiv cs.LG · Tianyu Pang, Vignesh Kothapalli, Shenyang Deng, Haohui Wang, Dawei Zhou, Yaoqing Yang · 15d
ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use
arXiv cs.LG · Jeremy Tien, Abishek Anand, Yu-Rou Tuan, Yuchen Shen, J. Zico Kolter, Aran Nayebi · 15d
PE-means: Improved Differentially Private k-means Clustering through Private Evolution
arXiv cs.LG · Thomas Humphries, Zinan Lin, Sergey Yekhanin · 15d
The role of class encoding in neural collapse
arXiv cs.LG · Bastien Massion, Roy Makhlouf, Estelle Massart · 15d
Longitudinal Multimodal Sensing of Physical Activity and Well-Being in Older Adults
arXiv cs.LG · Flavio Di Martino, Mattia G. Campana, Marcello Magno, Lorenza Pratali, Franca Delmastro · 15d
(HB-ARFM) History-Bootstrapped Flow Matching for Inverse Boiling Reconstruction
arXiv cs.LG · Xianwei Zou, Sheikh Md Shakeel Hassan, Arthur Feeney, Aparna Chandramowlishwaran · 15d
Drift Q-Learning
arXiv cs.LG · Anas Houssaini, Mohamad H. Danesh, Amin Abyaneh, Scott Fujimoto, Hsiu-Chin Lin, David Meger · 15d
GLENS: Global Search via Learning from Solver Iterates with Diffusion Models
arXiv cs.LG · Anjian Li, Bartolomeo Stellato, Ryne Beeson · 15d
Reinforcement Learning with Pairwise Preferences in Long-Term Decision Problems
arXiv cs.LG · Jonathan Colaço Carr, Prakash Panangaden, Doina Precup, Benjamin Van Roy · 15d
How Much Orthogonalization Does Muon Need?
arXiv cs.LG · Hua Huang · 15d
CRMA: A Spectrally-Bounded Backbone for Modular Continual Fine-Tuning of LLMs
arXiv cs.LG · Kiran Nayudu, Aswini Nutakki, Sai Vinay Naidu, Ashwin Shanmugasundaram · 15d