←── back to feed
/topics/arxiv-cs-lg-papers-june-2-2026
arXiv cs.LG papers June 2 2026
50 items●1 sources●updated 15d ago●trend 0
On June 2, 2026, arXiv's cs.LG category published 20 papers spanning quantization and compression of mixture-of-experts models, concept bottleneck models for explainability, speculative decoding optimization, world models survey, reinforcement learning theory, and practical agentic system design. Key technical contributions include BitsMoE for efficient MoE quantization, DAStatFormer for distributed acoustic sensing, GEM for concept erasure in Rectified Flows, and theoretical work on transformer-based tree search in RL.
[BLG]blog/rss50
BitsMoE: Efficient Spectral Energy-Guided Bit Allocation for MoE LLM Quantization
DAStatFormer: A Hybrid Multibranch Transformer with Statistical Feature Integration for DAS-Based Pattern Recognitions
Hoeffding Concept Bottleneck Models with Applications to Overhead Images
From Demonstrations to Rewards: Test-Time Prompt Optimization for VLM Reward Models
A Shared Valence Axis Across Modern LLMs and Human EEG: The Saturation Regularity
Automatically Differentiable Nonlinear Tensor Networks (ADNTNs) for Exponential Compression of Deep Neural Networks
Foundation-Preserving Adaptation via Generalized Rayleigh-Quotient Optimization
World Models: A Comprehensive Survey of Architectures, Methodologies, Reasoning Paradigms, and Applications
On Effectiveness and Efficiency of Agentic Tool-calling and RL Training
Generative AI and Digital Ecosystem Resilience: A Proactive Lifecycle-Based Survey
Geometric Erasure by Contrastive Velocity Matching in Rectified Flows
Adaptive data selection improves wearable prediction under low baseline performance
BudgetDraft: Acceptance-Aware Multi-View Training for Sparse-KV Speculative Decoding
RAFT: Data Refinement and Adaptive Distillation for Domain Fine-Tuning with Alleviated Forgetting
Emergence of Exploration in Policy Gradient Reinforcement Learning via Retrying
ChurnNet: A Optimized Modern AI for Churn Prediction
Beyond Augmentation: Score-Guided Pathological Prior for EEG-based Depression Detection
Agentic Transformers Provably Learn to Search via Reinforcement Learning
AI-Guided Design and Optimization of Graphite-Based Anodes via Iterative Experimental Feedback
Learning to Construct Practical Agentic Systems
BAGEN: Are LLM Agents Budget-Aware?
From Rashomon Theory to PRAXIS: Efficient Decision Tree Rashomon Sets
Quantized Reasoning Models Think They Need to Think Longer, but They Do Not
LithoGRPO: Fast Inverse Lithography via GRPO Reinforced Flow Matching
A Pre-Training Analogue of Grokking in Language Models: Tracing Delayed Grammatical Generalization
InfoAtlas: A Foundation Model for Zero-Shot Statistical Dependence Estimate
ARCA: Adapter-Residual Credit Assignment When Token Signals Degenerate
When Softmax Fails at the Top: Extreme Value Corrections for InfoNCE
Inner Product Aware Quantization: Provably Fast, Accurate, and Adaptive Algorithms
Accurate Large-sample Uncertainty Quantification using Stochastic Gradient Markov Chain Monte Carlo
Adaptive Order Policies for Masked Diffusion
FLaG: Fine-Grained Latent Grouping for Hallucination Detection
Modeling Spectral Energy Shifts in Spatio-Temporal Graph Anomaly Detection
Rethinking the Role of Temperature in Large Language Model Distillation
Large-scale Uncertainty Quantification for Latent Variable Models Using Subsampling Markov Chain Monte Carlo
Adversarially Robust Control of Conditional Value-at-Risk via Rockafellar-Uryasev Conformal Inference
Perturbative methods for non-parametric instrumental variable
KG-Guard: Graph-Based Hallucination Detection for Knowledge Base Question Answering
CHAM-net: A Contrastive Hierarchical Adaptive Meta-network for Robust Global Methane Flux Prediction
Balancing Learning Rates Across Layers: Exact Two-Step Dynamics and Optimal Scaling in Linear Neural Networks
ROGUE: Misaligned Agent Behavior Arising from Ordinary Computer Use
PE-means: Improved Differentially Private $k$-means Clustering through Private Evolution
The role of class encoding in neural collapse
Longitudinal Multimodal Sensing of Physical Activity and Well-Being in Older Adults
(HB-ARFM) History-Bootstrapped Flow Matching for Inverse Boiling Reconstruction
Drift Q-Learning
GLENS: Global Search via Learning from Solver Iterates with Diffusion Models
Reinforcement Learning with Pairwise Preferences in Long-Term Decision Problems
How Much Orthogonalization Does Muon Need?
CRMA: A Spectrally-Bounded Backbone for Modular Continual Fine-Tuning of LLMs