←── back to feed
/topics/arxiv-cs-lg-papers-june-9-2026
arXiv cs.LG papers June 9 2026
47 items●1 sources●updated 8d ago●trend 0
On June 9, 2026, arXiv's cs.LG category published 20 papers spanning offline reinforcement learning for plasma control, medical image classification, decentralized swarm coordination, emergence theory, cloud resource allocation, time series generation, diffusion language models, autonomous scientific discovery, and various machine learning optimization techniques. The papers address practical challenges in deployment, efficiency, and robustness across domains including nuclear fusion, healthcare, smart grids, and LLM serving.
[BLG]blog/rss47
Offline Reinforcement Learning for Plasma Control in Nuclear Fusion: Codebase and Benchmark
MedicalRec: Medical recommender system for image classification without retraining
SPIN: Decentralized Swarm Control via Tensorized Policy Coordination
Emergence via Phase Transitions: Mechanism Landscapes and Universal Convergence Across Complex Systems
STARIXNet: Multivariate and Multi-attribute Deep Learning Approach to Real-Time Resource Allocation in Cloud Platforms
TriHead-GAN: A Generative Adversarial Network with Triple-Head Discriminator for Carbon Emission Time Series Generation
Enabling KV Caching of Shared Prefix for Diffusion Language Models
When Should an AI Scientist Stop? Verifiable Experiment Steering and Refusal for Autonomous Discovery
Training-Inference Kernel Contracts: Bounding Divergence in Post-Training and Deployment
Customer Churn Prediction on Structured Data Using FT-Transformer and Stacking Ensembles
Outage Detection in Self-Healing Smart Grids Using Reinforcement Learning with Spectral Graph Neural Networks
From Human Guidance to Autonomy: Agent Skill System for End-to-End LLM Deployment on Spatial NPUs
The Routing Plateau: Understanding and Breaking the Accuracy Limits of LLM Routers
Optimality of Sequential Filtering Under Independent Cost and Selectivity Models
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research
UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning
Shortcuts in the Tail: Debiasing via Post-Hoc Spectral Compression of Fine-Tuning Updates
Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them
A Topological Characterization of Graph Neural Networks via Stochastic Block Model Embeddings on the n-Sphere
DiffoR: A Unified Continuous Generative Framework for Universal Ordinal Regression
Reachability and asymptotics of Gaussian Transformer dynamics
LFNO: Bridging Laplace and Fourier via Transient-Steady Decomposition
Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning
MetaEvo: A Meta-Optimization Framework for Experience-Driven Agent Evolution
Contribution Weights: A Geometrical Analysis of Self-Attention Transformers
SRT: Super-Resolution for Time Series via Disentangled Rectified Flow
QDSP: An Interpretable Structured Learning Framework for Predicting Death or Cerebral Palsy in Very Low Birth Weight Infants
Position: Genomic Model Research Must Move Beyond Anecdotal Evaluation of Interpretability Methods
LEAF: Growing Trees Without Branching for Speech-Aware Large Language Model Post-Training
Measuring Poverty and Inequality with Reduced Data: A Machine Learning Approach Using Nigerian Household Data
Structured Neuron Pruning in Deep Neural Networks Using Multi-Armed Bandits
Item Response Scaling Laws: A Measurement Theory Approach for Efficient and Generalizable Neural Scaling Estimation
Query Lens: Interpreting Sparse Key-Value Features with Indirect Effects
ScaleSweep: Accurate NVFP4 Post-Training Quantization of LLMs via Block Scale Initialization
Graph Neural Networks for Predicting Solvability of Finite Groups
HASA: Subnet Allocation for Compute-Constrained Model-Heterogeneous Federated Learning
Airport Terminal Passenger Queue Forecasting for Departure Gates and Security Checkpoints
Finite Certificates for In-Context Determinacy and a Threshold Theory of Emergence in Language Models
Sequential statistical inference for Large Language Models: Representation, validity, and monitoring
Learning Transfers: Kan Extensions for Neural Invariants
Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences
Trait-space Monitoring for Emergent Misalignment During Supervised Finetuning
Evaluation of ML Resource Utilization Requires Model Life Cycle Assessment
KITE: A Tri-Modal Transformer Integrating Text, Images, and Knowledge Graphs for Fake News Detection
DOG-DPO:Dynamic Optimization in Geometry for Safety Alignment
Semantic Cache Distillation: Efficient State Transfer via Reuse and Selective Patching
Test-Time Adaptive Composition for Machine Learning as a Service (MLaaS) in IoT Environments