←── back to feed
/topics/arxiv-cs-lg-papers-june-9-2026

arXiv cs.LG papers June 9 2026

47 items1 sourcesupdated 8d agotrend 0

On June 9, 2026, arXiv's cs.LG category published 20 papers spanning offline reinforcement learning for plasma control, medical image classification, decentralized swarm coordination, emergence theory, cloud resource allocation, time series generation, diffusion language models, autonomous scientific discovery, and various machine learning optimization techniques. The papers address practical challenges in deployment, efficiency, and robustness across domains including nuclear fusion, healthcare, smart grids, and LLM serving.

[BLG]blog/rss47
Offline Reinforcement Learning for Plasma Control in Nuclear Fusion: Codebase and Benchmark
arXiv cs.LG · Yang Fu, Haomin Bao, Rohit Sonker, Xiaoyan Hu, Aravind Venugopal, Jeff Schneider, Jiayu Chen · 8d
MedicalRec: Medical recommender system for image classification without retraining
arXiv cs.LG · Roghayeh Taghavi, Aysa Hasanazde Bashkandi, Amir Ali Bengari, Mohammad Amin Raji, Mohammad Salahi Ardekani, Parisa Mardukhian, Parvaneh Rezaei, Ramin Mousa · 8d
SPIN: Decentralized Swarm Control via Tensorized Policy Coordination
arXiv cs.LG · Zhaowen Fan · 8d
Emergence via Phase Transitions: Mechanism Landscapes and Universal Convergence Across Complex Systems
arXiv cs.LG · Truong Xuan Khanh · 8d
STARIXNet: Multivariate and Multi-attribute Deep Learning Approach to Real-Time Resource Allocation in Cloud Platforms
arXiv cs.LG · Ahmed Abdulaal, Maruf Aytekin, Thilaga kumaran Srinivasan, Tomer Lancewicki · 8d
TriHead-GAN: A Generative Adversarial Network with Triple-Head Discriminator for Carbon Emission Time Series Generation
arXiv cs.LG · Zesen Wang, Lijuan Lan, Yonggang Li, Chunhua Yang · 8d
Enabling KV Caching of Shared Prefix for Diffusion Language Models
arXiv cs.LG · Younghun Go, Jaehoon Han, Changyong Shin, Chuk Yoo, Gyeongsik Yang · 8d
When Should an AI Scientist Stop? Verifiable Experiment Steering and Refusal for Autonomous Discovery
arXiv cs.LG · Neel Tushar Shah, Manglam Kartik · 8d
Training-Inference Kernel Contracts: Bounding Divergence in Post-Training and Deployment
arXiv cs.LG · Bruce Changlong Xu, Lan Wu · 8d
Customer Churn Prediction on Structured Data Using FT-Transformer and Stacking Ensembles
arXiv cs.LG · Joyjit Roy, Samaresh Kumar Singh, Laxmi Shaw · 8d
Outage Detection in Self-Healing Smart Grids Using Reinforcement Learning with Spectral Graph Neural Networks
arXiv cs.LG · Lihui Liu, Mucun Sun, Caisheng Wang · 8d
From Human Guidance to Autonomy: Agent Skill System for End-to-End LLM Deployment on Spatial NPUs
arXiv cs.LG · Jiajie Li, Erwei Wang, Zhiru Zhang, Samuel Bayliss · 8d
The Routing Plateau: Understanding and Breaking the Accuracy Limits of LLM Routers
arXiv cs.LG · Yifan Lu, Qiyue Zhang, Shenrun Zhang, Zhibo Yu, Zhuang Wang, Hanjie Chen, Jiarong Xing · 8d
Optimality of Sequential Filtering Under Independent Cost and Selectivity Models
arXiv cs.LG · Hrishikesh Paranjape, Abhishek Mandal, Xian Sun · 8d
ResearchClawBench: A Benchmark for End-to-End Autonomous Scientific Research
arXiv cs.LG · Wanghan Xu, Shuo Li, Tianlin Ye, Qinglong Cao, Yixin Chen, Hengjian Gao, Yiheng Wang, Qi Li, Kun Li, Sheng Xu, Shengdu Chai, Fangchen Yu, Xiangyu Zhao, Zhangrui Zhao, Weijie Ma, Zijie Guo, Haoyu Zhou, Haoxiang Yin, Lixue Cheng, Chaofan Hu, Haoxuan Li, Lu Mi, Xuxuan Xie, Yifan Zhou, Ruizhe Chen, Zhiwang Zhou, Xingjian Guo, Yuhao Zhou, Xuming He, Shengyuan Xu, Xinyu Gu, Jiamin Wu, Mianxin Liu, Chunfeng Song, Fenghua Ling, Dongzhan Zhou, Shixiang Tang, Yuqiang Li, Mao Su, Peng Ye, Siqi Sun, Bin Wang, Xue Yang, Zhenfei Yin, Tianfan Fu, Guangtao Zhai, Wanli Ouyang, Bo Zhang, Lei Bai, Wenlong Zhang · 8d
UNIQ: Conformal Calibration for Adaptive Conservatism in Offline Reinforcement Learning
arXiv cs.LG · Aditya Upadhyay · 8d
Shortcuts in the Tail: Debiasing via Post-Hoc Spectral Compression of Fine-Tuning Updates
arXiv cs.LG · Edward Sun, Dmitrii Troitskii · 8d
Repetition Mismatch: Why Data Mixture Experiments Don't Scale and How to Fix Them
arXiv cs.LG · Kevin Zhou, Lisa Alazraki, Kris Cao, Marek Rei · 8d
A Topological Characterization of Graph Neural Networks via Stochastic Block Model Embeddings on the n-Sphere
arXiv cs.LG · Gopal Anantharaman · 8d
DiffoR: A Unified Continuous Generative Framework for Universal Ordinal Regression
arXiv cs.LG · Hongxu Ma, Lin Wang, Chenghou Jin, Han Zhou, Jie Zhang, Xiaoyu Yang, Chunjie Chen, Jihong Guan, Shuigeng Zhou · 8d
Reachability and asymptotics of Gaussian Transformer dynamics
arXiv cs.LG · Albert Alcalde, Zhengping Ji, Enrique Zuazua · 8d
LFNO: Bridging Laplace and Fourier via Transient-Steady Decomposition
arXiv cs.LG · Jeongun Ha, Sanga Yoon, Donghun Lee · 8d
Sample-Efficient Post-Training for LEGO Spatial-Physics Reasoning
arXiv cs.LG · Yuhuan Yuan, Zhouliang Yu, Minghao Liu, Weiyang Liu, Ge Lin Kan · 8d
MetaEvo: A Meta-Optimization Framework for Experience-Driven Agent Evolution
arXiv cs.LG · Bowen Ren, Heyan Huang, Yinghao Li, Yang Gao · 8d
Contribution Weights: A Geometrical Analysis of Self-Attention Transformers
arXiv cs.LG · Harry Jake Cunningham, Nicola Muca Cirone · 8d
SRT: Super-Resolution for Time Series via Disentangled Rectified Flow
arXiv cs.LG · Jufang Duan, Shenglong Xiao, Yuren Zhang · 8d
QDSP: An Interpretable Structured Learning Framework for Predicting Death or Cerebral Palsy in Very Low Birth Weight Infants
arXiv cs.LG · Ling Wang, Xiaolong Li, Hui Zhou, Jing Shi, Fuhao Zhang, Dapeng Chen, Nan Mu · 8d
Position: Genomic Model Research Must Move Beyond Anecdotal Evaluation of Interpretability Methods
arXiv cs.LG · Shasha Zhou, Mingyu Huang, Ke Li · 8d
LEAF: Growing Trees Without Branching for Speech-Aware Large Language Model Post-Training
arXiv cs.LG · Argyrios Gerogiannis, Yekaterina Yegorova, Mark Hasegawa-Johnson, Venugopal V. Veeravalli · 8d
Measuring Poverty and Inequality with Reduced Data: A Machine Learning Approach Using Nigerian Household Data
arXiv cs.LG · Vanesa Jord\'a, Miguel Ni\~no-Zaraz\'ua · 8d
Structured Neuron Pruning in Deep Neural Networks Using Multi-Armed Bandits
arXiv cs.LG · Salem Ameen, Sunil Vadera · 8d
Item Response Scaling Laws: A Measurement Theory Approach for Efficient and Generalizable Neural Scaling Estimation
arXiv cs.LG · Sang Truong, Yuheng Tu, Rylan Schaeffer, Sanmi Koyejo · 8d
Query Lens: Interpreting Sparse Key-Value Features with Indirect Effects
arXiv cs.LG · Hwiyeong Lee, Ingyu Bang, Uiji Hwang, Hyelim Lim, Taeuk Kim · 8d
ScaleSweep: Accurate NVFP4 Post-Training Quantization of LLMs via Block Scale Initialization
arXiv cs.LG · Li Lin, Xiaojun Wan · 8d
Graph Neural Networks for Predicting Solvability of Finite Groups
arXiv cs.LG · Tal Weissblat · 8d
HASA: Subnet Allocation for Compute-Constrained Model-Heterogeneous Federated Learning
arXiv cs.LG · Amir Hossein Shahdadian, Ahmed M. Abdelmoniem, Mahdi Taheri, Samira Nazari, Christian Herglotz · 8d
Airport Terminal Passenger Queue Forecasting for Departure Gates and Security Checkpoints
arXiv cs.LG · Juhwan Lee, Seokbin Yoon, Keumjin Lee, Hojong Baik, Seyeon Jung · 8d
Finite Certificates for In-Context Determinacy and a Threshold Theory of Emergence in Language Models
arXiv cs.LG · Faruk Alpay, Hamdi Alakkad · 8d
Sequential statistical inference for Large Language Models: Representation, validity, and monitoring
arXiv cs.LG · Yao Xie · 8d
Learning Transfers: Kan Extensions for Neural Invariants
arXiv cs.LG · Luciano Melodia · 8d
Large Language Models Should Learn Personalized Rather Than Aggregated Human Preferences
arXiv cs.LG · Cristina Garbacea · 8d
Trait-space Monitoring for Emergent Misalignment During Supervised Finetuning
arXiv cs.LG · Huy Nghiem, Sy-Tuyen Ho, Sarah Wiegreffe, Hal Daum\'e III · 8d
Evaluation of ML Resource Utilization Requires Model Life Cycle Assessment
arXiv cs.LG · Jared Fernandez, Clara Na, Yonatan Bisk, Constantine Samaras, Emma Strubell · 8d
KITE: A Tri-Modal Transformer Integrating Text, Images, and Knowledge Graphs for Fake News Detection
arXiv cs.LG · Kevin Patel, Shashi Bhushan Jha · 8d
DOG-DPO:Dynamic Optimization in Geometry for Safety Alignment
arXiv cs.LG · Yi Nian, Tiankai Yang, Yudi Zhang, Qi Pan, Zelong Xu, Shenzhe Zhu, Qingqing Luan, Yue Huang, Xiangliang Zhang, Yue Zhao · 8d
Semantic Cache Distillation: Efficient State Transfer via Reuse and Selective Patching
arXiv cs.LG · Qianli Ma, Zhiqing Tang, Hanshuai Cui, Zhi Yao, Weijia Jia · 8d
Test-Time Adaptive Composition for Machine Learning as a Service (MLaaS) in IoT Environments
arXiv cs.LG · Deepak Kanneganti, Sajib Mistry, Sheik Mohammad Mostakim Fattah, Aneesh Krishna · 8d