arXiv cs.LG papers May 29 2026

47 items · 1 source · updated 19d ago · trend 0

On May 29, 2026, arXiv's cs.LG category published 20 papers spanning mechanistic interpretability of language models, reinforcement learning, multimodal learning, time-series analysis, diffusion models, and domain-specific applications including drug discovery and power systems. Topics ranged from knowledge editing mechanisms and catastrophic forgetting to world models, LLM trading agents, and compact language models with adaptive reasoning.

  • ROME and MEMIT knowledge editing methods target a common subset of weights; a single binary mask reverses 80% of edits even though each edit is fact-specific
  • RL fine-tuning preserves internal computational circuits better than SFT, with differential circuit vulnerability analysis explaining mechanistic origins of catastrophic forgetting
  • TradeArena testbed reveals pre-failure signatures in LLM trading agents: planning embeddings drift from normal-state centroids before drawdowns
  • VAE-based world models trained on random embodied exploration develop spatial semantic structure mirroring physical geometry (6.6x improvement in position RSA)
  • CosmicFish-HRM compact language model uses hierarchical recurrent mechanisms (HRM) to allocate reasoning depth adaptively at inference time
  • 20 papers published across knowledge editing, RL, multimodal learning, time-series generation, diffusion control, and domain applications (drug discovery, power systems, metagenomic annotation)
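The TradeArena highlight above describes planning embeddings drifting away from a normal-state centroid before drawdowns. A minimal sketch of what such a drift signal could look like follows, assuming cosine distance as the drift measure. The function name `centroid_drift`, the choice of distance, and the toy data are illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def centroid_drift(normal_embeddings, new_embeddings):
    """Cosine distance of each new embedding from the normal-state centroid.

    Hypothetical helper for illustration only; the paper's actual drift
    measure is not specified in this feed.
    """
    normal_embeddings = np.asarray(normal_embeddings, dtype=float)
    new_embeddings = np.asarray(new_embeddings, dtype=float)

    # Centroid of embeddings collected while the agent behaves normally.
    centroid = normal_embeddings.mean(axis=0)
    centroid = centroid / np.linalg.norm(centroid)

    # Normalize each new embedding so the dot product is a cosine similarity.
    norms = np.linalg.norm(new_embeddings, axis=1, keepdims=True)
    unit = new_embeddings / norms

    # 0 means same direction as the centroid, 1 orthogonal, 2 opposite.
    return 1.0 - unit @ centroid


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Toy data: normal-state embeddings cluster around one direction.
    normal = rng.normal(loc=1.0, scale=0.1, size=(200, 8))
    # Later steps receive a growing offset on one coordinate to mimic drift.
    later = rng.normal(loc=1.0, scale=0.1, size=(5, 8))
    later[:, 0] -= np.linspace(0.0, 4.0, 5)
    print(centroid_drift(normal, later))
```

In a monitoring setting, one would typically compare these distances against a threshold calibrated on held-out normal-state data; that thresholding step is likewise an assumption and is left out here.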
One Mask to Rule Them All: On Hidden Facts after Editing and How to Find Them
arXiv cs.LG · Ali Holmov, Paul Youssef, Nandi Schoots, Christin Seifert · 19d
Representation Signatures and Risk-Feedback Alignment in LLM Trading Agents
arXiv cs.LG · Weicheng Xue · 19d
Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?
arXiv cs.LG · Jeanmely Rojas Nunez, Viraj Sawant, Nathan Allen, Nomgondalai Amgalanbaatar, Yannis Zongo, Vasu Sharma, Maheep Chaudhary · 19d
Molecular Lead Optimization via Agentic Tool Planning
arXiv cs.LG · Lingxiao Li, Haobo Zhang, Ruohao Fan, Bin Chen, Jiayu Zhou · 19d
Self-Play Reinforcement Learning under Imperfect Information in Big 2
arXiv cs.LG · Aalok Patwa · 19d
Emergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervision
arXiv cs.LG · Jiayi Fang · 19d
Continuity and Ordinality Matter: Constraining Time Series Tokens for Effective Time Series Analysis with Large Language Models
arXiv cs.LG · Musheng Li, Ziying Zhang, Cheng Jin, Yuantao Gu · 19d
PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation
arXiv cs.LG · Junru Zhang, Lang Feng, Jinbo Wang, Xu Guo, Yucheng Wang, Han Yu, Min Wu, Yabo Dong, Duanqing Xu · 19d
TaxDistill: Improving Metagenomic Taxonomic Annotation via Distilled Genomic Foundation Models
arXiv cs.LG · Rongye Ye, Lun Li, Zheng Luo, Yiran Zhan, Shuhui Song · 19d
Balancing Multimodal Learning through Label Space Reshaping
arXiv cs.LG · Xiaoyu Ma, Weijie Zhang, Yuanhao Gao, Han Miao, Yongjian Deng, Hao Chen · 19d
Representation Alignment Rests on Linear Structure
arXiv cs.LG · Kiril Bangachev, Guy Bresler, Yury Polyanskiy · 19d
Pre-Registering the Detectable Effect: A Paired-MDE Budget for 4-bit Quantization Benchmarks, with a Pilot Audit
arXiv cs.LG · Zexin Zhuang, Yanhang Li, Zhichao Fan · 19d
Towards Continuous-time Causal Foundation Models
arXiv cs.LG · Dennis Thumm, Ruben Wiedemann, Ying Chen · 19d
Context Distillation as Latent Memory Management
arXiv cs.LG · Ziyang Zheng, Zeju Li, Xiangyu Wen, Jianyuan Zhong, Junhua Huang, Lei Chen, Mingxuan Yuan, Qiang Xu · 19d
Feature Geometry of LoRA Adapters: A Sparse Autoencoder Analysis of Representational Divergence in Fine-Tuned Language Models
arXiv cs.LG · Prasanth K K · 19d
Spectral Guidance for Flexible and Efficient Control of Diffusion Models
arXiv cs.LG · Gabriel Moreira, Manuel Marques, João Paulo Costeira, Chenyan Xiong · 19d
Sequential Physics-Constrained Neural Operator Forward Modeling for the Norne Reservoir System
arXiv cs.LG · Clement Etienam, Juntao Yang, Oleg Ovcharenko, Nick Luiken, Tsubasa Onishi, Nefeli Moridis, Issam Said · 19d
Cycle-Space Informed Detection of Autoencoded Blind False Data Injection Attacks on Power Systems
arXiv cs.LG · Xin Li, Chenhan Xiao, Jonathan Cohen, Aviad Elyashar, Yang Weng, Rami Puzis · 19d
When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL
arXiv cs.LG · Youting Wang, Yuan Tang, Bowen Liu, Xuan Liu, Dingyan Shang · 19d
CosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Models
arXiv cs.LG · Venkat Akhil Lakkapragada · 19d
A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio
arXiv cs.LG · Ali Shehper, Ashish Vaswani · 19d
Comparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detection
arXiv cs.LG · Antonia Šarčević, Nikolina Frid · 19d
The Hamilton-Jacobi Theory of Deep Learning
arXiv cs.LG · Jose Marie Antonio Miñoza, Erika Fille T. Legara, Christopher P. Monterola · 19d
Learning Robust and Task-Invariant Functional Representation from fMRI through Siamese Self-Supervised Learning
arXiv cs.LG · Jiyao Wang, Peiyu Duan, Nicha C. Dvornek, Lawrence H. Staib, Denis Sukhodolsky, Pamela Ventola, James S. Duncan · 19d
FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks
arXiv cs.LG · Nishal Thomas, Noel Thomas · 19d
FedQHD: Closed-Form Function-Space Federated Reinforcement Learning
arXiv cs.LG · Yuchen Hou, Yongshan Chen, Zhuowen Zou, Calvin Yeung, Mohsen Imani, Tian Lan, Mahdi Imani · 19d
LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers
arXiv cs.LG · Jintao Li, Yong-Yi Wang, Zheng-An Wang, Heng Fan · 19d
Causal Intelligence for Constraint-Aware Intervention Design to Induce State Transitions
arXiv cs.LG · Zixuan Song, Uwe Mueller, Dimitris V. Manatakis · 19d
Label-Free Reinforcement Learning via Cross-Model Entropy
arXiv cs.LG · Matt Gorbett, Hossein Shirazi · 19d
Designing Active Tether-Net Systems for Space Debris Capture with Graph-Learning-Aided Mixed-Combinatorial Optimization
arXiv cs.LG · Feng Liu, Achira Boonrath, Gishnu Madhu, Eleonora M. Botta, Souma Chowdhury · 19d
Return-to-Go Is More Than a Number: Q-Guided Alignment for Return-Conditioned Supervised Learning
arXiv cs.LG · Yuxiao Yang, Weitong Zhang · 19d
Moment Matching Q-Learning
arXiv cs.LG · Yiyan (Edgar) Liang, Sifei Liu, Weitong Zhang · 19d
Parallel Adaptive Multi-Objective Evolutionary Learning of Discretized Bayesian Network Classifiers for Clinical Data
arXiv cs.LG · Damy M. F. Ha, Tanja Alderliesten, Peter A. N. Bosman · 19d
Ensemble Score Filtering for Real-Data Energy Consumption Forecast Correction
arXiv cs.LG · Ruoyu Hu, Dahai Yu, Feng Bao, Guang Wang, Guannan Zhang · 19d
Knowledge Offloading: Decomposing LLMs into Sparse Backbones and Memory Modules
arXiv cs.LG · Karim Galliamov, Rochelle Choenni, Ivan Titov · 19d
OISD: On-Policy Internal Self-Distillation of Language Models
arXiv cs.LG · Xinyu Liu, Darryl Cherian Jacob, Yang Zhou, Jindong Wang, Pan He · 19d
Model Merging by Output-Space Projection
arXiv cs.LG · Bethan Evans, Benjamin Etheridge, Stephen Roberts, Jared Tanner · 19d
Bridging Chemists and AI: An Expert-Augmented Framework for Interpretable Route Evaluation
arXiv cs.LG · Yujia Guo, Mikhail Kabeshov, Tat Hong Duong Le, Samuel Genheden, Marco V. Mijangos, Varvara Voinarvoska, Giulia Bergonzini, Ola Engkvist, Samuel Kaski · 19d
When and How Long? The Readout-Mediator Angle in Temporal Reasoning
arXiv cs.LG · Shreyas Fadnavis, Praitayini Kanakaraj, Felix Wyss · 19d
Apertus LLM Family Expansion via Distillation and Quantization
arXiv cs.LG · Andrei Panferov, Davit Melikidze, Martin Jaggi, Dan Alistarh · 19d
Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization
arXiv cs.LG · Yuxin Wang, Yuanzhe Hu, Xiaokun Zhong, Xiaopeng Wang, Haiquan Lu, Tianyu Pang, Michael W. Mahoney, Yujun Yan, Pu Ren, Yaoqing Yang · 19d
RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains
arXiv cs.LG · Haoxiang Jiang, Zihan Dong, Tianci Liu, Wanying Wang, Ran Xu, Tony Yu, Linjun Zhang, Haoyu Wang · 19d
Parallax: Parameterized Local Linear Attention for Language Modeling
arXiv cs.LG · Yifei Zuo, Dhruv Pai, Zhichen Zeng, Alec Dewulf, Shuming Hu, Zhaoran Wang · 19d
PROTOCOL: Late Interaction Retrieval for Protein Homolog Search
arXiv cs.LG · Gabrielle Cohn, Rohan Gumaste, Minh Hoang, Vihan Lakshman · 19d
Evolutionary Refinement of Generative Graph Topologies: A Hybrid WGAN-GA Approach
arXiv cs.LG · James Sargant, Seyedeh Ava Razi Razavi, Renata Dividino, Sheridan Houghten · 19d
Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions
arXiv cs.LG · Parsa Gooya, Reinel Sospedra-Alfonso · 19d
Lightweight Multimodal LLM-Enabled Cost-Effective Defect Grading of Power Transmission Equipment
arXiv cs.CL · Tao Wang, Lipeng Zhu, Jiayong Li, Feng Gao, Siwen Liang · 19d