←── back to feed

/topics/arxiv-cs-lg-papers-may-29-2026

arXiv cs.LG papers May 29 2026

47 items●1 sources●updated 19d ago●trend 0

┌─ summary ─────────────────────────────┐

On May 29, 2026, arXiv's cs.LG category published 20 papers spanning mechanistic interpretability of language models, reinforcement learning, multimodal learning, time-series analysis, diffusion models, and domain-specific applications including drug discovery and power systems. Topics ranged from knowledge editing mechanisms and catastrophic forgetting to world models, LLM trading agents, and compact language models with adaptive reasoning.

┌─ key points ──────────────────────────┐

ROME and MEMIT knowledge editing methods target a common subset of weights; binary mask reverses 80% of edits despite fact-specific changes
RL fine-tuning preserves internal computational circuits better than SFT, with differential circuit vulnerability analysis explaining mechanistic origins of catastrophic forgetting
TradeArena testbed reveals pre-failure signatures in LLM trading agents: planning embeddings drift from normal-state centroids before drawdowns
VAE-based world models trained on random embodied exploration develop spatial semantic structure mirroring physical geometry (6.6x improvement in position RSA)
CosmicFish-HRM compact language model uses Hierarchical Reasoning Module for adaptive inference-time reasoning depth allocation
20 papers published across knowledge editing, RL, multimodal learning, time-series generation, diffusion control, and domain applications (drug discovery, power systems, metagenomic annotation)

┌─ items (47) ──────────────────────────┐

[BLG]blog/rss47

One Mask to Rule Them All: On Hidden Facts after Editing and How to Find Them

arXiv cs.LG · Ali Holmov, Paul Youssef, Nandi Schoots, Christin Seifert · 19d

Representation Signatures and Risk-Feedback Alignment in LLM Trading Agents

arXiv cs.LG · Weicheng Xue · 19d

Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?

arXiv cs.LG · Jeanmely Rojas Nunez, Viraj Sawant, Nathan Allen, Nomgondalai Amgalanbaatar, Yannis Zongo, Vasu Sharma, Maheep Chaudhary · 19d

Molecular Lead Optimization via Agentic Tool Planning

arXiv cs.LG · Lingxiao Li, Haobo Zhang, Ruohao Fan, Bin Chen, Jiayu Zhou · 19d

Self-Play Reinforcement Learning under Imperfect Information in Big 2

arXiv cs.LG · Aalok Patwa · 19d

Emergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervision

arXiv cs.LG · Jiayi Fang · 19d

Continuity and Ordinality Matter: Constraining Time Series Tokens for Effective Time Series Analysis with Large Language Models

arXiv cs.LG · Musheng Li, Ziying Zhang, Cheng jin, Yuantao Gu · 19d

PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation

arXiv cs.LG · Junru Zhang, Lang Feng, Jinbo Wang, Xu Guo, Yucheng Wang, Han Yu, Min Wu, Yabo Dong, Duanqing Xu · 19d

TaxDistill: Improving Metagenomic Taxonomic Annotation via Distilled Genomic Foundation Models

arXiv cs.LG · Rongye Ye, Lun Li, Zheng Luo, Yiran Zhan, Shuhui Song · 19d

Balancing Multimodal Learning through Label Space Reshaping

arXiv cs.LG · Xiaoyu Ma, Weijie Zhang, Yuanhao Gao, Han Miao, Yongjian Deng, Hao Chen · 19d

Representation Alignment Rests on Linear Structure

arXiv cs.LG · Kiril Bangachev, Guy Bresler, Yury Polyanskiy · 19d

Pre-Registering the Detectable Effect: A Paired-MDE Budget for 4-bit Quantization Benchmarks, with a Pilot Audit

arXiv cs.LG · Zexin Zhuang, Yanhang Li, Zhichao Fan · 19d

Towards Continuous-time Causal Foundation Models

arXiv cs.LG · Dennis Thumm, Ruben Wiedemann, Ying Chen · 19d

Context Distillation as Latent Memory Management

arXiv cs.LG · Ziyang Zheng, Zeju Li, Xiangyu Wen, Jianyuan Zhong, Junhua Huang, Lei Chen, Mingxuan Yuan, Qiang Xu · 19d

Feature Geometry of LoRA Adapters: A Sparse Autoencoder Analysis of Representational Divergence in Fine-Tuned Language Models

arXiv cs.LG · Prasanth K K · 19d

Spectral Guidance for Flexible and Efficient Control of Diffusion Models

arXiv cs.LG · Gabriel Moreira, Manuel Marques, Jo\~ao Paulo Costeira, Chenyan Xiong · 19d

Sequential Physics-Constrained Neural Operator Forward Modeling for the $\textit{Norne}$ Reservoir System

arXiv cs.LG · Clement Etienam, Juntao Yang, Oleg Ovcharenko, Nick Luiken, Tsubasa Onishi, Nefeli Moridis, Issam Said · 19d

Cycle-Space Informed Detection of Autoencoded Blind False Data Injection Attacks on Power Systems

arXiv cs.LG · Xin Li, Chenhan Xiao, Jonathan Cohen, Aviad Elyashar, Yang Weng, Rami Puzis · 19d

When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL

arXiv cs.LG · Youting Wang, Yuan Tang, Bowen Liu, Xuan Liu, Dingyan Shang · 19d

CosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Models

arXiv cs.LG · Venkat Akhil Lakkapragada · 19d

A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio

arXiv cs.LG · Ali Shehper, Ashish Vaswani · 19d

Comparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detection

arXiv cs.LG · Antonia \v{S}ar\v{c}evi\'c, Nikolina Frid · 19d

The Hamilton-Jacobi Theory of Deep Learning

arXiv cs.LG · Jose Marie Antonio Mi\~noza, Erika Fille T. Legara, Christopher P. Monterola · 19d

Learning Robust and Task-Invariant Functional Representation from fMRI through Siamese Self-Supervised Learning

arXiv cs.LG · Jiyao Wang, Peiyu Duan, Nicha C. Dvornek, Lawrence H. Staib, Denis Sukhodolsky, Pamela Ventola, James S. Duncan · 19d

FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks

arXiv cs.LG · Nishal Thomas, Noel Thomas · 19d

FedQHD: Closed-Form Function-Space Federated Reinforcement Learning

arXiv cs.LG · Yuchen Hou, Yongshan Chen, Zhuowen Zou, Calvin Yeung, Mohsen Imani, Tian Lan, Mahdi Imani · 19d

LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers

arXiv cs.LG · Jintao Li, Yong-Yi Wang, Zheng-An Wang, Heng Fan · 19d

Causal Intelligence for Constraint-Aware Intervention Design to Induce State Transitions

arXiv cs.LG · Zixuan Song, Uwe Mueller, Dimitris V. Manatakis · 19d

Label-Free Reinforcement Learning via Cross-Model Entropy

arXiv cs.LG · Matt Gorbett, Hossein Shirazi · 19d

Designing Active Tether-Net Systems for Space Debris Capture with Graph-Learning-Aided Mixed-Combinatorial Optimization

arXiv cs.LG · Feng Liu, Achira Boonrath, Gishnu Madhu, Eleonora M. Botta, Souma Chowdhury · 19d

Return-to-Go Is More Than a Number: Q-Guided Alignment for Return-Conditioned Supervised Learning

arXiv cs.LG · Yuxiao Yang, Weitong Zhang · 19d

Moment Matching Q-Learning

arXiv cs.LG · Yiyan (Edgar), Liang, Sifei Liu, Weitong Zhang · 19d

Parallel Adaptive Multi-Objective Evolutionary Learning of Discretized Bayesian Network Classifiers for Clinical Data

arXiv cs.LG · Damy M. F. Ha, Tanja Alderliesten, Peter A. N. Bosman · 19d

Ensemble Score Filtering for Real-Data Energy Consumption Forecast Correction

arXiv cs.LG · Ruoyu Hu, Dahai Yu, Feng Bao, Guang Wang, Guannan Zhang · 19d

Knowledge Offloading: Decomposing LLMs into Sparse Backbones and Memory Modules

arXiv cs.LG · Karim Galliamov, Rochelle Choenni, Ivan Titov · 19d

OISD: On-Policy Internal Self-Distillation of Language Models

arXiv cs.LG · Xinyu Liu, Darryl Cherian Jacob, Yang Zhou, Jindong Wang, Pan He · 19d

Model Merging by Output-Space Projection

arXiv cs.LG · Bethan Evans, Benjamin Etheridge, Stephen Roberts, Jared Tanner · 19d

Bridging Chemists and AI: An Expert-Augmented Framework for Interpretable Route Evaluation

arXiv cs.LG · Yujia Guo, Mikhail Kabeshov, Tat Hong Duong Le, Samuel Genheden, Marco V. Mijangos, Varvara Voinarvoska, Giulia Bergonzini, Ola Engkvist, Samuel Kaski · 19d

When and How Long? The Readout-Mediator Angle in Temporal Reasoning

arXiv cs.LG · Shreyas Fadnavis, Praitayini Kanakaraj, Felix Wyss · 19d

Apertus LLM Family Expansion via Distillation and Quantization

arXiv cs.LG · Andrei Panferov, Davit Melikidze, Martin Jaggi, Dan Alistarh · 19d

Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization

arXiv cs.LG · Yuxin Wang, Yuanzhe Hu, Xiaokun Zhong, Xiaopeng Wang, Haiquan Lu, Tianyu Pang, Michael W. Mahoney, Yujun Yan, Pu Ren, Yaoqing Yang · 19d

RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains

arXiv cs.LG · Haoxiang Jiang, Zihan Dong, Tianci Liu, Wanying Wang, Ran Xu, Tony Yu, Linjun Zhang, Haoyu Wang · 19d

Parallax: Parameterized Local Linear Attention for Language Modeling

arXiv cs.LG · Yifei Zuo, Dhruv Pai, Zhichen Zeng, Alec Dewulf, Shuming Hu, Zhaoran Wang · 19d

PROTOCOL: Late Interaction Retrieval for Protein Homolog Search

arXiv cs.LG · Gabrielle Cohn, Rohan Gumaste, Minh Hoang, Vihan Lakshman · 19d

Evolutionary Refinement of Generative Graph Topologies: A Hybrid WGAN-GA Approach

arXiv cs.LG · James Sargant, Seyedeh Ava Razi Razavi, Renata Dividino, Sheridan Houghten · 19d

Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions

arXiv cs.LG · Parsa Gooya, Reinel Sospedra-Alfonso · 19d

Lightweight Multimodal LLM-Enabled Cost-Effective Defect Grading of Power Transmission Equipment

arXiv cs.CL · Tao Wang, Lipeng Zhu, Jiayong Li, Feng Gao, Siwen Liang · 19d