arXiv cs.LG papers May 29 2026
47 items ● 1 source ● updated 19d ago ● trend 0
This feed collects 47 papers posted to arXiv's cs.LG category on May 29, 2026. They span mechanistic interpretability of language models, reinforcement learning, multimodal learning, time-series analysis, diffusion models, and domain-specific applications including drug discovery and power systems. Topics range from knowledge editing mechanisms and catastrophic forgetting to world models, LLM trading agents, and compact language models with adaptive reasoning.
- ROME and MEMIT knowledge editing methods target a common subset of weights; a single binary mask reverses 80% of edits even though the weight changes are fact-specific
- RL fine-tuning preserves internal computational circuits better than SFT; a differential circuit vulnerability analysis explains the mechanistic origins of catastrophic forgetting
- TradeArena testbed reveals pre-failure signatures in LLM trading agents: planning embeddings drift from normal-state centroids before drawdowns
- VAE-based world models trained on random embodied exploration develop spatial semantic structure mirroring physical geometry (6.6x improvement in position RSA)
- CosmicFish-HRM, a compact language model, uses a Hierarchical Reasoning Module to allocate reasoning depth adaptively at inference time
- 47 papers listed across knowledge editing, RL, multimodal learning, time-series generation, diffusion control, and domain applications (drug discovery, power systems, metagenomic annotation)
One Mask to Rule Them All: On Hidden Facts after Editing and How to Find Them
Representation Signatures and Risk-Feedback Alignment in LLM Trading Agents
Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?
Molecular Lead Optimization via Agentic Tool Planning
Self-Play Reinforcement Learning under Imperfect Information in Big 2
Emergent Semantic Representations in World Models through Physical Interaction without Linguistic Supervision
Continuity and Ordinality Matter: Constraining Time Series Tokens for Effective Time Series Analysis with Large Language Models
PrismFlow: Residual Dynamics for Flow Matching in Time-Series Generation
TaxDistill: Improving Metagenomic Taxonomic Annotation via Distilled Genomic Foundation Models
Balancing Multimodal Learning through Label Space Reshaping
Representation Alignment Rests on Linear Structure
Pre-Registering the Detectable Effect: A Paired-MDE Budget for 4-bit Quantization Benchmarks, with a Pilot Audit
Towards Continuous-time Causal Foundation Models
Context Distillation as Latent Memory Management
Feature Geometry of LoRA Adapters: A Sparse Autoencoder Analysis of Representational Divergence in Fine-Tuned Language Models
Spectral Guidance for Flexible and Efficient Control of Diffusion Models
Sequential Physics-Constrained Neural Operator Forward Modeling for the $\textit{Norne}$ Reservoir System
Cycle-Space Informed Detection of Autoencoded Blind False Data Injection Attacks on Power Systems
When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL
CosmicFish-HRM: Adaptive Reasoning via Hierarchical Recurrent Mechanisms in Compact Language Models
A Training-Time Diagnostic for Generalization via the Log-Alignment Ratio
Comparing Post-Hoc Explainable AI Methods for Interpreting Black-Box EEG Models in Depression Detection
The Hamilton-Jacobi Theory of Deep Learning
Learning Robust and Task-Invariant Functional Representation from fMRI through Siamese Self-Supervised Learning
FormInv: A Measurement Protocol for Semantic Invariance in Mathematical Reasoning Benchmarks
FedQHD: Closed-Form Function-Space Federated Reinforcement Learning
LoRe: Adaptive Interaction-Evaluation Routing with Per-Step Interaction Budgets for Iterative Graph Solvers
Causal Intelligence for Constraint-Aware Intervention Design to Induce State Transitions
Label-Free Reinforcement Learning via Cross-Model Entropy
Designing Active Tether-Net Systems for Space Debris Capture with Graph-Learning-Aided Mixed-Combinatorial Optimization
Return-to-Go Is More Than a Number: Q-Guided Alignment for Return-Conditioned Supervised Learning
Moment Matching Q-Learning
Parallel Adaptive Multi-Objective Evolutionary Learning of Discretized Bayesian Network Classifiers for Clinical Data
Ensemble Score Filtering for Real-Data Energy Consumption Forecast Correction
Knowledge Offloading: Decomposing LLMs into Sparse Backbones and Memory Modules
OISD: On-Policy Internal Self-Distillation of Language Models
Model Merging by Output-Space Projection
Bridging Chemists and AI: An Expert-Augmented Framework for Interpretable Route Evaluation
When and How Long? The Readout-Mediator Angle in Temporal Reasoning
Apertus LLM Family Expansion via Distillation and Quantization
Unveiling Multi-regime Patterns in SciML: Distinct Failure Modes and Regime-specific Optimization
RUBRIC-ARROW: Alternating Pointwise Rubric Reward Modeling for LLM Post-training in Non-verifiable Domains
Parallax: Parameterized Local Linear Attention for Language Modeling
PROTOCOL: Late Interaction Retrieval for Protein Homolog Search
Evolutionary Refinement of Generative Graph Topologies: A Hybrid WGAN-GA Approach
Probabilistic bias adjustment of seasonal forecasts using generative machine learning: A case study of Arctic sea ice predictions
Lightweight Multimodal LLM-Enabled Cost-Effective Defect Grading of Power Transmission Equipment