arXiv stat.ML papers May 29 2026
49 items ● 1 source ● updated 19d ago ● trend 0
On May 29, 2026, arXiv's stat.ML category published 20 papers spanning theoretical advances in federated learning, causal inference, diffusion models, conformal prediction, and optimization. Topics include sparse momentum dynamics, anytime-valid federated inference, heterogeneous treatment effects, privacy-aware online learning, and uncertainty quantification for generative models.
- Federated learning papers address bandwidth constraints: FC-RAG extends to anytime-valid coverage; FPLD achieves KL rate O(·) under bit budgets; optimal allocation studied across K nodes with B bits per query
- Causal inference advances: prediction-powered inference scales across many tasks with few labels per hypothesis; heterogeneous treatment effects estimated via matrix completion; optimal individualized treatment rules for bivariate survival outcomes
- Theoretical foundations: diffusion models proven statistically optimal for low-dimensional multi-modal distributions; Wasserstein contraction established for coordinate ascent variational inference; saddle networks preserve convex-concave geometry
- Conformal prediction extended: Conf-Gen adapts conformal risk control to generative models; leave-window-out jackknife modifies conformal methods for time series with temporal dependence
- Optimization and learning: instance-dependent Lipschitz bandit improves regret via asymptotic level sets; joint model-data sparsification via marginal likelihood; policy-aware simulator learning formulated as minimax game
Dynamics of Stochastic Momentum with Sparse Updates in High Dimensions
Anytime-Valid Federated Conformal RAG for LLM Swarms
Prediction-Powered Inference Across Many Tasks for AI Evaluation & Social Science Research
Deep Optimal Individualized Treatment Rules for Bivariate Survival Outcomes via Adaptive Prediction-Powered Learning
Matching Rates and Optimal Allocation for Federated Probe-Logit Distillation under Heterogeneous Bandwidth Budgets
Eigen-Spike Emergence and Quadratic Equivalents for Conjugate Kernels on Nonlinearly Separable Data
Instance-dependent Stochastic Lipschitz bandit
Joint Model and Data Sparsification via the Marginal Likelihood
Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions
Visual Spatial Learning: Single-Field Spatial Interpolation Using Convolutional Neural Networks
Wasserstein Contraction of Coordinate Ascent Variational Inference
Leave a Window Out: Modifying the Jackknife for Predictive Inference in Time Series
Improved Guarantees for Heterogeneous Treatment-Effect Estimation via Matrix Completion
Saddle Networks: Structure-Preserving Architectures for Convex-Concave Functions
Conf-Gen: Conformal Uncertainty Quantification for Generative Models
Theoretical Foundations and Effective Algorithms for Policy-Aware Simulator Learning
Optimal Gap-Dependent Regret for Private Stochastic Decision-Theoretic Online Learning
Do Deep Networks Forget Initialization? A Forgetting-Time View of Practical Inductive Bias
Bayesian Multiplicity Correction in the Probabilistic Forward Stepwise Framework
Causal Label Recovery in Payment Networks
Attention as In-Context Empirical Bayes: A Two-Stage View via Particle Dynamics
Kernel-based potential mean-field games with unbiased random Fourier $U$-statistics
On the Optimizer Dependence of Neural Scaling Laws
Low Rank for Rank: Uncertainty-Aware Task-Specific LLM Ranking under Sparse Pairwise Comparisons
The Good, the Bad, and the Ugly of Markov Boundary for Tabular Prediction
Constructing efficient channels for ideal observers using the conjugate gradient method
On the Construction and Implications of Low-Loss Valleys in LoRA-based Bayesian Inference
The Sample Complexity of Multiclass and Sparse Contextual Bandits
Kernel Renormalization in Bayesian Deep Neural Networks: the Equivalent Wishart Ansatz in the Proportional Regime
CB-SLICE: Concept-Based Interpretable Error Slice Discovery
The Topological Stability Index: A Variance-Based Measure for Persistence Barcodes
Open Problem: Separating Geometric and Algorithmic Compression via Cayley-Table Completion
Ridge Regression from Poisson Resetting: A Renewal Perspective on Spectral Regularization
Conformal Certification of Reasoning Trace Prefixes
Learning to Extrapolate to New Tasks: A Relational Approach to Task Extrapolation
A new completely parameter-free clustering algorithm for unsupervised classification of BATSE gamma-ray bursts
CalArena: A Large-Scale Post-Hoc Calibration Benchmark
Statistical Embeddings for Similarity, Retrieval, and Interpretable Alignment of Numeric Tabular Datasets
On Language Generation in the Limit with Bounded Memory
Reasoning with Sampling: Cutting at Decision Points
Noise-Aware Differentially Private Variational Inference
From Sublinear to Linear: Local Convergence in Finite-Width Networks via Locally Polyak-Lojasiewicz Regions
Risk-averse Fair Multi-class Classification
SADA: Safe and Adaptive Aggregation of Multiple Black-Box Predictions in Semi-Supervised Learning
Permutation-Invariant Spectral Learning via Dyson Diffusion
Calibrating Generative Models to Distributional Constraints
Follow-the-Perturbed-Leader for Decoupled Bandits: Best-of-Both-Worlds and Practicality
Diffusion differentiable resampling
Aggregate Models, Not Explanations: Improving Feature Importance Estimation