←── back to feed

/topics/arxiv-stat-ml-papers-may-29-2026

arXiv stat.ML papers May 29 2026

49 items●1 sources●updated 19d ago●trend 0

┌─ summary ─────────────────────────────┐

On May 29, 2026, arXiv's stat.ML category published 20 papers spanning theoretical advances in federated learning, causal inference, diffusion models, conformal prediction, and optimization. Topics include sparse momentum dynamics, anytime-valid federated inference, heterogeneous treatment effects, privacy-aware online learning, and uncertainty quantification for generative models.

┌─ key points ──────────────────────────┐

Federated learning papers address bandwidth constraints: FC-RAG extends to anytime-valid coverage; FPLD achieves KL rate O(·) under bit budgets; optimal allocation studied across K nodes with B bits per query
Causal inference advances: prediction-powered inference scales across many tasks with few labels per hypothesis; heterogeneous treatment effects estimated via matrix completion; optimal individualized treatment rules for bivariate survival outcomes
Theoretical foundations: diffusion models proven statistically optimal for low-dimensional multi-modal distributions; Wasserstein contraction established for coordinate ascent variational inference; saddle networks preserve convex-concave geometry
Conformal prediction extended: Conf-Gen adapts conformal risk control to generative models; leave-window-out jackknife modifies conformal methods for time series with temporal dependence
Optimization and learning: instance-dependent Lipschitz bandit improves regret via asymptotic level sets; joint model-data sparsification via marginal likelihood; policy-aware simulator learning formulated as minimax game

┌─ items (49) ──────────────────────────┐

[BLG]blog/rss49

Dynamics of Stochastic Momentum with Sparse Updates in High Dimensions

arXiv stat.ML · Katie Everett, Elliot Paquette · 19d

Anytime-Valid Federated Conformal RAG for LLM Swarms

arXiv stat.ML · Prasanjit Dubey, Xiaoming Huo · 19d

Prediction-Powered Inference Across Many Tasks for AI Evaluation & Social Science Research

arXiv stat.ML · Nicolas Emmenegger, Ellery Stahler, Chara Podimata · 19d

Deep Optimal Individualized Treatment Rules for Bivariate Survival Outcomes via Adaptive Prediction-Powered Learning

arXiv stat.ML · Kun Ren, Yifan Cui, Wen Su · 19d

Matching Rates and Optimal Allocation for Federated Probe-Logit Distillation under Heterogeneous Bandwidth Budgets

arXiv stat.ML · Prasanjit Dubey, Xiaoming Huo · 19d

Eigen-Spike Emergence and Quadratic Equivalents for Conjugate Kernels on Nonlinearly Separable Data

arXiv stat.ML · Collin Cranston, Zhichao Wang, Todd Kemp, Michael W. Mahoney · 19d

Instance-dependent Stochastic Lipschitz bandit

arXiv stat.ML · Marius Potfer, Vianney Perchet · 19d

Joint Model and Data Sparsification via the Marginal Likelihood

arXiv stat.ML · Alexander Timans, Thomas M\"ollenhoff, Christian A. Naesseth, Mohammad Emtiyaz Khan, Eric Nalisnick · 19d

Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions

arXiv stat.ML · Jingda Wu, Changxiao Cai · 19d

Visual Spatial Learning: Single-Field Spatial Interpolation Using Convolutional Neural Networks

arXiv stat.ML · Daniel Tinoco, Raquel Menezes, Carlos Baquero, Alexandra Silva · 19d

Wasserstein Contraction of Coordinate Ascent Variational Inference

arXiv stat.ML · Rocco Caprio, Adrien Corenflos, Sam Power · 19d

Leave a Window Out: Modifying the Jackknife for Predictive Inference in Time Series

arXiv stat.ML · Hanyang Jiang, Rina Foygel Barber, Ashwin Pananjady, Yao Xie · 19d

Improved Guarantees for Heterogeneous Treatment-Effect Estimation via Matrix Completion

arXiv stat.ML · Anay Mehrotra, Phuc Tran, Van H. Vu, Manolis Zampetakis · 19d

Saddle Networks: Structure-Preserving Architectures for Convex-Concave Functions

arXiv stat.ML · Xavier Warin · 19d

Conf-Gen: Conformal Uncertainty Quantification for Generative Models

arXiv stat.ML · Gabriel Loaiza-Ganem, Kevin Zhang, Wei Cui, Marc T. Law, Kin Kwan Leung · 19d

Theoretical Foundations and Effective Algorithms for Policy-Aware Simulator Learning

arXiv stat.ML · Christoph Dann, Yishay Mansour, Mehryar Mohri · 19d

Optimal Gap-Dependent Regret for Private Stochastic Decision-Theoretic Online Learning

arXiv stat.ML · Tommaso Cesari, Roberto Colomboni · 19d

Do Deep Networks Forget Initialization? A Forgetting-Time View of Practical Inductive Bias

arXiv stat.ML · Mohua Das, Pierfrancesco Beneventano, Shibshankar Dey, Gareth H. McKinkey, Tomaso Poggio · 19d

Bayesian Multiplicity Correction in the Probabilistic Forward Stepwise Framework

arXiv stat.ML · Andrew Womack, Daniel Taylor-Rodriguez · 19d

Causal Label Recovery in Payment Networks

arXiv stat.ML · Gaurav Dhama · 19d

Attention as In-Context Empirical Bayes: A Two-Stage View via Particle Dynamics

arXiv stat.ML · Matthew Smart, Soumya Ganguly, Nilava Metya, Alexandre V. Morozov, Anirvan M. Sengupta · 19d

Kernel-based potential mean-field games with unbiased random Fourier $U$-statistics

arXiv stat.ML · Yumiharu Nakano · 19d

On the Optimizer Dependence of Neural Scaling Laws

arXiv stat.ML · Vansh Ramani, Shourya Vir Jain · 19d

Low Rank for Rank: Uncertainty-Aware Task-Specific LLM Ranking under Sparse Pairwise Comparisons

arXiv stat.ML · Jiachun Li, David Simchi-Levi, Will Wei Sun · 19d

The Good, the Bad, and the Ugly of Markov Boundary for Tabular Prediction

arXiv stat.ML · Shu Wan, Abhinav Gorantla, Huan Liu, K. Sel\c{c}uk Candan · 19d

Constructing efficient channels for ideal observers using the conjugate gradient method

arXiv stat.ML · Weimin Zhou · 19d

On the Construction and Implications of Low-Loss Valleys in LoRA-based Bayesian Inference

arXiv stat.ML · Daniel Dold, Emanuel Sommer, Julius Kobialka, Oliver D\"urr, David R\"ugamer · 19d

The Sample Complexity of Multiclass and Sparse Contextual Bandits

arXiv stat.ML · Liad Erez, Fan Chen, Alon Cohen, Tomer Koren, Yishay Mansour, Shay Moran, Alexander Rakhlin · 19d

Kernel Renormalization in Bayesian Deep Neural Networks: the Equivalent Wishart Ansatz in the Proportional Regime

arXiv stat.ML · Paolo Baglioni, Christian Keup, Vincenzo Zimbardo, Rosalba Pacelli, Alessandro Vezzani, Raffaella Burioni, Pietro Rotondo · 19d

CB-SLICE: Concept-Based Interpretable Error Slice Discovery

arXiv stat.ML · Yael Konforti, Mateo Espinosa Zarlenga, Elaf Almahmoud, Mateja Jamnik · 19d

The Topological Stability Index: A Variance-Based Measure for Persistence Barcodes

arXiv stat.ML · Joris Kirchner, Ioannis Diamantis · 19d

Open Problem: Separating Geometric and Algorithmic Compression via Cayley-Table Completion

arXiv stat.ML · Dongsung Huh · 19d

Ridge Regression from Poisson Resetting: A Renewal Perspective on Spectral Regularization

arXiv stat.ML · Petar Jolakoski · 19d

Conformal Certification of Reasoning Trace Prefixes

arXiv stat.ML · Matt Y. Cheung, Ashok Veeraraghavan, Hanjie Chen, Guha Balakrishnan · 19d

Learning to Extrapolate to New Tasks: A Relational Approach to Task Extrapolation

arXiv stat.ML · Adam Ousherovitch, Yixin Wang · 19d

A new completely parameter-free clustering algorithm for unsupervised classification of BATSE gamma-ray bursts

arXiv stat.ML · Soumita Modak · 19d

CalArena: A Large-Scale Post-Hoc Calibration Benchmark

arXiv stat.ML · Eug\`ene Berta, David Holzm\"uller, Francis Bach, Michael I. Jordan · 19d

Statistical Embeddings for Similarity, Retrieval, and Interpretable Alignment of Numeric Tabular Datasets

arXiv stat.ML · M. Ross Kunz, John Merickel, Keith Wilson · 19d

On Language Generation in the Limit with Bounded Memory

arXiv stat.ML · Jon Kleinberg, Anay Mehrotra, Amin Saberi, Grigoris Velegkas · 19d

Reasoning with Sampling: Cutting at Decision Points

arXiv stat.ML · Felix Zhou, Anay Mehrotra, Quanquan C. Liu · 19d

Noise-Aware Differentially Private Variational Inference

arXiv stat.ML · Talal Alrawajfeh, Joonas J\"alk\"o, Antti Honkela · 19d

From Sublinear to Linear: Local Convergence in Finite-Width Networks via Locally Polyak-Lojasiewicz Regions

arXiv stat.ML · Agnideep Aich, Ashit Baran Aich, Bruce Wade · 19d

Risk-averse Fair Multi-class Classification

arXiv stat.ML · Darinka Dentcheva, Xiangyu Tian · 19d

SADA: Safe and Adaptive Aggregation of Multiple Black-Box Predictions in Semi-Supervised Learning

arXiv stat.ML · Jiawei Shan, Zhifeng Chen, Yiming Dong, Yazhen Wang, Jiwei Zhao · 19d

Permutation-Invariant Spectral Learning via Dyson Diffusion

arXiv stat.ML · Tassilo Schwarz, Cai Dieball, Constantin Kogler, Renaud Lambiotte, Arnaud Doucet, Alja\v{z} Godec, George Deligiannidis · 19d

Calibrating Generative Models to Distributional Constraints

arXiv stat.ML · Henry D. Smith, Nathaniel L. Diamant, Brian L. Trippe · 19d

Follow-the-Perturbed-Leader for Decoupled Bandits: Best-of-Both-Worlds and Practicality

arXiv stat.ML · Chaiwon Kim, Jongyeong Lee, Min-hwan Oh · 19d

Diffusion differentiable resampling

arXiv stat.ML · Jennifer Rosina Andersson, Zheng Zhao · 19d

Aggregate Models, Not Explanations: Improving Feature Importance Estimation

arXiv stat.ML · Joseph Paillard, Angel Reyero Lobo, Denis A. Engemann, Bertrand Thirion · 19d