ICLR 2025 ML papers

I am curating papers (mainly ICLR ‘25 submissions) on hyperparameter tuning for large-scale training. Throughout, $N$ is the model size in parameters, $T$ is the number of training tokens, LR is the learning rate, BS is the batch size, and WD is the AdamW weight decay.

| Title | Summary |
| --- | --- |
| Scaling Optimal LR Across Token Horizons | ${\rm LR} \propto N^{-0.23}\,T^{-0.32}$ (fixed batch size) |
| How Does Critical Batch Size Scale in Pre-training? | ${\rm critical\ BS} \propto T$ (fixed LR) |
| Time Transfer: On Optimal Learning Rate and Batch Size In The Infinite Data Limit | Relations among BS, LR, and $T$ are complicated |
| How to set AdamW’s weight decay as you scale model and dataset size | The “timescale” $1/({\rm LR} \cdot {\rm WD})$ should be held constant |
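As a quick illustration of the first row, here is a minimal sketch (my own, not from the paper) of using the fitted power law to extrapolate a tuned learning rate from a small proxy run to a larger model and longer token horizon; the baseline numbers in the example are hypothetical.

```python
def scale_lr(base_lr, base_params, base_tokens, target_params, target_tokens):
    """Extrapolate the optimal LR via LR ∝ N^-0.23 · T^-0.32 (batch size fixed).

    N = model size in parameters, T = training tokens; the exponents are the
    fitted values quoted in the table above.
    """
    return (base_lr
            * (target_params / base_params) ** -0.23
            * (target_tokens / base_tokens) ** -0.32)

# Hypothetical example: LR tuned to 3e-4 on a 125M-parameter model trained on
# 2B tokens, extrapolated to a 1.3B-parameter model trained on 100B tokens.
print(scale_lr(3e-4, 125e6, 2e9, 1.3e9, 100e9))  # ≈ 5e-5
```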