AdamW-McGinley Dynamic Optimizer

An implementation of an AdamW variant that uses McGinley Dynamic-inspired adaptive momentum smoothing. The optimizer adapts its beta smoothing coefficients to the rate of change of the gradients, potentially improving on standard AdamW in non-stationary or noisy training regimes.

Concept

The McGinley Dynamic indicator from technical analysis proposes adaptive smoothing based on the rate of change of the signal. This project applies that concept to the AdamW optimizer's beta coefficients, dynamically adjusting them based on gradient changes:

  • Instead of fixed β₁ and β₂, we use dynamic β₁ₜ and β₂ₜ computed per step
  • β₁ₜ adapts based on gradient changes: β₁ₜ = β₁ / (1 + |grad_change|²)
  • β₂ₜ adapts based on squared gradient changes: β₂ₜ = β₂ / (1 + |grad_sq_change|²)
  • Minimum thresholds on the beta values ensure stability (a sketch of the adaptation follows below)
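
A minimal sketch of the per-step adaptation, assuming the gradient change is measured against the previous step's gradient (the exact measure used in custom_optimizer.py may differ):

import torch

def dynamic_betas(grad, prev_grad, beta1=0.9, beta2=0.999,
                  min_beta1=0.5, min_beta2=0.9):
    # Rates of change of the gradient and of the squared gradient
    grad_change = (grad - prev_grad).norm().item()
    grad_sq_change = (grad.pow(2) - prev_grad.pow(2)).norm().item()

    # McGinley-style damping: larger changes -> smaller (more responsive) betas
    beta1_t = beta1 / (1 + grad_change ** 2)
    beta2_t = beta2 / (1 + grad_sq_change ** 2)

    # Clamp to the minimum thresholds for stability
    return max(beta1_t, min_beta1), max(beta2_t, min_beta2)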

Features

  • CustomAdamW optimizer with dynamic smoothing capability
  • Optional global gradient norm scaling
  • Beta statistics logging integration with Weights & Biases
  • Comprehensive sweep infrastructure for comparative analysis

Installation

To install the required dependencies, run:

pip install -r requirements.txt

Or set up a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate
pip install -r requirements.txt

To reproduce the exact versions used in our environment (PyTorch 2.7.x), install the pinned requirements:

pip install -r requirements-pinned.txt

Usage

Basic Usage

from custom_optimizer import CustomAdamW

# Create optimizer with dynamic smoothing
optimizer = CustomAdamW(
    model.parameters(),
    lr=0.001,
    weight_decay=0.01,
    dynamic_smoothing=True,  # Enable dynamic betas
    min_beta1=0.5,          # Minimum value for beta1
    min_beta2=0.9,          # Minimum value for beta2
    log_betas=True          # Track beta statistics
)
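
The optimizer drops into a standard PyTorch training loop exactly like torch.optim.AdamW; a brief illustration (model, loader, and loss_fn are assumed to be defined elsewhere):

for inputs, targets in loader:
    optimizer.zero_grad()
    loss = loss_fn(model(inputs), targets)
    loss.backward()
    optimizer.step()  # with dynamic_smoothing=True, the per-step betas are applied here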

Running Experiments

Before running experiments, make sure to set up your Weights & Biases API key:

export WANDB_API_KEY=your_api_key_here

To run experiments comparing CustomAdamW to standard AdamW:

python sweep_train.py

This launches a Weights & Biases sweep that compares both optimizers across different configurations.
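
For reference, a sweep of this kind might be configured roughly as follows; the parameter names and project name below are illustrative, not necessarily those used in sweep_train.py:

import wandb

# Hypothetical sweep definition comparing the two optimizers
sweep_config = {
    "method": "grid",
    "metric": {"name": "val_accuracy", "goal": "maximize"},
    "parameters": {
        "optimizer": {"values": ["adamw", "custom_adamw"]},
        "lr": {"values": [1e-3, 3e-4]},
        "weight_decay": {"values": [0.01]},
    },
}

sweep_id = wandb.sweep(sweep_config, project="adamw-mcginley")
# wandb.agent(sweep_id, function=train)  # train() would come from sweep_train.py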

Testing

Install pytest and run the unit tests:

pip install pytest
PYTHONPATH=. pytest -q
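
If you want to extend the test suite, a minimal smoke test along these lines (the toy model and iteration count are illustrative) checks that the optimizer actually reduces loss:

import torch
from custom_optimizer import CustomAdamW

def test_custom_adamw_reduces_loss():
    torch.manual_seed(0)
    model = torch.nn.Linear(4, 1)
    optimizer = CustomAdamW(model.parameters(), lr=0.01, dynamic_smoothing=True)
    x, y = torch.randn(32, 4), torch.randn(32, 1)

    initial_loss = torch.nn.functional.mse_loss(model(x), y).item()
    for _ in range(50):
        optimizer.zero_grad()
        loss = torch.nn.functional.mse_loss(model(x), y)
        loss.backward()
        optimizer.step()

    # The loss on this toy regression problem should decrease
    assert loss.item() < initial_loss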

Results

Sample Comparison Test

Full experiment results for this sweep on Weights & Biases are available; contact me for access.

Project Structure

  • custom_optimizer.py: Implementation of CustomAdamW optimizer
  • sweep_train.py: Main training and sweep script for CIFAR-10 experiments
  • adamw-mcginley-dynamic-optimizer.md: Detailed explanation of the algorithm

License

Copyright (c) 2025 Adrian Scott.
