Implement lbfgs and bfgs #365

Merged
abhijeetgangan merged 23 commits into main from ag/opt_lbfgs_bfgs on Feb 4, 2026

Conversation

@abhijeetgangan (Collaborator) commented Nov 26, 2025

Summary

  • Add L-BFGS and BFGS optimization schemes (see the usage sketch below).
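
A minimal, hedged usage sketch of the new optimizers: the ts.optimize call, ts.Optimizer.lbfgs, and the max_history/step_size kwargs mirror the benchmark script later in this thread, while the Lennard-Jones model, its import path, and its constructor arguments are illustrative assumptions, not part of this PR.

import torch
from ase.build import bulk

import torch_sim as ts
from torch_sim.models.lennard_jones import LennardJonesModel  # assumed import path

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
dtype = torch.float64

# Cheap pair potential just to exercise the optimizer; constructor args are assumed.
model = LennardJonesModel(device=device, dtype=dtype)

# One small FCC argon cell, converted to a batched torch-sim state.
state = ts.io.atoms_to_state([bulk("Ar", "fcc", a=5.4)], device=device, dtype=dtype)

final_state = ts.optimize(
    system=state,
    model=model,
    optimizer=ts.Optimizer.lbfgs,  # the new optimizer; BFGS would be ts.Optimizer.bfgs
    max_steps=500,
    convergence_fn=ts.generate_force_convergence_fn(force_tol=0.02),
    init_kwargs={"max_history": 20, "step_size": 0.5},
)
print(final_state.positions.shape)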

Checklist

Before a pull request can be merged, the following items must be checked:

  • Doc strings have been added in the Google docstring format.
  • Run ruff on your code.
  • Tests have been added for any new functionality or bug fixes.

@abhijeetgangan (Collaborator, Author)

See detailed comment here.

@orionarcher (Collaborator) left a comment

Going to leave the physics review to someone else, just one API comment

Comment thread torch_sim/optimizers/state.py
@curtischong (Collaborator) left a comment

Just learned the algorithms today, so I may have missed things. Great work, especially since the ASE implementation isn't well documented.

Comment thread torch_sim/optimizers/bfgs.py Outdated
Comment thread torch_sim/optimizers/lbfgs.py Outdated
Comment thread torch_sim/optimizers/lbfgs.py Outdated
Comment thread torch_sim/optimizers/lbfgs.py Outdated
sys_max = torch.zeros(state.n_systems, device=device, dtype=dtype)
sys_max.scatter_reduce_(0, state.system_idx, norms, reduce="amax", include_self=False)

# Scaling factors per system: <= 1.0

Collaborator

Consider saying "# Scale down step so atoms move at most max_step" or "# Scale step if it exceeds max_step", like you mention in bfgs.
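
For readers following the thread, here is a self-contained toy version of the per-system clamping the quoted snippet implements. The shapes and max_step value are made up for illustration; only norms, system_idx, sys_max, and the scatter_reduce_ call come from the diff above.

import torch

# Toy setup: 5 atoms split across 2 systems (values are illustrative).
steps = torch.randn(5, 3)                   # proposed per-atom displacements
system_idx = torch.tensor([0, 0, 0, 1, 1])  # system each atom belongs to
max_step = 0.2                              # largest allowed per-atom move

norms = torch.norm(steps, dim=-1)  # per-atom displacement lengths
sys_max = torch.zeros(2)
sys_max.scatter_reduce_(0, system_idx, norms, reduce="amax", include_self=False)

# Scale down each system's step so its largest atom move is at most max_step.
scale = torch.clamp(max_step / sys_max, max=1.0)
steps = steps * scale[system_idx].unsqueeze(-1)

assert torch.norm(steps, dim=-1).max() <= max_step + 1e-12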

Comment thread torch_sim/optimizers/bfgs.py Outdated
@abhijeetgangan (Collaborator, Author)

Thanks for the comments. I will update those over the weekend.

@orionarcher (Collaborator) commented Jan 16, 2026

@abhijeetgangan is this still on the docket for you?

I know it's a much requested feature so it'd be awesome to get it in. Thanks for your time on it.

@abhijeetgangan (Collaborator, Author)

Will prioritize this on the weekend.

@CompRhys added the keep-open label (PRs to be ignored by StaleBot) on Jan 20, 2026
@abhijeetgangan (Collaborator, Author)

Hi @sihoonchoi and @Andrew-S-Rosen, the integration is pretty much done. I also tested on 1K structures from WBM with MACE, comparing FIRE and L-BFGS (with cell filters):

For fmax = 0.02 eV/Å:

Optimizer    Time (s)    Converged    Max Force (eV/Å)
FIRE         288.55      1000/1000    0.0200
L-BFGS       101.91      1000/1000    0.0196

For fmax = 1e-4 eV/Å:

Optimizer    Time (s)    Converged    Max Force (eV/Å)
FIRE         744.09      995/1000     0.0145
L-BFGS       192.27      1000/1000    0.0001

Clearly, FIRE can sometimes struggle to relax structures, especially for small fmax; it did not converge all systems even after 500 steps. BFGS is working but needs line search to be competitive; that's going to be a separate PR. I am going to merge this since the tests are very comprehensive, but it would be nice if someone could try it out.

import time

import torch
from mace.calculators.foundations_models import mace_mp
from matbench_discovery.data import DataFiles, ase_atoms_from_zip

import torch_sim as ts
from torch_sim.models.mace import MaceModel


# Benchmark settings: structure count, optimizer step cap, and force tolerance
N_STRUCTURES = 1000
MAX_STEPS = 500
FORCE_TOL = 1e-4

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
dtype = torch.float64
# Small MACE-MP foundation model; stress is needed for cell relaxation
raw_mace = mace_mp(model="small", return_raw_model=True, default_dtype="float64")
mace_model = MaceModel(model=raw_mace, device=device, dtype=dtype, compute_stress=True)

ase_atoms_list = ase_atoms_from_zip(DataFiles.wbm_initial_atoms.path, limit=N_STRUCTURES)
initial_state = ts.io.atoms_to_state(ase_atoms_list, device=device, dtype=dtype)

# Converged once all forces (including cell forces) drop below FORCE_TOL
convergence_fn = ts.generate_force_convergence_fn(
    force_tol=FORCE_TOL, include_cell_forces=True
)

optimizers = [
    ("FIRE", ts.Optimizer.fire, {}),
    ("L-BFGS", ts.Optimizer.lbfgs, {"max_history": 20, "step_size": 0.5}),
]

results = {}

for name, optimizer, init_kwargs in optimizers:
    print(f"Running {name}...")

    state = initial_state.clone()

    # Pack systems into GPU batches up to a memory budget, swapping in
    # fresh systems as converged ones are retired
    autobatcher = ts.autobatching.InFlightAutoBatcher(
        model=mace_model,
        memory_scales_with="n_atoms",
        max_memory_scaler=600.0,
        max_iterations=MAX_STEPS // 2,
    )

    if device.type == "cuda":
        torch.cuda.synchronize()
    start_time = time.perf_counter()

    final_state = ts.optimize(
        system=state,
        model=mace_model,
        optimizer=optimizer,
        max_steps=MAX_STEPS,
        convergence_fn=convergence_fn,
        init_kwargs=dict(cell_filter=ts.CellFilter.frechet, **init_kwargs),
        autobatcher=autobatcher,
    )

    if device.type == "cuda":
        torch.cuda.synchronize()
    elapsed = time.perf_counter() - start_time

    max_forces = torch.norm(final_state.forces, dim=-1).max().item()
    converged = convergence_fn(final_state, None)
    n_converged = int(converged.sum().item())

    results[name] = {
        "time": elapsed,
        "max_force": max_forces,
        "n_converged": n_converged,
    }

    n_total = len(ase_atoms_list)
    print(
        f"Time: {elapsed:.2f}s | Converged: {n_converged}/{n_total} | "
        f"Max force: {max_forces:.4f} eV/Å"
    )

print("SUMMARY")
print(f"{'Optimizer':<12} {'Time (s)':<12} {'Converged':<15} {'Max Force (eV/Å)':<15}")

for name, data in results.items():
    print(
        f"{name:<12} {data['time']:<12.2f} "
        f"{data['n_converged']}/{len(ase_atoms_list):<13} {data['max_force']:<15.4f}"
    )

@CompRhys linked an issue on Feb 4, 2026 that may be closed by this pull request
@abhijeetgangan merged commit be3da21 into main on Feb 4, 2026
67 checks passed
@abhijeetgangan deleted the ag/opt_lbfgs_bfgs branch on February 4, 2026 at 19:09
@sihoonchoi

@abhijeetgangan I've tried L-BFGS and BFGS, along with FIRE, with UMA-OMC on 100 structures and they work well. This is great - thanks for your work!

Optimizer    Time (s)    Converged
FIRE         838.81      100/100
L-BFGS       530.74      100/100
BFGS         545.62      100/100

Below is a comparison with serial relaxations in ASE.

Optimizer    Time (s)
FIRE         1670.04
L-BFGS       1001.56
BFGS         1133.86

@abhijeetgangan (Collaborator, Author)

@sihoonchoi Thanks for looking into it. This is very valuable!

There are more improvements that can be made on top of this, like introducing line search and trust regions; some of these are batching friendly and some are not. I am pretty sure the optimization algorithms can be made more robust with them (ASE doesn't use many of these known tricks). I would be curious what the community thinks, but given that there are no benchmarks in matsci it's hard to make comparisons. Maybe the folks at Rowan can set this up? I would like to know what others think.
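
To make the batching concern concrete, here is a rough sketch of a scalar Armijo backtracking line search. This is not this PR's code; energy_fn is a stand-in interface rather than torch-sim's API.

import torch

def backtracking_line_search(energy_fn, x, p, g,
                             alpha0=1.0, rho=0.5, c=1e-4, max_trials=10):
    """Shrink the trial step until the Armijo sufficient-decrease test passes.

    energy_fn maps positions to a scalar energy (a stand-in, not torch-sim's
    API); x are positions, p is the search direction, g the gradient at x.
    """
    e0 = energy_fn(x)
    slope = torch.dot(g.flatten(), p.flatten())  # directional derivative; < 0 for descent
    alpha = alpha0
    for _ in range(max_trials):
        if energy_fn(x + alpha * p) <= e0 + c * alpha * slope:
            break         # sufficient decrease achieved
        alpha *= rho      # otherwise shrink and try a smaller step
    return alpha

The data-dependent loop is the batching-unfriendly part: each system may need a different number of trial energy evaluations, so in a batched setting every system has to wait for the slowest one.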

@Andrew-S-Rosen (Contributor) commented Feb 6, 2026

I personally find it a bit wild that we don't have good benchmarks on this, but I guess that's historically because DFT is expensive to run. Rowan doesn't have a ton of infra for materials, but they have obviously thought about this.

Actually, my group has a funded NSF collaboration with the Argonne folks supporting TorchSim (Ben Blaiszik et al) that might be relevant here. That might be a good avenue to kickstart this. I'd be interested in seeing if we can start something through that and open it up to the community to participate.

@abhijeetgangan (Collaborator, Author) commented Feb 6, 2026

One thing I always found confusing was comparing MLIPs vs DFT for relaxation. MLIPs typically use FIRE, but VASP uses CG and QE uses BFGS, both with line search and trust regions, which makes them very different. Another aspect is the cost: I am sure the devs of DFT codes chose these methods to reduce the number of DFT calls during relaxation. I think a fair comparison would be ideal, but that means moving away from exact ASE behavior.

> Actually, my group has a funded NSF collaboration with the Argonne folks supporting TorchSim (Ben Blaiszik et al) that might be relevant here. That might be a good avenue to kickstart this. I'd be interested in seeing if we can start something through that and open it up to the community to participate.

I think this is a great idea! Maybe we can start an email thread to discuss this? I can add more optimizers to test.

@janosh mentioned this pull request on Feb 6, 2026
@Andrew-S-Rosen (Contributor)

> One thing I always found confusing was comparing MLIPs vs DFT for relaxation. MLIPs typically use FIRE, but VASP uses CG and QE uses BFGS, both with line search and trust regions, which makes them very different. Another aspect is the cost: I am sure the devs of DFT codes chose these methods to reduce the number of DFT calls during relaxation. I think a fair comparison would be ideal, but that means moving away from exact ASE behavior.

I think the reason people have been using MLIPs with FIRE is simply a cultural thing, not based on much other than the original FIRE paper. FIRE isn't in VASP (without external plugins), but yeah, CG is simple enough and works.

> I think this is a great idea! Maybe we can start an email thread to discuss this? I can add more optimizers to test.

Sure, happy to kick off an email tomorrow. Can you send me an email from your account? I'm not sure what your email is.

@abhijeetgangan (Collaborator, Author)

> Sure, happy to kick off an email tomorrow. Can you send me an email from your account? I'm not sure what your email is.

Sounds good! abhijeetgangan@g.ucla.edu

Labels

keep-open (PRs to be ignored by StaleBot)

Development

Successfully merging this pull request may close these issues.

Implement LBFGS optimizer

6 participants