Accelerated ts.static + added scaling scripts #427
Conversation
torch_sim/autobatching.py
Outdated
```python
bbox[i] += 2.0
volume = bbox.prod() / 1000  # convert A^3 to nm^3
number_density = state.n_atoms / volume.item()
# Use cell volume (O(1)); SimState always has a cell. Avoids O(N) position scan.
```
Non-periodic systems don't have a sensible cell; see #412.
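One hedged way to reconcile the two concerns (a sketch only: `state.pbc`, `state.cell`, and `state.positions` are assumed attribute names, and the bounding-box padding mirrors the diff above):

```python
import torch

def estimate_volume_nm3(state) -> float:
    """Volume in nm^3; attribute names on `state` are assumptions."""
    if getattr(state, "pbc", False):
        # Periodic: O(1) determinant of the 3x3 cell matrix (A^3).
        volume_a3 = torch.linalg.det(state.cell.squeeze()).abs()
    else:
        # Non-periodic: O(N) bounding box of the positions, padded by
        # 2 A per side as in the original bbox logic.
        bbox = state.positions.max(dim=0).values - state.positions.min(dim=0).values
        bbox = bbox + 2.0
        volume_a3 = bbox.prod()
    return volume_a3.item() / 1000.0  # A^3 -> nm^3
```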
I've now minimized the differences from the initial code.
In addition, I added explicit tests for the memory scaler values and verified that the tests still pass with the changes in this PR.
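A minimal sketch of what such a test could look like (the `batched_state` fixture, the `"n_atoms"` metric, and a per-state `calculate_memory_scaler` counterpart are all assumptions; only `calculate_batched_memory_scalers` appears in the diff below):

```python
import torch

from torch_sim.autobatching import (
    calculate_batched_memory_scalers,  # from the diff in this PR
    calculate_memory_scaler,  # assumed per-state counterpart
)

def test_batched_scalers_match_per_state(batched_state):
    """batched_state: any batched SimState fixture (assumption)."""
    batched = calculate_batched_memory_scalers(batched_state, "n_atoms")
    per_state = [
        calculate_memory_scaler(s, "n_atoms") for s in batched_state.split()
    ]
    assert torch.allclose(
        torch.as_tensor(batched, dtype=torch.float64),
        torch.as_tensor(per_state, dtype=torch.float64),
    )
```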
```python
self.memory_scalers = calculate_batched_memory_scalers(
    states, self.memory_scales_with
)
self.state_slices = states.split()
```
batching makes sense here
```python
if isinstance(states, SimState):
    self.batched_states = [[states[index_bin]] for index_bin in self.index_bins]
```
state.split() is identical to this and faster
Reusing `self.state_slices` instead of calling `states.split()` again makes the code 5% faster, so I'd keep it.
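A sketch of the split-once-and-reuse pattern being defended here (the class context and attribute names follow the diff hunks above, and it assumes each `index_bin` is an iterable of integer state indices; both are assumptions):

```python
# Split the batched state once and cache the per-state slices.
if isinstance(states, SimState):
    self.state_slices = states.split()  # one O(n_states) split
else:
    self.state_slices = list(states)

# Later, build the bins from the cached slices instead of calling
# states.split() (or indexing with states[index_bin]) a second time.
self.batched_states = [
    [self.state_slices[i] for i in index_bin] for index_bin in self.index_bins
]
```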
Force-pushed from 3138aed to e91fe92.
examples/scaling/__init__.py
Outdated
this file isn't needed
```python
    )
    self.state_slices = states.split()
else:
    self.state_slices = states
```
Why not concat and then call the batched logic?
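A hedged sketch of that suggestion (assuming a `concatenate_states` helper exists in `torch_sim.state` for joining a list of SimStates; treat the name and import path as assumptions):

```python
from torch_sim.state import SimState, concatenate_states  # path assumed

# Normalize the input: if we got a list of states, concatenate it into
# one batched SimState so a single batched code path handles both cases.
if not isinstance(states, SimState):
    states = concatenate_states(list(states))

self.memory_scalers = calculate_batched_memory_scalers(
    states, self.memory_scales_with
)
self.state_slices = states.split()
```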
Summary
Changes:
The figure below shows the speedup achieved for static evaluations, 10-step atomic relaxation, 10-step NVE MD, and 10-step NVT MD. Prior results are shown in blue, while new results are shown in red. The speedup is calculated as
speedup (%) = (baseline_time / current_time − 1) × 100 (a quick worked example follows the list below). We observe that:

- `ts.static` achieves a 43.9% speedup for 100,000 structures
- `ts.relax` achieves a 2.8% speedup for 1,500 structures
- `ts.integrate` (NVE) achieves a 0.9% speedup for 10,000 structures
- `ts.integrate` (NVT) achieves a 1.4% speedup for 10,000 structures
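As a quick sanity check of the formula (the numbers here are hypothetical, chosen only to illustrate the definition):

```python
def speedup_percent(baseline_time: float, current_time: float) -> float:
    # speedup (%) = (baseline_time / current_time - 1) * 100
    return (baseline_time / current_time - 1.0) * 100.0

# Hypothetical: a baseline of 122 s reduced to 84.8 s is a ~43.9% speedup.
print(round(speedup_percent(122.0, 84.8), 1))  # 43.9
```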
Comments:

From the scaling plots, we can see that the timings of `ts.static` and `ts.integrate` are all consistent with each other. Indeed:

- `ts.static` → 85 s for 100,000 evaluations
- `ts.integrate` NVE → 87 s for 10,000 structures (10 MD steps each) → 87 s for 100,000 evaluations
- `ts.integrate` NVT → 89 s for 10,000 structures (10 MD steps each) → 89 s for 100,000 evaluations

However, when looking at the relaxation:
- `ts.relax` → 63 s for 1,000 structures (10 relax steps each) → 63 s for 10,000 evaluations → ~630 s for 100,000 evaluations
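Normalizing these timings to a per-evaluation cost (evaluations = structures × steps, using only the numbers quoted above) makes the gap explicit:

```python
# Per-evaluation cost from the timings quoted above.
runs = {
    "ts.static": (85.0, 100_000 * 1),  # 100,000 single-point evaluations
    "ts.integrate (NVE)": (87.0, 10_000 * 10),  # 10,000 structures x 10 steps
    "ts.integrate (NVT)": (89.0, 10_000 * 10),
    "ts.relax": (63.0, 1_000 * 10),  # 1,000 structures x 10 relax steps
}
for name, (seconds, n_evals) in runs.items():
    print(f"{name}: {seconds / n_evals * 1e3:.2f} ms/evaluation")
# ts.static / ts.integrate land at ~0.85-0.89 ms; ts.relax at ~6.3 ms.
```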
So `ts.relax` is about 7x slower than `ts.static` or `ts.integrate` (~6.3 ms vs ~0.85 ms per evaluation). The unbatched FrechetCellFilter clearly contributes to that. I'm wondering whether there are additional bottlenecks in the code that we could optimize to reduce that massive 7x cost.