
Conversation

@varshith15
Member

  • This PR adds a timing cache, which improves build times significantly.
  • With the timing cache, UNet TensorRT compilation takes 70 s, ControlNet takes 25 s, and the VAE encoder/decoder takes 1.5 s.
  • The timing cache file is shared across all models; it records the fastest kernel for each exact requirement.
    For instance: a conv2d for shape 128 x 128 doesn't need to be searched for again once it is in the cache.
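To make the idea above concrete, here is a minimal, self-contained sketch of a timing cache: a file mapping each exact requirement (op + shape) to its fastest kernel, so a second build skips the search. The kernel names, `benchmark_kernel`, and `build_with_timing_cache` are illustrative stand-ins, not this PR's code or TensorRT's API (in TensorRT the cache is managed through `IBuilderConfig.set_timing_cache` and serialized via `ITimingCache.serialize`):

```python
import json
import os

# Hypothetical kernel candidates per (op, shape); in TensorRT these would be
# the tactic implementations the builder benchmarks during autotuning.
KERNEL_CANDIDATES = {
    ("conv2d", "128x128"): ["im2col", "winograd", "fft"],
    ("conv2d", "64x64"): ["im2col", "winograd"],
}

def benchmark_kernel(op, shape, kernel):
    """Stand-in for an expensive autotuning measurement (deterministic fake)."""
    return sum(map(ord, op + shape + kernel)) % 100

def build_with_timing_cache(ops, cache_path):
    """Pick the fastest kernel per (op, shape), reusing cached results.

    Mirrors the idea in this PR: the cache maps an exact requirement
    (e.g. conv2d at 128x128) to its fastest kernel, so a second build
    with the same cache file performs zero searches.
    """
    cache = {}
    if os.path.exists(cache_path):
        with open(cache_path) as f:
            cache = json.load(f)

    plan, searches = {}, 0
    for op, shape in ops:
        key = f"{op}:{shape}"
        if key not in cache:
            # Cache miss: benchmark all candidates and remember the winner.
            searches += 1
            candidates = KERNEL_CANDIDATES[(op, shape)]
            cache[key] = min(candidates, key=lambda k: benchmark_kernel(op, shape, k))
        plan[key] = cache[key]

    # Persist the (possibly updated) cache so later builds benefit.
    with open(cache_path, "w") as f:
        json.dump(cache, f)
    return plan, searches
```

The first build pays for the searches; any later build against the same cache file, even for a different model, reuses every entry whose requirement matches exactly.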

Comment on lines +254 to +255
if isinstance(timing_cache, (str, Path)) and not os.path.exists(timing_cache):
load_timing_cache = None
Member

@victorges victorges Dec 15, 2025


So we add a default value for the timing cache, but if it isn't created by something else we don't use it? Can you add instructions on how to use this timing cache? Ideally it should be completely transparent to the library user (including creating the cache, and maybe not leaving any dangling files behind?).
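One way to reconcile the guard on lines +254 to +255 with the transparency the review asks for is to treat the path as both a load target and a save target. A sketch under assumed names (`resolve_timing_cache` and `DEFAULT_TIMING_CACHE` are hypothetical, not from the PR):

```python
import os
from pathlib import Path

# Hypothetical default location; the PR's actual default may differ.
DEFAULT_TIMING_CACHE = Path("timing.cache")

def resolve_timing_cache(timing_cache=DEFAULT_TIMING_CACHE):
    """Return (load_path, save_path) so the cache is transparent to callers.

    Mirrors the guard under review: if the cache file does not exist yet,
    skip loading it -- but still save to that path after the build, so the
    very next build finds and reuses it with no user action.
    """
    load_timing_cache = timing_cache
    if isinstance(timing_cache, (str, Path)) and not os.path.exists(timing_cache):
        load_timing_cache = None  # nothing to load on the first build
    save_timing_cache = timing_cache  # always persist after building
    return load_timing_cache, save_timing_cache
```

With this shape, the first build silently creates the file and every later build loads it; passing `timing_cache=None` (or deleting the file afterwards) would cover the "no dangling files" concern.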

