Draft: first draft of an AOT autotuning runner and cache and heuristics gener… #1278
base: main
Conversation
```
@@ -0,0 +1,105 @@
#!/usr/bin/env python3
```
Thinking about Horace's example, and curious: would it make sense to support a "benchmark only" mode in the collect phase (or a separate phase) that skips autotuning and just measures existing configs against additional shapes (similar to secondary_inputs in Horace's RFC)? This would let users:
- Run `collect` on a small set of representative shapes (to do full autotune on)
- Run benchmark-only on a larger set of shapes (just measure)
- Build heuristics using the full timing matrix
The script already lets you specify different benchmarks for all three phases of measurement, so you can collect on a different benchmark than the one you measure.
Is that what you're asking about, or something else?
Ah nice! Curious: should we add an example showing this workflow? cc @Chillee: would this cover the original need of primary_inputs / secondary_inputs?
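For readers following along, here is a minimal sketch of the "tune few, measure many" split discussed in this thread. All names (`build_timing_matrix`, `autotune`, `benchmark`) are illustrative; the runner's real datastore and APIs differ.

```python
def build_timing_matrix(tune_shapes, extra_shapes, autotune, benchmark):
    """Sketch only: shapes are assumed to be hashable (e.g. tuples)."""
    # Collect phase: full autotune only on the representative shapes.
    candidates = [autotune(shape) for shape in tune_shapes]
    # Measure (benchmark-only) phase: time every collected config across
    # the combined shape set, with no further autotuning.
    return {
        (shape, config): benchmark(shape, config)
        for shape in [*tune_shapes, *extra_shapes]
        for config in candidates
    }
    # The heuristic-generation phase would then fit a model on this matrix.
```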
```
The AOT workflow consists of three phases:
1. Collect: Run benchmarks, autotuning each shape individually
2. Measure: Re-run benchmarks, measuring all configs across all shapes
```
(As we discussed, maybe "all shapes" is not exactly accurate, since the user can customize which shapes to run in each phase.)
| """Represents a unique shape/dtype combination for a kernel.""" | ||
|
|
||
| kernel_name: str | ||
| specialization_key: tuple[Any, ...] |
So ShapeKey may correspond to a partially specialized shape? If so:
- Can ShapeKeys collide, i.e. can an input shape be matched against multiple ShapeKeys?
- How much confidence can we have that a config that's best for a ShapeKey actually works well for an input shape that's wildly different at runtime? Take your tall-skinny / short-wide tensor as an example: if the first dimension changes by 100x while the second dimension stays the same, it would jump from tall-skinny to short-wide or vice versa.
The datastore stores all shape information (all shapes, strides, and data types) in addition to this.
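For reference, a minimal sketch of how the excerpted fields might fit together as a dataclass; only `kernel_name` and `specialization_key` appear in the diff above, so the decorator, import, and comments are assumptions:

```python
from dataclasses import dataclass
from typing import Any

@dataclass(frozen=True)  # assumed; the PR excerpt only shows the two fields
class ShapeKey:
    """Represents a unique shape/dtype combination for a kernel."""

    kernel_name: str
    # Hashable summary of the specialized properties (e.g. shapes, strides,
    # dtypes); the exact contents are not shown in the excerpt.
    specialization_key: tuple[Any, ...]
```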
```
return model, accuracy, feature_names


def generate_heuristic_code(
```
Is the generated heuristic-based selection code ultimately what users are supposed to call in deployment?
This may help tackle one of the challenges in Helion + vLLM, namely selecting the best config based on batch_size, which varies per token. Note that it would have a fairly high bar for latency, since it is triggered per token, so invoking a chunk of (complex?) Python logic might be too slow.
Yes. This means it is interpretable and can be version-controlled. Depending on how complex it gets, we could compose it with the existing caching logic to cache the heuristic result.
@gmagogsfm this is what an example decision tree output might look like: https://gist.github.com/v0i0/d6604662d7095a040ce0db049e192c14
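To make the two comments above concrete, here is a hypothetical sketch of a generated decision-tree heuristic with a cache composed around it. The function names, the `CONFIGS` table, and the branch thresholds are invented for illustration, not taken from the linked gist:

```python
import functools

# Placeholder table of candidate configs assumed to be emitted alongside
# the generated heuristic.
CONFIGS = ["config_0", "config_1", "config_2"]

def _decide(m: int, n: int) -> int:
    # Learned decision tree (illustrative thresholds); each branch maps a
    # shape region to an index into CONFIGS.
    if m <= 256:
        return 0 if n <= 1024 else 1
    return 2

@functools.lru_cache(maxsize=None)
def select_config(m: int, n: int):
    # Caching the heuristic result per shape reduces the per-token cost to
    # a dict lookup after the first call, addressing the latency concern.
    return CONFIGS[_decide(m, n)]
```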
Force-pushed from 3f0ced4 to 15f568d
proposal for #1161
WIP, needs more testing & features
Basic idea: you run `python -m helion.autotuner.aot_runner --benchmark "python your_benchmark.py"` and you get files called `_{filename}_{arch}.py` containing the heuristic that maps shapes to configs.
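As a hypothetical illustration of how such a generated file might be consumed (the PR does not specify the generated file's interface, so the module and function names below are invented):

```python
# Invented example: assumes the generated _{filename}_{arch}.py exposes a
# shape -> config selection function; the real interface may differ.
from _my_kernel_sm90 import select_config  # file produced by aot_runner

config = select_config(4096, 512)  # pick the pre-tuned config for this shape
```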