Lean kernel-artifact telemetry for cost-model dataset#2770
Lean kernel-artifact telemetry for cost-model dataset#2770IshanAryendu wants to merge 6 commits into
Conversation
…he decorator already computed for sample_id and record it per row, completing the kernel-artifact set (source, input shapes, decorator).
9abb150 to
1cd901e
Compare
ef76d6f to
bedcb75
Compare
|
Claude finished @IshanAryendu's task in 7m 8s —— View job Code Review: Lean kernel-artifact telemetry
Overall this is a clean, well-reasoned refactor. The schema is genuinely leaner, the 1.
|
- Skip data collection for searches restricted to user-pinned configs
(configs=[...] without force_autotune): no .meta.jsonl/.csv written.
- Store full helion.settings in the per-run .meta.jsonl record (JSON-safe).
- Store per-run config_defaults (the ConfigSpec defaults) so a consumer can
rebuild each minimized row's config via {**config_defaults, **row_config}.
- Drop kernel_id/sample_id/decorator; derive run_id directly from
(kernel_source, codegen-settings signature, shapes, dtypes, hardware).
- Single source of truth for the codegen signature (codegen_decorator_parts),
shared by format_kernel_decorator and run_id so they cannot drift.
- Store per-config 'config' as minimized, compact one-line JSON.
- Minimize/record only when a log sink is active.
- Remove now-unused Kernel.kernel_id()/_settings_signature().
7d22677 to
69b7af1
Compare
Tightens the autotuner's kernel-artifact telemetry into a clean dataset for an offline cost model (predict a config's performance for a kernel without running it). Three changes:
Every row in the dataset must be a representative (kernel, config) → measured perf sample, and the files should carry only what a consumer can't trivially derive. The prior schema (a) emitted rows for config-restricted runs, biasing the training distribution; (b) kept no record of the codegen-affecting settings as analyzable fields; and (c) carried three content-hash IDs where only one (run_id, the join key) is structurally needed, plus a decorator string that duplicates config + settings.
Tests: test/test_kernel_metadata.py + test/test_autotuner.py (123 passed).