Replace HPO framework NNI with Optuna and update CLI by WenjieDu · Pull Request #852 · WenjieDu/PyPOTS

WenjieDu · 2026-05-03T20:44:15Z

What does this PR do?

fixing Replace Microsoft NNI with another hyperparameter optimization toolkit #589;

Before submitting

This PR is made to fix a typo or improve the docs (you can dismiss the other checks if this is the case);
Was this discussed/approved via a GitHub issue? Please add a link to it if that's the case;
I have commented my code, particularly in hard-to-understand areas;
I have written the necessary tests and already run them locally;

Add comprehensive CLI commands enabling users to operate all PyPOTS functionalities from the command line: New commands: - train: Train models from YAML/JSON config files with CLI overrides - predict: Run inference with saved .pypots models using config for correct model architecture reconstruction - evaluate: Evaluate predictions against ground truth with task-specific metrics (MSE, MAE, RMSE, accuracy, F1, etc.) - data: Convert/split/describe datasets (H5, CSV, NumPy, Pickle) - model: List/describe/inspect/generate-config for 100+ models across 6 task types - tune: Improved HPO wrapper with NNI integration and config files - info: Show environment, version, device, and model count information - benchmark: Compare multiple models on the same dataset with metrics Architecture: - Config-first design: YAML/JSON configs as primary input, CLI args override - Lazy model imports via importlib for fast CLI startup - Dynamic model registry using __all__ from each task module - Parameter filtering via inspect.signature for safe model instantiation - All commands extend existing BaseCommand ABC pattern Also adds unit tests for all 8 commands (16 test cases total). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Replace the deprecated Microsoft NNI framework with Optuna for all hyperparameter optimization functionality. Key changes: - pypots/base.py: Remove NNI import/reporting, add optional optuna_trial parameter to BaseNNModel for in-training pruning support - pypots/cli/tune.py: Complete rewrite using Optuna study.optimize() with in-process objective function (no more separate trial processes) - pypots/cli/hpo.py: Removed (NNI trial runner no longer needed) - 3 model files (usgan, vader, crli): Replace NNI reporting with Optuna trial.report() + should_prune() pattern - requirements.txt: Add optuna dependency - Docs: Update NNI references to Optuna in README.md, README_zh.md, docs/index.rst New Optuna config format uses int/float/categorical search space types with low/high/choices params, and supports TPE/Random/CmaEs/Grid samplers plus MedianPruner/PercentilePruner/HyperbandPruner pruners. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Add 'data list' action to list 260+ benchmark datasets from TSDB - Add 'data load' action to download, preprocess, and save benchmark datasets as train/val/test H5 splits via benchpots - Support dataset-specific params: --subset, --rate, --n_steps, --pattern - Use inspect.signature() to filter kwargs for each preprocess function - Add tests for list and load actions (physionet_2012) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

- Replace all 11 command files from argparse class-based pattern to Click decorators (@click.command, @click.group, @click.option) - Convert data and model commands to Click groups with subcommands - Remove BaseCommand ABC class; keep execute_command() and check_if_under_root_dir() as module-level functions in base.py - Rename merge_config_with_args to merge_config_with_overrides (takes dict) - Rewrite pypots_cli.py entry point as Click group with cli.add_command() - Update all 11 test files to use Click's CliRunner - Add 'click' to requirements.txt - Fix info.py: replace NNI with Optuna in optional dependencies list - Net reduction: -1075 lines (from 3007 removed, 1932 added) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

1. CLI startup: ~7s → <100ms (97x faster) - Make pypots/__init__.py use lazy imports via __getattr__ - Implement LazyGroup in pypots_cli.py for deferred command loading - Move heavy imports (torch, numpy, transformers, tsdb) from module level to inside command functions across all 11 CLI modules 2. HPO reproducibility: Reset random seed before each Optuna trial - Call set_random_seed(seed) at the start of each trial's objective() - Ensures identical hyperparams produce identical model initialization 3. Model file metadata: Enrich .pypots files with: - model_class: The model class name (e.g., 'SAITS') - hyperparameters: All JSON-serializable model constructor parameters - save_timestamp: ISO 8601 timestamp of when the model was saved - Version compatibility warnings on load when versions differ - Enhanced 'model inspect' CLI command to display all metadata - Full backward compatibility with older .pypots files Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…s-pypots integration Add three key CLI capabilities that bridge the gap between ai4ts CSV data format and PyPOTS H5 model input format: - data prepare: Converts CSV files (with SAMPLE_ID, features, CLAF_TARGET) to PyPOTS-compatible H5 with proper 3D arrays (X, X_ori, y). Supports batch mode (--train/--val/--test) and single-file mode. Handles SAMPLE_ID grouping, artificial missing injection for val/test sets, and label extraction. - data describe: Enhanced to accept CSV files in addition to H5, with --json flag for machine-readable output. Shows n_samples, n_steps, n_features, missing_rate, labels, and per-feature missing rates. - recommend: New command that suggests model hyperparameters based on data properties. Accepts CSV or H5 files, auto-detects data dimensions, and generates ready-to-use YAML config files with appropriate hyperparameters for all 5 supported models (SAITS, TimesNet, TEFN, CRLI, TimeMixer). Updated pypots-tsa and ai4ts-skills with unified CLI-only workflow. Added 10 new tests (all passing). Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…tegration - Add 'data profile' subcommand: analyzes CSV datasets and outputs DataProfile JSON with sample statistics, schema mapping, timestamp info, and recommended windowing strategy - Add 'data reconstruct' subcommand: reverses windowing transformation using window registry to reconstruct original-shape data from model predictions (strips padding, reassembles by sample ID) - Enhance 'data prepare': integrates ai4ts pipeline for intelligent variable-length sample handling with automatic strategy selection (pad_only/direct/sliding_window) and window registry generation - Add CLI tests for profile, profile --json, prepare with registry, and end-to-end reconstruct (tests 10-13) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…ng_mask - Auto-compute indicating_mask when ground truth X_ori has NaN (natural missing) - Evaluate only on artificially masked positions (observed in X_ori, missing in X) - Replace NaN in targets with 0 at non-evaluated positions to pass metric assertions - Add informative log message showing number of evaluated vs excluded positions - Include test_set in recommend config auto-discovery (was missing train/val only) Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

…er calls Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

# Conflicts: # README_zh.md

sonarqubecloud · 2026-05-04T14:15:47Z

Quality Gate failed

Failed conditions
1 Security Hotspot
E Security Rating on New Code (required ≥ A)

See analysis details on SonarQube Cloud

Catch issues before they fail your Quality Gate with our IDE extension SonarQube for IDE

coveralls · 2026-05-04T14:27:17Z

Coverage Report for CI Build 25324056397

Warning

Build has drifted: This PR's base is out of sync with its target branch, so coverage data may include unrelated changes.
Quick fix: rebase this PR. Learn more →

Coverage decreased (-0.3%) to 79.831%

Details

Coverage decreased (-0.3%) from the base build.
Patch coverage: 526 uncovered changes across 20 files (1343 of 1869 lines covered, 71.86%).
6 coverage regressions across 2 files.

Uncovered Changes

Top 10 Files by Coverage Impact	Changed	Covered	%
pypots/cli/data.py	600	422	70.33%
pypots/cli/model.py	138	62	44.93%
pypots/cli/benchmark.py	175	126	72.0%
pypots/cli/evaluate.py	124	75	60.48%
pypots/cli/tune.py	183	134	73.22%
pypots/cli/utils.py	88	68	77.27%
pypots/cli/pypots_cli.py	17	0	0.0%
pypots/cli/recommend.py	161	146	90.68%
pypots/cli/train.py	74	62	83.78%
pypots/cli/info.py	52	41	78.85%

Coverage Regressions

6 previously-covered lines in 2 files lost coverage.

File	Lines Losing Coverage	Coverage
pypots/timeseries_ai/client.py	4	0.0%
pypots/timeseries_ai/init.py	2	0.0%

Coverage Stats


Relevant Lines:	20755
Covered Lines:	16569
Line Coverage:	79.83%
Coverage Strength:	1.6 hits per line

💛 - Coveralls

WenjieDu and others added 30 commits March 2, 2026 02:17

docs: update docs;

42e6391

docs: update docs;

504227a

Merge branch 'dev' into (docs)update

295a346

docs: add citation of TKAN;

4ce5cd3

docs: update docs;

cb09995

docs: update docs;

a0b515d

Merge branch 'dev' into (docs)update

61f3016

docs: add ask deepwiki;

48e9b42

docs: update the publication info of TEFN;

414fe90

docs: remove about page;

2f1ffc7

docs: update citing information of SegRNN;

a347acc

docs: update citing information of SegRNN;

56d74d0

Merge branch 'main' into (docs)update

780d328

docs: update pypots citation info;

25f8eee

docs: update abbr of task names;

8844b87

docs: update docs;

2b64bd5

docs: update docs;

81f6988

docs: add developer docs;

55bc523

refactor: clean up CLI test files - remove xfail markers, fix CliRunn…

c2cae90

…er calls Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

docs: rename sec examples into tutorials;

99aa2eb

Merge branch '(docs)update' into (feat)v2_cli+tuner+login

d560096

Merge branch 'main' into (docs)update

3d3e351

# Conflicts: # README_zh.md

WenjieDu added 6 commits April 8, 2026 04:16

Merge branch 'main' into (docs)update

c3fca4b

Merge branch '(docs)update' into (feat)v2_cli+tuner+login

0d534f8

Merge branch 'dev' into (feat)v2_cli+tuner+login

e8a484a

Merge branch 'dev' into (feat)update_cli_hpo

0c89f8e

fix: set weights_only= False;

e1a5a97

test: downgrade weights_only to False;

3e33042

WenjieDu merged commit f3aaa9d into dev May 4, 2026
2 of 4 checks passed

WenjieDu deleted the (feat)update_cli_hpo branch May 4, 2026 16:59

WenjieDu mentioned this pull request May 6, 2026

Discussion on model hyperparameter optimization (HPO) with PyPOTS<2.0 and NNI #408

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Replace HPO framework NNI with Optuna and update CLI#852

Replace HPO framework NNI with Optuna and update CLI#852
WenjieDu merged 36 commits into
devfrom
(feat)update_cli_hpo

WenjieDu commented May 3, 2026

Uh oh!

sonarqubecloud Bot commented May 4, 2026

Uh oh!

Uh oh!

coveralls commented May 4, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

WenjieDu commented May 3, 2026

What does this PR do?

Before submitting

Uh oh!

sonarqubecloud Bot commented May 4, 2026

Quality Gate failed

Uh oh!

Uh oh!

coveralls commented May 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Coverage Report for CI Build 25324056397

Coverage decreased (-0.3%) to 79.831%

Details

Uncovered Changes

Coverage Regressions

Coverage Stats

💛 - Coveralls

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

coveralls commented May 4, 2026 •

edited

Loading