Add ML Flow by ccs-gs · Pull Request #14 · Crown-Commercial-Service/ccs-contract-map

ccs-gs · 2026-03-19T14:27:33Z

Summary

Added optional MLflow tracking to evaluation/run_evaluation.py (modeled after the WIS evaluation flow) so evaluations can log:
- run params (mapper, truth set path, prompt name/path, sample count)
- run metrics (accuracy_percent, accuracy_fraction, correct_predictions, evaluation_duration_seconds)
- artifacts (selected prompt file and output results CSV)
Added new CLI flags for MLflow configuration:
- --mlflow
- --mlflow-tracking-uri
- --mlflow-experiment-name
- --mlflow-run-name
Added mlflow dependency to requirements.txt.
Updated README.md with MLflow usage options.
Added/extended tests in tests/test_run_evaluation.py:
- existing output CSV behavior remains covered
- new test verifies MLflow params, metrics, and artifacts are logged when enabled
Added .env.example with required Azure OpenAI variables and optional MLflow environment variables.

Why

This enables reproducible, trackable evaluation runs with experiment history and artifacts, making prompt/model comparison easier over time and aligning this repo with the evaluation observability pattern used in the referenced WIS script.

SamuelHLewis · 2026-03-20T11:19:57Z

@copilot review this PR, focusing on how results are logged in Azure MLFlow

Copilot · 2026-03-20T11:20:04Z

@SamuelHLewis I've opened a new pull request, #15, to work on those changes. Once the pull request is ready, I'll request review from you.

SamuelHLewis · 2026-03-20T11:35:52Z

Copilot review only flagged the possibility of making MLFlow optional, which we're deliberately not doing, so no changes are needed after this

Add ML Flow

43b5c28

ccs-gs marked this pull request as ready for review March 19, 2026 14:29

SamuelHLewis and others added 2 commits March 20, 2026 10:57

Add configuration for AzureML-MLFlow

de7fd4b

Always send data to azure mlflow

1c1acc2

Copilot AI mentioned this pull request Mar 20, 2026

Make Azure MLflow opt-in; fix metric type and test fidelity #15

Closed

SamuelHLewis merged commit e92cbea into develop Mar 20, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ML Flow#14

Add ML Flow#14
SamuelHLewis merged 3 commits intodevelopfrom
add-ml-flow

ccs-gs commented Mar 19, 2026 •

edited

Loading

Uh oh!

SamuelHLewis commented Mar 20, 2026

Uh oh!

Copilot AI commented Mar 20, 2026

Uh oh!

SamuelHLewis commented Mar 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

ccs-gs commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Why

Uh oh!

SamuelHLewis commented Mar 20, 2026

Uh oh!

Copilot AI commented Mar 20, 2026

Uh oh!

SamuelHLewis commented Mar 20, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

ccs-gs commented Mar 19, 2026 •

edited

Loading