Skip to content
This repository was archived by the owner on Nov 16, 2023. It is now read-only.
This repository was archived by the owner on Nov 16, 2023. It is now read-only.

KMeansPlusPlus PredictedLabel Type  #470

@kere-nel

Description

@kere-nel

The expected type for PredictedLabel should be uint32, since ML.NET returns key values, but it's actually int32.

`Taken from example code:

from nimbusml import Pipeline, FileDataStream
from nimbusml.cluster import KMeansPlusPlus
from nimbusml.datasets import get_dataset
from nimbusml.feature_extraction.categorical import OneHotVectorizer

path = get_dataset('infert').as_filepath()

data = FileDataStream.read_csv(path)
print(data.head())
pipeline = Pipeline([
OneHotVectorizer(columns={'edu': 'education'}),
KMeansPlusPlus(n_clusters=5, feature=['induced', 'edu', 'parity'])
])
metrics, predictions = pipeline
.fit(data)
.test(data, 'induced', output_scores=True)
print(predictions.head())
print(predictions['PredictedLabel'].dtype)
`

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions