I would like to propose the enhancement of the presidio-research repository by introducing functionalities that enable the evaluation of how accurately columns in a table or a JSON containing Personally Identifiable Information (PII) are identified, utilizing the capabilities of the newly introduced package presidio-structured.
A starting point could be simply assess the precision, recall, and F1 score of PII column identification.
I would like to propose the enhancement of the presidio-research repository by introducing functionalities that enable the evaluation of how accurately columns in a table or a JSON containing Personally Identifiable Information (PII) are identified, utilizing the capabilities of the newly introduced package presidio-structured.
A starting point could be simply assess the precision, recall, and F1 score of PII column identification.