hamiltoq/CS4800

Automation for Data Accessioner, XSLTProcessor, hashlib, OpenShield, and DART.

Data Accessioner

  • Transfers files from disks onto a file server
  • Output: XML file and the full directory from the input
  • How the function works:
    • Recursively copies or moves files from the input directory
    • Preserves the original file timestamps
    • Generates an XML report
    • Recreates the full directory structure from the input directory
  • Sample output: Screenshot 2025-12-08 131822
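The copy-and-report steps listed above could be sketched roughly as follows. This is a minimal illustration using only the Python standard library, not the project's actual code: the function name `accession`, the `<accession>` and `<file>` element names, and the attributes written to the report are all assumptions made for the example.

```python
import os
import shutil
import xml.etree.ElementTree as ET
from pathlib import Path


def accession(src: str, dest: str, report_path: str) -> int:
    """Copy src into dest recursively and write a simple XML report.

    Hypothetical helper: the report layout here is illustrative only.
    Returns the number of files copied.
    """
    src_root = Path(src)
    dest_root = Path(dest)
    root = ET.Element("accession", source=str(src_root))
    count = 0

    for dirpath, _dirnames, filenames in os.walk(src_root):
        rel_dir = Path(dirpath).relative_to(src_root)
        target_dir = dest_root / rel_dir
        # Recreate the directory structure, including empty folders.
        target_dir.mkdir(parents=True, exist_ok=True)
        for name in sorted(filenames):
            source_file = Path(dirpath) / name
            # copy2 copies the contents and attempts to preserve metadata
            # such as the modification time.
            shutil.copy2(source_file, target_dir / name)
            stat = source_file.stat()
            ET.SubElement(
                root,
                "file",
                path=(rel_dir / name).as_posix(),
                size=str(stat.st_size),
                mtime=str(int(stat.st_mtime)),
            )
            count += 1

    ET.ElementTree(root).write(report_path, encoding="utf-8", xml_declaration=True)
    return count
```

Calling `accession("E:/disk", "//server/share/disk", "report.xml")` (paths are placeholders) would copy the tree and leave one `<file>` entry per copied file in the report.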

XSLT Processor

  • Parses the report from Data Accessioner
  • Output: CSV and HTML reports
  • How the function works:
    • Transforms the XML report from the previous function into CSV and HTML reports
  • Sample output
    • CSV: Screenshot 2025-12-08 131338

    • HTML: Screenshot 2025-12-08 131635
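To make the XML-to-CSV step concrete, here is a small sketch. It deliberately swaps in `xml.etree.ElementTree` and the `csv` module instead of an XSLT stylesheet, because the Python standard library has no XSLT engine; a real XSLT transform would need a third-party package such as lxml. The function name `report_to_csv` and the `<file>` element with `path`, `size`, and `md5` attributes are assumptions, not the actual Data Accessioner schema.

```python
import csv
import io
import xml.etree.ElementTree as ET


def report_to_csv(xml_text: str) -> str:
    """Flatten every <file> element of an XML report into CSV rows.

    Hypothetical schema: each <file> carries path, size, and md5 attributes.
    """
    root = ET.fromstring(xml_text)
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["path", "size", "md5"])
    # iter() walks the whole tree, so nested <file> elements are included.
    for node in root.iter("file"):
        writer.writerow(
            [node.get("path", ""), node.get("size", ""), node.get("md5", "")]
        )
    return out.getvalue()
```

Missing attributes become empty cells, which keeps the column count stable for downstream tools.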

Hashlib function

  • Parses the report from Data Accessioner, builds a list of files and their recorded checksums, then validates those checksums against the files on disk
  • Output: CSV and log report
  • How the function works:
    • Recomputes MD5 checksums for all of the files in the input directory
    • Compares each recomputed checksum to the checksum stored in the report
    • Marks each file as OK if the checksums match; otherwise flags it as MISSING, MISMATCH, or ERROR
  • Sample output
    • CSV: Screenshot 2025-12-08 132042

    • Log: Screenshot 2025-12-08 132132
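The verification logic described above can be sketched with `hashlib` as shown below. This is an illustrative sketch rather than the project's implementation: the helper names `md5_of` and `verify`, and the assumption that the stored checksums arrive as a dictionary of relative path to MD5 hex string, are hypothetical.

```python
import hashlib
from pathlib import Path


def md5_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the MD5 hex digest of a file, read in chunks to limit memory use."""
    digest = hashlib.md5()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(base_dir: str, expected: dict[str, str]) -> dict[str, str]:
    """Map each relative path to OK, MISSING, MISMATCH, or ERROR.

    `expected` is assumed to hold the checksums parsed from the report.
    """
    results = {}
    for rel_path, stored in expected.items():
        target = Path(base_dir) / rel_path
        if not target.is_file():
            results[rel_path] = "MISSING"
            continue
        try:
            actual = md5_of(target)
        except OSError:
            # The file exists but could not be read (permissions, I/O fault).
            results[rel_path] = "ERROR"
            continue
        results[rel_path] = "OK" if actual == stored.lower() else "MISMATCH"
    return results
```

The resulting dictionary can be written out row by row to produce the CSV and log reports.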

Virus scanning

  • Runs OpenShield, a simple and fast antivirus checker written in Python that uses a database from MalwareBazaar.

Bagging function (optional)

  • Bundles files and metadata based on APTrust's Digital Archivist's Resource Tool (DART).

This project mimics these five programs so that the workflow can be automated.
