Repository of personal data science projects for self-learning and hobby purposes.
- Mammogram Project: Image classification pipeline using PyTorch on SageMaker, Docker/ECS for transformations, and Terraform for infrastructure provisioning
- Drug Sentiment: Uses Amazon Textract and Comprehend to compare drug sentiment from paper prescriptions (winning solution for the AWS Marketplace Hackathon)
- Smartphone Activity Prediction: Data cleaning and a simple model for smartphone accelerometer data
- Oscar Winners and IMDb: Data exploration and insight generation from film data
- Credit Risk Analysis: End-to-end project building a data science application for analysing credit risk
- MLOps Terraform Pipeline: Automates tasks associated with training and deploying ML models, using Step Functions, Lambda, ECR, Glue and SageMaker