This project utilizes a Random Forest Regressor to predict the Global Warming Potential (GWP), measured in kg CO2 eq, of various beverage containers based on their life cycle data.
The analysis explores how different materials, manufacturing processes, and transportation factors contribute to the environmental impact of packaging. The model identifies key variables that drive carbon emissions throughout the entire life cycle of a product.
- Data Preprocessing: Handled categorical variables using One-Hot Encoding (for nominal columns) and Target Encoding (for the high-cardinality "Manufacturing process" column).
- Model: Random Forest Regressor with 100 trees and a max depth of 10.
- Evaluation: Performance is measured using R-squared (RΒ²), Root Mean Squared Error (RMSE), and Mean Absolute Error (MAE).
- Insights: Feature importance ranking to identify the primary drivers of GWP.
- Samples: 54
- Features: 13 (raw) β 32 (after encoding)
- Target Variable: GWP (kg CO2 eq)
| Feature Category | Examples |
|---|---|
| Material Info | Bottle Material, Cap Material, Filled Material, Label |
| Logistics | Transport Mode, Transport Distance, No. of uses, Mass |
| Production | Manufacturing Process, Electricity Mix, Electricity Value |
| End of Life | EOL Scenario, EOL Percentage |
The model demonstrated strong predictive performance on the test set:
- RΒ² Score: 0.8847
- RMSE: 13.1459 kg CO2 eq
- MAE: 7.0454 kg CO2 eq
According to the model's Feature Importance, the most significant drivers of emissions are:
- Manufacturing Process (0.8265)
- EOL Scenario: Landfilling (0.0266)
- Label Type: Paper (0.0210)
- No of uses (0.0199)
- Cap Material: Aluminium (0.0193)