When processing multiple RFDs that contain many large (multi-MB) images, we have seen the CPU and memory of an AWS t3.medium come under stress. Note that we are not hitting burst CPU caps. There are two root issues here:
- We over-process RFDs that have included documents that are on the main branch. Instead, we should generate jobs only when at least one required file has changed in the RFD's subdirectory.
- PDF generation should go through a queue with a maximum concurrency limit, as it is the primary CPU and memory consumer.
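
As a rough illustration of both points, here is a minimal Python sketch, not the actual implementation. Everything named in it is an assumption made for the example: the `rfd/<number>/...` repository layout, the helper names `rfds_needing_jobs` and `generate_pdfs`, the `render_pdf` coroutine (standing in for whatever actually produces the PDF), and the limit of 2. It also treats any file under an RFD's subdirectory as "required", which may be broader than intended.

```python
import asyncio
from pathlib import PurePosixPath

# Assumed limit for illustration; tune for the instance size.
MAX_CONCURRENT_PDF_JOBS = 2


def rfds_needing_jobs(changed_files, rfd_root="rfd"):
    """Return the sorted RFD numbers whose own subdirectory has a changed file.

    Assumes a hypothetical layout of <rfd_root>/<number>/<file...>.
    """
    root_parts = PurePosixPath(rfd_root).parts
    touched = set()
    for path in changed_files:
        parts = PurePosixPath(path).parts
        # Need at least <rfd_root>/<number>/<file>; ignore everything else.
        in_root = parts[: len(root_parts)] == root_parts
        if in_root and len(parts) > len(root_parts) + 1:
            touched.add(parts[len(root_parts)])
    return sorted(touched)


async def generate_pdfs(rfd_numbers, render_pdf, limit=MAX_CONCURRENT_PDF_JOBS):
    """Run render_pdf(rfd) for each RFD, with at most `limit` running at once."""
    semaphore = asyncio.Semaphore(limit)

    async def run_one(rfd):
        # Jobs beyond the limit wait here until a slot frees up.
        async with semaphore:
            return await render_pdf(rfd)

    # gather returns results in the same order as rfd_numbers.
    return await asyncio.gather(*(run_one(rfd) for rfd in rfd_numbers))
```

With this shape, a push that only touches files outside an RFD's subdirectory produces no jobs for that RFD, and the number of PDF renders in flight never exceeds `limit` regardless of how many RFDs changed. If rendering is CPU-bound work in the same process, `render_pdf` would need to hand off to a subprocess or worker pool for the limit to translate into an actual cap on CPU and memory use.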