Search before asking
What happened
GitHub GraphQL issue refresh no longer fails immediately on Could not resolve to an Issue after #8637, but deleted or transferred issues can still leave stale local rows in DevLake.
Those stale rows remain eligible for future refresh attempts because the local _tool_github_issues and raw rows are not cleaned up when the source issue no longer resolves.
In the reproduced case, a previously collected GitHub issue was deleted from the source repository. DevLake had already collected it into local tables, and later refresh attempts still tried to resolve it through GitHub GraphQL.
What do you expect to happen
When GitHub GraphQL returns Could not resolve to an Issue for an issue in the refresh-open-issues path, DevLake should:
- continue processing the rest of the batch
- remove or tombstone the stale local issue rows that keep the missing issue in the refresh path
- avoid retrying the same orphaned issue forever
How to reproduce
- Create a GitHub issue in repository A
- Run a DevLake pipeline that collects GitHub issues for repository A
- Delete the GitHub issue from repository A, or transfer it to another repository
- Re-run the DevLake pipeline
- Observe that DevLake still has stale local rows for the old issue and continues to encounter the missing issue in the refresh path
Anything else
Related upstream fix:
The remaining gap seems to be stale local data cleanup rather than just skipping the GraphQL error.
Version
Reproduced on v1.0.3-beta9@1410838
Are you willing to submit PR?
Code of Conduct
Search before asking
What happened
GitHub GraphQL issue refresh no longer fails immediately on
Could not resolve to an Issueafter #8637, but deleted or transferred issues can still leave stale local rows in DevLake.Those stale rows remain eligible for future refresh attempts because the local
_tool_github_issuesand raw rows are not cleaned up when the source issue no longer resolves.In the reproduced case, a previously collected GitHub issue was deleted from the source repository. DevLake had already collected it into local tables, and later refresh attempts still tried to resolve it through GitHub GraphQL.
What do you expect to happen
When GitHub GraphQL returns
Could not resolve to an Issuefor an issue in the refresh-open-issues path, DevLake should:How to reproduce
Anything else
Related upstream fix:
The remaining gap seems to be stale local data cleanup rather than just skipping the GraphQL error.
Version
Reproduced on
v1.0.3-beta9@1410838Are you willing to submit PR?
Code of Conduct