[prompt-clustering] Copilot agent prompt clusters — 2026-05-21 (n=1052, last 30 days) #33741
Closed
This discussion has been marked as outdated by Copilot Agent Prompt Clustering Analysis. A newer discussion is available at Discussion #33980.
NLP clustering of copilot-swe-agent PR prompts from the last 30 days. TF-IDF features and K-means clustering surfaced 9 task themes. One cluster ("awf / firewall / config") has a merge rate 20 percentage points below the 80.5% overall average, making it the strongest candidate for prompt-engineering investment.
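The TF-IDF plus K-means approach named above can be sketched with the Python standard library alone. This is a minimal illustration, not the pipeline behind this report: the whitespace tokenizer, the smoothed IDF formula, the deterministic farthest-first centroid initialization, and the `EXAMPLE_PROMPTS` list are all assumptions made for the example. The helper `top_terms` mimics how a cluster could be labeled by its three heaviest keywords.

```python
import math
from collections import Counter

# Hypothetical prompt titles, invented for this sketch (not taken from the PR data).
EXAMPLE_PROMPTS = [
    "fix awf firewall config validation",
    "add awf firewall config option",
    "update firewall config defaults for awf",
    "emit otlp span for engine id",
    "add otlp engine span attributes",
    "fix otlp span export in engine",
]


def tfidf_vectors(docs):
    """Return (vocab, vectors) with one L2-normalized TF-IDF vector per document."""
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    index = {tok: i for i, tok in enumerate(vocab)}
    # Document frequency: how many documents contain each token.
    df = Counter(tok for toks in tokenized for tok in set(toks))
    vectors = []
    for toks in tokenized:
        vec = [0.0] * len(vocab)
        for tok, count in Counter(toks).items():
            # Smoothed IDF (an assumption): common tokens keep a small non-zero weight.
            idf = math.log((1 + len(docs)) / (1 + df[tok])) + 1.0
            vec[index[tok]] = count * idf
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0
        vectors.append([v / norm for v in vec])
    return vocab, vectors


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(vectors, k, max_iter=50):
    """Lloyd's algorithm with farthest-first initialization (deterministic)."""
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        # Next centroid: the vector farthest from every centroid chosen so far.
        farthest = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(farthest))
    labels = None
    for _ in range(max_iter):
        # Assignment step: each vector joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda c: sq_dist(v, centroids[c])) for v in vectors
        ]
        if new_labels == labels:
            break  # assignments stopped changing, so the clustering has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


def top_terms(centroid, vocab, n=3):
    """Return the n highest-weighted vocabulary terms of a centroid."""
    ranked = sorted(range(len(vocab)), key=lambda i: -centroid[i])
    return [vocab[i] for i in ranked[:n]]


if __name__ == "__main__":
    vocab, vectors = tfidf_vectors(EXAMPLE_PROMPTS)
    labels, centroids = kmeans(vectors, k=2)
    for c, centroid in enumerate(centroids):
        print(c, " / ".join(top_terms(centroid, vocab)))
```

Farthest-first initialization is used here only to keep the toy run reproducible. A production run over roughly a thousand prompts would more likely rely on a library implementation with randomized restarts.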
Summary
Cluster Overview
Cluster details, keywords, and examples
Cluster 1: mcp / cli / added
`map[string]any` fields: remove `WorkflowData.Tools` and `ToolsConfig.raw`
Cluster 2: workflow / agent / prompt
`emoji` frontmatter field
`/reflect` fetch into agent prompt in daily-model-inventory workflow
Cluster 3: bug / fix / did
Cluster 4: pr / comment / branch
`on.pull_request_reviewer: slash_command` synthetic trigger and reviewer lifecycle routing
`--docker-host-path-prefix` in generated workflows
Cluster 5: otlp / engine / span
`endpoint` field
`GH_AW_INFO_ENGINE_ID` into setup steps so setup OTel spans emit `gh-aw.engine.id`
Cluster 6: safe / output / outputs
`@copilot` mentions in PR Sous Chef safe outputs
Cluster 7: awf / firewall / config
`firewall.effective-token-steering` compiler support
`workflow_call` input
Cluster 8: alias / model / inventory
Cluster 9: actions / job / github actions
`actions: read` permission to smoke-water.yml (#investigate-smoke-water-failure)
Key Findings
Recommendations
Full PR cluster table
Showing 200 most recent of 1052 analyzed PRs. Full CSV: pr-clusters.csv.
Methodology
`## Changes`/`## Summary`/ similar) across 1054 PRs in cache.
`aw_info.json` logs cached); analysis uses PR-level proxies (commits, comments, file count) as iteration signals.
References:
pr-clusters.csv
cluster-analysis.json