Sync sensei skill + GEPA integration from upstream#15097
Conversation
07f8f41 to
3b0e1d8
Compare
There was a problem hiding this comment.
Pull request overview
Note
Copilot was unable to run its full agentic suite in this review.
Syncs the Sensei skill content with upstream and adds GEPA-based quality scoring automation in CI for SKILL.md changes.
Changes:
- Added GEPA quality scoring workflow + a follow-up workflow to comment results on PRs.
- Introduced a Python auto-evaluator that scores (and can optionally optimize) skills based on content heuristics and discovered trigger tests.
- Refreshed Sensei documentation (SKILL.md + references) with updated loop/scoring guidance.
Show a summary per file
| File | Description |
|---|---|
| .github/workflows/gepa-quality-score.yml | Adds CI job to score skills and upload score JSON artifact. |
| .github/workflows/gepa-quality-score-comment.yml | Adds workflow_run job to post/update a PR comment with score results. |
| .github/skills/sensei/scripts/gepa/auto_evaluator.py | Adds GEPA evaluator CLI to score/optimize skills using discovered test harness data. |
| .github/skills/sensei/references/TOKEN-INTEGRATION.md | Documents token-budget workflow and CLI usage. |
| .github/skills/sensei/references/SCORING.md | Updates scoring guidance (anti-trigger exception rules, links, examples). |
| .github/skills/sensei/references/LOOP.md | Updates test command examples and token-budget references/links. |
| .github/skills/sensei/references/EXAMPLES.md | Updates examples + skill name references (e.g., azure-cost). |
| .github/skills/sensei/SKILL.md | Updates Sensei skill definition, help banner, loop steps, GEPA mode, references. |
| .github/skills/sensei/README.md | Updates documentation + Jest CLI flag (testPathPatterns) and links. |
Copilot's findings
- Files reviewed: 9/9 changed files
- Comments generated: 13
|
Hey @spboyer — nice work pulling this together! Heads up on the failing CI check: The The good news is the Two options to fix:
Option 1 is probably cleanest since it avoids carrying an unused dependency. Let me know if you want me to push a suggestion! |
|
All 13 review threads addressed in 0e9516f. Fixed all remaining plugin/skills → .github/skills references. CI optimization suggestion noted for future. @ronniegeraghty — ready for another look. |
Updates sensei skill from v1.0.0 to v1.0.5 and adds missing GEPA integration. Changes synced from microsoft/GitHub-Copilot-for-Azure. Updated files: - SKILL.md: v1.0.5 with help banner, batch mode, overlap disambiguation - SCORING.md: routing-regression guard, trigger-overlap rules - README.md, LOOP.md, EXAMPLES.md: testPathPatterns fix, link updates New files: - references/TOKEN-INTEGRATION.md: separate token integration docs - scripts/gepa/auto_evaluator.py: GEPA quality scoring evaluator - .github/workflows/gepa-quality-score.yml: CI quality scoring - .github/workflows/gepa-quality-score-comment.yml: PR score comments Path references updated from plugin/skills/ to .github/skills/ for this repository's layout. Fixes Azure#15096 Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Updates path references across sensei skill docs, GEPA auto_evaluator defaults/usage examples, and workflow PR comment help text to match this repository's .github/skills/ layout. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
The upstream source pinned gepa==0.7.0 which is not available on PyPI. Latest available is 0.1.1. Using >= to pick up future releases. Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
1df2728 to
31f2b63
Compare
|
/skip check-enforcer |
spboyer
left a comment
There was a problem hiding this comment.
2 findings (0 HIGH, 2 MEDIUM, 0 LOW)
Syncs the sensei skill from upstream with path corrections and adds the GEPA auto-evaluator for CI-based quality scoring. Documentation updates are clean and the path migration from plugin/skills/ to .github/skills/ looks complete. Two medium-severity items below on the new code.
|
/check-enforcer override |
Summary
Syncs the
.github/skills/senseifolder with the upstream source at microsoft/GitHub-Copilot-for-Azure.Fixes #15096
Changes
Sensei Skill Updates (v1.0.0 → v1.0.5)
--skip-integration, trigger-overlap disambiguationtestPathPatternsfix, link alignmentGEPA Integration (new)
scripts/gepa/auto_evaluator.py— Quality scoring evaluator (score, optimize, score-all)references/TOKEN-INTEGRATION.md— Token budget integration docs.github/workflows/gepa-quality-score.yml— CI pipeline for SKILL.md quality scoring.github/workflows/gepa-quality-score-comment.yml— Posts score comments on PRsPath Adjustments
plugin/skills/→.github/skills/for this repo's layoutPreserved (not modified)
eval.yaml,evals/,tasks/— local eval configs kept as-is