Skip to content

Sync sensei skill + GEPA integration from upstream#15097

Merged
spboyer merged 3 commits intoAzure:mainfrom
spboyer:sync-sensei-from-upstream
Apr 16, 2026
Merged

Sync sensei skill + GEPA integration from upstream#15097
spboyer merged 3 commits intoAzure:mainfrom
spboyer:sync-sensei-from-upstream

Conversation

@spboyer
Copy link
Copy Markdown
Member

@spboyer spboyer commented Apr 13, 2026

Summary

Syncs the .github/skills/sensei folder with the upstream source at microsoft/GitHub-Copilot-for-Azure.

Fixes #15096

Changes

Sensei Skill Updates (v1.0.0 → v1.0.5)

  • SKILL.md: Rich help banner, batch mode, --skip-integration, trigger-overlap disambiguation
  • SCORING.md: Routing-regression guard logic, overlap exception rules, guard pseudocode
  • README.md / LOOP.md / EXAMPLES.md: testPathPatterns fix, link alignment

GEPA Integration (new)

  • scripts/gepa/auto_evaluator.py — Quality scoring evaluator (score, optimize, score-all)
  • references/TOKEN-INTEGRATION.md — Token budget integration docs
  • .github/workflows/gepa-quality-score.yml — CI pipeline for SKILL.md quality scoring
  • .github/workflows/gepa-quality-score-comment.yml — Posts score comments on PRs

Path Adjustments

  • Pipeline refs updated from plugin/skills/.github/skills/ for this repo's layout

Preserved (not modified)

  • eval.yaml, evals/, tasks/ — local eval configs kept as-is

@spboyer spboyer force-pushed the sync-sensei-from-upstream branch from 07f8f41 to 3b0e1d8 Compare April 13, 2026 16:29
@spboyer spboyer enabled auto-merge (squash) April 13, 2026 16:29
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Note

Copilot was unable to run its full agentic suite in this review.

Syncs the Sensei skill content with upstream and adds GEPA-based quality scoring automation in CI for SKILL.md changes.

Changes:

  • Added GEPA quality scoring workflow + a follow-up workflow to comment results on PRs.
  • Introduced a Python auto-evaluator that scores (and can optionally optimize) skills based on content heuristics and discovered trigger tests.
  • Refreshed Sensei documentation (SKILL.md + references) with updated loop/scoring guidance.
Show a summary per file
File Description
.github/workflows/gepa-quality-score.yml Adds CI job to score skills and upload score JSON artifact.
.github/workflows/gepa-quality-score-comment.yml Adds workflow_run job to post/update a PR comment with score results.
.github/skills/sensei/scripts/gepa/auto_evaluator.py Adds GEPA evaluator CLI to score/optimize skills using discovered test harness data.
.github/skills/sensei/references/TOKEN-INTEGRATION.md Documents token-budget workflow and CLI usage.
.github/skills/sensei/references/SCORING.md Updates scoring guidance (anti-trigger exception rules, links, examples).
.github/skills/sensei/references/LOOP.md Updates test command examples and token-budget references/links.
.github/skills/sensei/references/EXAMPLES.md Updates examples + skill name references (e.g., azure-cost).
.github/skills/sensei/SKILL.md Updates Sensei skill definition, help banner, loop steps, GEPA mode, references.
.github/skills/sensei/README.md Updates documentation + Jest CLI flag (testPathPatterns) and links.

Copilot's findings

  • Files reviewed: 9/9 changed files
  • Comments generated: 13

Comment thread .github/workflows/gepa-quality-score-comment.yml Outdated
Comment thread .github/skills/sensei/scripts/gepa/auto_evaluator.py Outdated
Comment thread .github/skills/sensei/scripts/gepa/auto_evaluator.py Outdated
Comment thread .github/skills/sensei/scripts/gepa/auto_evaluator.py Outdated
Comment thread .github/skills/sensei/scripts/gepa/auto_evaluator.py Outdated
Comment thread .github/skills/sensei/SKILL.md Outdated
Comment thread .github/skills/sensei/SKILL.md Outdated
Comment thread .github/skills/sensei/SKILL.md Outdated
Comment thread .github/workflows/gepa-quality-score.yml
Comment thread .github/workflows/gepa-quality-score.yml
@ronniegeraghty
Copy link
Copy Markdown
Member

Hey @spboyer — nice work pulling this together! Heads up on the failing CI check:

The score job in .github/workflows/gepa-quality-score.yml is failing because it installs gepa==0.7.0, but that version doesn't exist on PyPI — the latest published version is 0.1.1.

The good news is the score job doesn't actually need the gepa package at all. The score and score-all commands in auto_evaluator.py are fully self-contained local heuristics — gepa is only used lazily inside the optimize code path.

Two options to fix:

  1. Remove the gepa install from the workflow entirely — the score job never calls it. You can add it back later when/if an optimize job gets added.
  2. Pin to the actual version (gepa>=0.1.0 or gepa==0.1.1) if you want it pre-installed for future optimize support.

Option 1 is probably cleanest since it avoids carrying an unused dependency. Let me know if you want me to push a suggestion!

@spboyer
Copy link
Copy Markdown
Member Author

spboyer commented Apr 13, 2026

All 13 review threads addressed in 0e9516f. Fixed all remaining plugin/skills → .github/skills references. CI optimization suggestion noted for future. @ronniegeraghty — ready for another look.

spboyer and others added 3 commits April 14, 2026 11:48
Updates sensei skill from v1.0.0 to v1.0.5 and adds missing GEPA
integration. Changes synced from microsoft/GitHub-Copilot-for-Azure.

Updated files:
- SKILL.md: v1.0.5 with help banner, batch mode, overlap disambiguation
- SCORING.md: routing-regression guard, trigger-overlap rules
- README.md, LOOP.md, EXAMPLES.md: testPathPatterns fix, link updates

New files:
- references/TOKEN-INTEGRATION.md: separate token integration docs
- scripts/gepa/auto_evaluator.py: GEPA quality scoring evaluator
- .github/workflows/gepa-quality-score.yml: CI quality scoring
- .github/workflows/gepa-quality-score-comment.yml: PR score comments

Path references updated from plugin/skills/ to .github/skills/ for
this repository's layout.

Fixes Azure#15096

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
Updates path references across sensei skill docs, GEPA auto_evaluator
defaults/usage examples, and workflow PR comment help text to match
this repository's .github/skills/ layout.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
The upstream source pinned gepa==0.7.0 which is not available on PyPI.
Latest available is 0.1.1. Using >= to pick up future releases.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
@spboyer spboyer force-pushed the sync-sensei-from-upstream branch from 1df2728 to 31f2b63 Compare April 14, 2026 15:48
@spboyer
Copy link
Copy Markdown
Member Author

spboyer commented Apr 15, 2026

/skip check-enforcer

Copy link
Copy Markdown
Member Author

@spboyer spboyer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

2 findings (0 HIGH, 2 MEDIUM, 0 LOW)

Syncs the sensei skill from upstream with path corrections and adds the GEPA auto-evaluator for CI-based quality scoring. Documentation updates are clean and the path migration from plugin/skills/ to .github/skills/ looks complete. Two medium-severity items below on the new code.

Comment thread .github/workflows/gepa-quality-score.yml
Comment thread .github/skills/sensei/scripts/gepa/auto_evaluator.py
@ronniegeraghty
Copy link
Copy Markdown
Member

/check-enforcer override

@spboyer spboyer merged commit a411192 into Azure:main Apr 16, 2026
6 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Sync sensei skill with upstream microsoft/GitHub-Copilot-for-Azure

3 participants