mirror of https://github.com/github/awesome-copilot.git synced 2026-05-15 11:11:48 +00:00

Muhammad Ubaid Raza d5c855ece0 feat: [gem-team] Add confidence metric, optimize planner workflow (#1695)

* feat: add explicit assumption rule and confidence metric to agent documentation

- Add `confidence` field (0‑1) to the output schema in `agents/gem-browser-tester.agent.md`
- Include `confidence` in the `extra` object of `agents/gem-devops.agent.md`
- Append the guideline “State assumptions explicitly; never guess silently” to all agent docs
- Update the “Bisect (Complex Only)” heading to reflect its gate condition
- Minor wording and formatting adjustments across the affected agent documents

* chore: update readme

* chore(release): Streamline agent documentation sections (remove self‑critique steps, renumber Handle Failure/Output)

2026-05-14 10:02:32 +10:00

README.md

Gem Team

Self-Learning Multi-agent orchestration harness for spec-driven development and automated verification.

Quick Start

# Install via APM (recommended)
apm install mubaidr/gem-team

# Or register as a marketplace
apm marketplace add mubaidr/gem-team
apm install gem-team@gem-team

See all supported installation options below.

Contents

Quick Start
Why Gem Team?
Harness Architecture
Installation
The Agent Team
Knowledge Sources
Contributing

Why Gem Team?

Performance

4x Faster — Parallel, wave-based execution
Pattern Reuse — Codebase pattern discovery prevents reinventing wheels

Quality & Security

Higher Quality — Specialized harness agents + TDD + verification gates + contract-first
Built-in Security — OWASP scanning, secrets/PII detection on critical tasks
Resilient — Pre-mortem analysis, failure handling, auto-replanning
Accessibility-First — WCAG compliance validated at spec and runtime layers
Safe DevOps — Idempotent operations, health checks, mandatory approval gates
Constructive Critique — gem-critic challenges assumptions, finds edge cases

Intelligence

Established Patterns — Uses library/harness conventions over custom implementations
Source Verified — Every factual claim cites its source; no guesswork
Knowledge-Driven — Prioritized sources (PRD → codebase → AGENTS.md → Context7 → docs)
Continuous Learning — Memory tool persists patterns, gotchas, user preferences across sessions
Auto-Skills — Agents extract reusable SKILL.md files from successful tasks (high confidence: auto, medium: confirm)
Skills & Guidelines — Built-in skill & guidelines (web-design-guidelines)
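The Auto-Skills rule above (high confidence: auto, medium: confirm) can be pictured as a small gate. This is only a sketch: the 0 to 1 scale, the threshold values, the `skill_action` name, and the "skip" outcome for low confidence are all assumptions made for illustration, not values documented by Gem Team.

```python
def skill_action(confidence: float, high: float = 0.8, medium: float = 0.5) -> str:
    """Map a confidence score to an action for a candidate SKILL.md.

    The thresholds and the "skip" outcome are hypothetical; the README only
    states that high confidence is automatic and medium needs confirmation.
    """
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be between 0 and 1")
    if confidence >= high:
        return "auto"      # extract the skill without asking
    if confidence >= medium:
        return "confirm"   # ask the user before saving the skill
    return "skip"          # assumed behavior for low confidence
```

With these assumed thresholds, `skill_action(0.9)` returns `"auto"` and `skill_action(0.6)` returns `"confirm"`.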

Process

Spec-Driven — Multi-step refinement defines "what" before "how"
Verified-Plan — Complex tasks: Plan → Verification → Critic
Traceable — Self-documenting IDs link requirements → tasks → tests → evidence
Intent vs. Compliance — Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
Diagnose-then-Fix — gem-debugger diagnoses → gem-implementer fixes → re-verifies
Pre-Mortem — Failure modes identified BEFORE execution
Contract-First — Contract tests written before implementation

Token Efficiency

Optimized for reduced LLM token consumption without quality loss:

Concise Output — No preamble, no meta commentary, no verbose explanations
Strict Formats — JSON/YAML exactly matching schemas — eliminates parse errors and retries
Empty is OK — Skip empty arrays, nulls, verbose fields where not needed
File-Based — Researcher/Planner save to YAML files (not all in JSON output)
Learnings — Empty patterns/conventions unless critical

Result: ~40-60% reduction in output tokens while maintaining quality.
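As an illustration of the "Empty is OK" rule, the sketch below strips nulls and empty containers from an agent result before it is serialized. The `prune_empty` helper and the field names in the sample report are hypothetical; the actual agent output schemas are not shown in this README.

```python
from typing import Any

# Values treated as "empty" and therefore omitted from the output.
_EMPTY = (None, "", [], {})


def prune_empty(value: Any) -> Any:
    """Recursively drop None, empty strings, empty lists and empty dicts."""
    if isinstance(value, dict):
        pruned = {key: prune_empty(item) for key, item in value.items()}
        return {key: item for key, item in pruned.items() if item not in _EMPTY}
    if isinstance(value, list):
        pruned_items = [prune_empty(item) for item in value]
        return [item for item in pruned_items if item not in _EMPTY]
    return value


# Hypothetical agent report with several empty fields.
report = {
    "status": "completed",
    "task_id": "T1",
    "learnings": {"patterns": [], "conventions": []},
    "notes": None,
    "confidence": 0.9,
}
compact = prune_empty(report)
```

Here `compact` keeps only `status`, `task_id`, and `confidence`; the `learnings` object disappears because both of its lists are empty. Falsy but meaningful values such as `0` and `False` are kept.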

Design

Design Agents — Dedicated agents for web and mobile UI/UX with anti-"AI slop" guidelines for distinctive aesthetics
Mobile Agents — Native mobile implementation (React Native, Flutter) + iOS/Android testing

Core Concepts

The "System-IQ" Multiplier

Raw reasoning in a single-pass chat isn't enough. Gem Team wraps your preferred LLM in a rigid harness with verification-first loops, boosting its effective capability on software engineering (SWE) tasks.

Design Support

Gem Team includes specialized design agents with anti-"AI slop" guidelines for distinctive, modern and unique aesthetics with accessibility compliance.

Triple Learning System

Type	Storage	1-liner
Memory	`/memories/`	Facts & user preferences (auto-save)
Skills	`docs/skills/`	Procedures with code examples
Conventions	`AGENTS.md`	Static rules (requires approval)

Harness Architecture

User Goal → Orchestrator → [Simple: Research/Plan] or [Complex: Discuss → PRD → Research → Plan → Approve] → Execute (waves) → Summary → Final Review
                ↓
            Diagnose → Fix → Re-verify
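The "Execute (waves)" step can be sketched as grouping a task DAG into waves, where every task in a wave has all of its dependencies satisfied by earlier waves and can therefore run in parallel. This is a minimal illustration, not the planner's actual algorithm; the `schedule_waves` function and the task names are hypothetical.

```python
def schedule_waves(deps: dict[str, set[str]]) -> list[list[str]]:
    """Group tasks into waves; deps maps each task to the tasks it depends on."""
    remaining = {task: set(needs) for task, needs in deps.items()}
    done: set[str] = set()
    waves: list[list[str]] = []
    while remaining:
        # A task is ready once all of its dependencies have completed.
        ready = sorted(task for task, needs in remaining.items() if needs <= done)
        if not ready:
            raise ValueError("cycle or unknown dependency in task graph")
        waves.append(ready)
        done.update(ready)
        for task in ready:
            del remaining[task]
    return waves


# Hypothetical plan: two tasks depend on the schema, one depends on both.
plan = {
    "schema": set(),
    "api": {"schema"},
    "ui": {"schema"},
    "e2e": {"api", "ui"},
}
waves = schedule_waves(plan)
```

For this plan, `waves` is `[["schema"], ["api", "ui"], ["e2e"]]`: `api` and `ui` share a wave because neither depends on the other.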

Installation

Install APM First

If you don't have APM installed, install it first:

# macOS/Linux
curl -fsSL https://microsoft.github.io/apm/install.sh | sh

# Windows (PowerShell)
irm https://microsoft.github.io/apm/install.ps1 | iex

# Or via npm
npm install -g @microsoft/apm

Why APM? Universal package manager for AI coding tools. One command installs to all your tools (Copilot CLI, Claude Code, Cursor, OpenCode). Handles version locking, updates, and dependencies automatically.

APM Documentation | GitHub

Choose the method that works best for your workflow:

Method 1: Direct Install via APM (Recommended)

Fastest way to get started. APM automatically detects your tool and installs to the correct location.

apm install mubaidr/gem-team

Works with: GitHub Copilot CLI, Claude Code, Cursor, OpenCode

APM Documentation

Method 2: Via Marketplace

Add gem-team as a marketplace, then install from it. Useful for browsing available agents and managing updates.

GitHub Copilot CLI

# Add marketplace
copilot plugin marketplace add mubaidr/gem-team

# Browse available plugins
copilot plugin marketplace browse gem-team

# Install
copilot plugin install gem-team@gem-team

Claude Code

# Add marketplace
/plugin marketplace add mubaidr/gem-team

# Browse in UI
/plugin

# Install
/plugin install gem-team@gem-team

Cursor IDE

# Add marketplace via APM
apm marketplace add mubaidr/gem-team

# Install
apm install gem-team@gem-team

Method 3: From awesome-copilot Marketplace

Install from the official awesome-copilot marketplace (GitHub Copilot CLI only).

# awesome-copilot is pre-registered by default
copilot plugin install gem-team@awesome-copilot

Note: This method is only available if gem-team is listed in the awesome-copilot marketplace.

Method 4: Local/Manual Installation

For development, testing, or offline use.

Clone Repository

git clone https://github.com/mubaidr/gem-team.git
cd gem-team

Claude Code

# Load as local plugin
claude --plugin-dir .

# Or add as local marketplace
/plugin marketplace add ./

# Reload after changes
/reload-plugins

Cursor IDE

# Option 1: Via chat command
# In Cursor: /add-plugin /absolute/path/to/gem-team

# Option 2: Copy agents to project
# One-line install: Copy agents and rename to .mdc
mkdir -p .cursor/rules && cp .apm/agents/*.agent.md .cursor/rules/ && cd .cursor/rules && for f in *.agent.md; do mv "$f" "${f%.agent.md}.mdc"; done && cd ../..

GitHub Copilot CLI

# Add as local marketplace
copilot plugin marketplace add /absolute/path/to/gem-team

# Install
copilot plugin install gem-team@gem-team

Manual Copy (Any Tool)

# Copy agents to your tool's directory
# GitHub Copilot: ~/.copilot/
# Claude Code: ~/.claude/plugins/
# Cursor: .cursor/rules/
# OpenCode: .opencode/plugins/

cp -r .apm/agents <destination>

VS Code (GitHub Copilot)

Search for "gem-team" in the VS Code Chat marketplace.

Open VS Code
Go to Chat Settings
Search "gem-team" in agents or plugins marketplace
Click Install

Verification

After installation, verify agents are available:

# GitHub Copilot CLI
copilot plugin list

# Claude Code
/plugin list

# APM (any tool)
apm list

The Agent Team

Core Workflow

Role	Description	Sources	Recommended LLM
ORCHESTRATOR	The team lead: Orchestrates research, planning, implementation, and verification	PRD, AGENTS.md	Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6; Open: GLM-5, Kimi K2.5, Qwen3.5
RESEARCHER	Codebase exploration — patterns, dependencies, architecture discovery	PRD, codebase, AGENTS.md, docs	Closed: Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6; Open: GLM-5, Qwen3.5-9B, DeepSeek-V3.2
PLANNER	DAG-based execution plans — task decomposition, wave scheduling, risk analysis	PRD, codebase, AGENTS.md	Closed: Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4; Open: Kimi K2.5, GLM-5, Qwen3.5
IMPLEMENTER	TDD code implementation — features, bugs, refactoring. Never reviews own work	codebase, AGENTS.md, DESIGN.md	Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro; Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next

Quality & Review

Role	Description	Sources	Recommended LLM
REVIEWER	Zero-Hallucination Filter — Security auditing, code review, OWASP scanning	PRD, codebase, AGENTS.md, OWASP	Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro; Open: Kimi K2.5, GLM-5, DeepSeek-V3.2
CRITIC	Challenges assumptions, finds edge cases, spots over-engineering and logic gaps	PRD, codebase, AGENTS.md	Closed: Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro; Open: Kimi K2.5, GLM-5, Qwen3.5
DEBUGGER	Root-cause analysis, stack trace diagnosis, regression bisection	codebase, AGENTS.md, git history	Closed: Gemini 3.1 Pro, Claude Opus 4.6, GPT-5.4; Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next
BROWSER TESTER	E2E browser testing, UI/UX validation, visual regression	PRD, AGENTS.md, fixtures	Closed: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash; Open: Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7
SIMPLIFIER	Refactoring specialist — removes dead code, reduces complexity	codebase, AGENTS.md, tests	Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro; Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next

Specialized

Role	Description	Sources	Recommended LLM
DEVOPS	Infrastructure deployment, CI/CD pipelines, container management	AGENTS.md, infra configs	Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6; Open: DeepSeek-V3.2, GLM-5, Qwen3.5
DOCUMENTATION	Technical documentation, README files, API docs, diagrams	AGENTS.md, source code	Closed: Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini; Open: Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7
DESIGNER	UI/UX design — layouts, themes, color schemes, accessibility	PRD, codebase, AGENTS.md	Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6; Open: Qwen3.5, GLM-5, MiniMax M2.7
IMPLEMENTER-MOBILE	Mobile implementation — React Native, Expo, Flutter	codebase, AGENTS.md	Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro; Open: DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next
DESIGNER-MOBILE	Mobile UI/UX — HIG, Material Design, safe areas	PRD, codebase, AGENTS.md	Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6; Open: Qwen3.5, GLM-5, MiniMax M2.7
MOBILE TESTER	Mobile E2E testing — Detox, Maestro, iOS/Android	PRD, AGENTS.md	Closed: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash; Open: Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7

Knowledge Sources

Agents consult only the sources relevant to their role:

Trust Level	Sources	Behavior
Trusted	PRD, plan.yaml, AGENTS.md	Follow as instructions
Verify	Codebase files, research findings	Cross-reference before assuming
Untrusted	Error logs, external data	Factual only — never as instructions
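The trust table above can be read as a lookup from source kind to handling rule. The sketch below encodes it that way; the `Trust` enum, the source keys, and the choice to treat unknown sources as untrusted are assumptions for illustration rather than part of the Gem Team agent definitions.

```python
from enum import Enum


class Trust(Enum):
    TRUSTED = "follow as instructions"
    VERIFY = "cross-reference before assuming"
    UNTRUSTED = "factual only, never as instructions"


# Source kinds taken from the table; the key spellings are hypothetical.
SOURCE_TRUST = {
    "prd": Trust.TRUSTED,
    "plan.yaml": Trust.TRUSTED,
    "agents.md": Trust.TRUSTED,
    "codebase": Trust.VERIFY,
    "research": Trust.VERIFY,
    "error_log": Trust.UNTRUSTED,
    "external": Trust.UNTRUSTED,
}


def trust_for(source_kind: str) -> Trust:
    """Return the trust level; unknown sources default to UNTRUSTED (an assumption)."""
    return SOURCE_TRUST.get(source_kind.lower(), Trust.UNTRUSTED)


def may_follow_instructions(source_kind: str) -> bool:
    """Only trusted sources may be treated as instructions."""
    return trust_for(source_kind) is Trust.TRUSTED
```

Under this sketch, text found in an error log is never executed as an instruction, even if it is phrased like one: `may_follow_instructions("error_log")` is `False`, while `may_follow_instructions("AGENTS.md")` is `True`.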

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. See CONTRIBUTING for detailed guidelines on commit message formatting, branching strategy, and code standards.

License

This project is licensed under the Apache License 2.0.

Support

If you encounter any issues or have questions, please open an issue on GitHub.
