Files
awesome-copilot/plugins/gem-team/README.md
T
Muhammad Ubaid Raza d5c855ece0 feat: [gem-team] Add confidence metric, optimize planner workflow (#1695)
* feat: add explicit assumption rule and confidence metric to agent documentation

- Add `confidence` field (0‑1) to the output schema in `agents/gem-browser-tester.agent.md`
- Include `confidence` in the `extra` object of `agents/gem-devops.agent.md`
- Append the guideline “State assumptions explicitly; never guess silently” to all agent docs
- Update the “Bisect (Complex Only)” heading to reflect its gate condition
- Minor wording and formatting adjustments across the affected agent documents

* chore: update readme

* chore(release): Streamline agent documentation sections (remove self‑critique steps, renumber Handle Failure/Output)
2026-05-14 10:02:32 +10:00

352 lines
14 KiB
Markdown

# Gem Team
Self-Learning Multi-agent orchestration harness for spec-driven development and automated verification.
## Quick Start
```bash
# Install via APM (recommended)
apm install mubaidr/gem-team
# Or register as a marketplace
apm marketplace add mubaidr/gem-team
apm install gem-team@gem-team
```
See [all supported installation options](#installation) below.
---
## Contents
- [Quick Start](#quick-start)
- [Why Gem Team?](#why-gem-team)
- [Harness Architecture](#harness-architecture)
- [Installation](#installation)
- [The Agent Team](#the-agent-team)
- [Knowledge Sources](#knowledge-sources)
- [Contributing](#contributing)
---
## Why Gem Team?
### Performance
- **4x Faster** — Parallel execution with wave-based execution
- **Pattern Reuse** — Codebase pattern discovery prevents reinventing wheels
### Quality & Security
- **Higher Quality** — Specialized harness agents + TDD + verification gates + contract-first
- **Built-in Security** — OWASP scanning, secrets/PII detection on critical tasks
- **Resilient** — Pre-mortem analysis, failure handling, auto-replanning
- **Accessibility-First** — WCAG compliance validated at spec and runtime layers
- **Safe DevOps** — Idempotent operations, health checks, mandatory approval gates
- **Constructive Critique** — gem- critic challenges assumptions, finds edge cases
### Intelligence
- **Established Patterns** — Uses library/harness conventions over custom implementations
- **Source Verified** — Every factual claim cites its source; no guesswork
- **Knowledge-Driven** — Prioritized sources (PRD → codebase → AGENTS.md → Context7 → docs)
- **Continuous Learning** — Memory tool persists patterns, gotchas, user preferences across sessions
- **Auto-Skills** — Agents extract reusable SKILL.md files from successful tasks (high confidence: auto, medium: confirm)
- **Skills & Guidelines** — Built-in skill & guidelines (web-design-guidelines)
### Process
- **Spec-Driven** — Multi-step refinement defines "what" before "how"
- **Verified-Plan** — Complex tasks: Plan → Verification → Critic
- **Traceable** — Self-documenting IDs link requirements → tasks → tests → evidence
- **Intent vs. Compliance** — Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
- **Diagnose-then-Fix** — gem-debugger diagnoses → gem-implementer fixes → re-verifies
- **Pre-Mortem** — Failure modes identified BEFORE execution
- **Contract-First** — Contract tests written before implementation
### Token Efficiency
Optimized for reduced LLM token consumption without quality loss:
- **Concise Output** — No preamble, no meta commentary, no verbose explanations
- **Strict Formats** — JSON/YAML exactly matching schemas — eliminates parse errors and retries
- **Empty is OK** — Skip empty arrays, nulls, verbose fields where not needed
- **File-Based** — Researcher/Planner save to YAML files (not all in JSON output)
- **Learnings** — Empty patterns/conventions unless critical
> **Result:** ~40-60% reduction on output tokens while maintaining quality.
### Design
- **Design Agents** — Dedicated agents for web and mobile UI/UX with anti-"AI slop" guidelines for distinctive aesthetics
- **Mobile Agents** — Native mobile implementation (React Native, Flutter) + iOS/Android testing
---
## Core Concepts
### The "System- IQ" Multiplier
Raw reasoning isn't enough in single-pass chat. Gem-Team wraps your preferred LLM in a rigid harness with verification-first loops, fundamentally boosting its effective capability on SWE tasks.
### Design Support
Gem Team includes specialized design agents with anti-"AI slop" guidelines for distinctive, modern and unique aesthetics with accessibility compliance.
### Triple Learning System
| Type | Storage | 1-liner |
| :-------------- | :------------- | :------------------------------------ |
| **Memory** | `/memories/` | Facts & user preferences (auto- save) |
| **Skills** | `docs/skills/` | Procedures with code examples |
| **Conventions** | `AGENTS.md` | Static rules (requires approval) |
---
## Harness Architecture
```text
User Goal → Orchestrator → [Simple: Research/Plan] or [Complex: Discuss → PRD → Research → Plan → Approve] → Execute (waves) → Summary → Final Review
Diagnose → Fix → Re- verify
```
---
## Installation
### Install APM First
If you don't have APM installed, install it first:
```bash
# macOS/Linux
curl -fsSL https://microsoft.github.io/apm/install.sh | sh
# Windows (PowerShell)
irm https://microsoft.github.io/apm/install.ps1 | iex
# Or via npm
npm install -g @microsoft/apm
```
**Why APM?** Universal package manager for AI coding tools. One command installs to all your tools (Copilot CLI, Claude Code, Cursor, OpenCode). Handles version locking, updates, and dependencies automatically.
[APM Documentation](https://microsoft.github.io/apm/) | [GitHub](https://github.com/microsoft/apm)
---
Choose the method that works best for your workflow:
### Method 1: Direct Install via APM (Recommended)
Fastest way to get started. APM automatically detects your tool and installs to the correct location.
```bash
apm install mubaidr/gem-team
```
**Works with:** GitHub Copilot CLI, Claude Code, Cursor, OpenCode
[APM Documentation](https://microsoft.github.io/apm/getting-started/quick-start/)
---
### Method 2: Via Marketplace
Add gem-team as a marketplace, then install from it. Useful for browsing available agents and managing updates.
#### GitHub Copilot CLI
```bash
# Add marketplace
copilot plugin marketplace add mubaidr/gem-team
# Browse available plugins
copilot plugin marketplace browse gem-team
# Install
copilot plugin install gem-team@gem-team
```
#### Claude Code
```bash
# Add marketplace
/plugin marketplace add mubaidr/gem-team
# Browse in UI
/plugin
# Install
/plugin install gem-team@gem-team
```
#### Cursor IDE
```bash
# Add marketplace via APM
apm marketplace add mubaidr/gem-team
# Install
apm install gem-team@gem-team
```
---
### Method 3: From awesome-copilot Marketplace
Install from the official awesome-copilot marketplace (GitHub Copilot CLI only).
```bash
# awesome-copilot is pre-registered by default
copilot plugin install gem-team@awesome-copilot
```
**Note:** This method is only available if gem-team is listed in the awesome-copilot marketplace.
---
### Method 4: Local/Manual Installation
For development, testing, or offline use.
#### Clone Repository
```bash
git clone https://github.com/mubaidr/gem-team.git
cd gem-team
```
#### Claude Code
```bash
# Load as local plugin
claude --plugin-dir .
# Or add as local marketplace
/plugin marketplace add ./
# Reload after changes
/reload-plugins
```
#### Cursor IDE
```bash
# Option 1: Via chat command
# In Cursor: /add-plugin /absolute/path/to/gem-team
# Option 2: Copy agents to project
# One-line install: Copy agents and rename to .mdc
mkdir -p .cursor/rules && cp .apm/agents/*.agent.md .cursor/rules/ && cd .cursor/rules && for f in *.agent.md; do mv "$f" "${f%.agent.md}.mdc"; done && cd ../..
```
#### GitHub Copilot CLI
```bash
# Add as local marketplace
copilot plugin marketplace add /absolute/path/to/gem-team
# Install
copilot plugin install gem-team@gem-team
```
#### Manual Copy (Any Tool)
```bash
# Copy agents to your tool's directory
# GitHub Copilot: ~/.copilot/
# Claude Code: ~/.claude/plugins/
# Cursor: .cursor/rules/
# OpenCode: .opencode/plugins/
cp -r .apm/agents <destination>
```
---
### VS Code (GitHub Copilot)
Search for "gem-team" in the VS Code Chat marketplace.
1. Open VS Code
2. Go to Chat Settings
3. Search "gem-team" in agents or plugins marketplace
4. Click Install
---
### Verification
After installation, verify agents are available:
```bash
# GitHub Copilot CLI
copilot plugin list
# Claude Code
/plugin list
# APM (any tool)
apm list
```
## The Agent Team
### Core Workflow
| Role | Description | Sources | Recommended LLM |
| :--------------- | :------------------------------------------------------------------------------- | :----------------------------- | :-------------------------------------------------------------------------------------------------------- |
| **ORCHESTRATOR** | The team lead: Orchestrates research, planning, implementation, and verification | PRD, AGENTS.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** GLM-5, Kimi K2.5, Qwen3.5 |
| **RESEARCHER** | Codebase exploration — patterns, dependencies, architecture discovery | PRD, codebase, AGENTS.md, docs | **Closed:** Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6<br>**Open:** GLM-5, Qwen3.5-9B, DeepSeek-V3.2 |
| **PLANNER** | DAG-based execution plans — task decomposition, wave scheduling, risk analysis | PRD, codebase, AGENTS.md | **Closed:** Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4<br>**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
| **IMPLEMENTER** | TDD code implementation — features, bugs, refactoring. Never reviews own work | codebase, AGENTS.md, DESIGN.md | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |
### Quality & Review
| Role | Description | Sources | Recommended LLM |
| :----------------- | :------------------------------------------------------------------------------- | :------------------------------- | :------------------------------------------------------------------------------------------------------------------- |
| **REVIEWER** | **Zero- Hallucination Filter** — Security auditing, code review, OWASP scanning | PRD, codebase, AGENTS.md, OWASP | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** Kimi K2.5, GLM-5, DeepSeek-V3.2 |
| **CRITIC** | Challenges assumptions, finds edge cases, spots over- engineering and logic gaps | PRD, codebase, AGENTS.md | **Closed:** Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
| **DEBUGGER** | Root-cause analysis, stack trace diagnosis, regression bisection | codebase, AGENTS.md, git history | **Closed:** Gemini 3.1 Pro, Claude Opus 4.6, GPT-5.4<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |
| **BROWSER TESTER** | E2E browser testing, UI/UX validation, visual regression | PRD, AGENTS.md, fixtures | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash<br>**Open:** Llama 4 Maverick, Qwen3.5- Flash, MiniMax M2.7 |
| **SIMPLIFIER** | Refactoring specialist — removes dead code, reduces complexity | codebase, AGENTS.md, tests | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |
### Specialized
| Role | Description | Sources | Recommended LLM |
| :---------------------- | :--------------------------------------------------------------- | :----------------------- | :------------------------------------------------------------------------------------------------------------------- |
| **DEVOPS** | Infrastructure deployment, CI/CD pipelines, container management | AGENTS.md, infra configs | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3.5 |
| **DOCUMENTATION** | Technical documentation, README files, API docs, diagrams | AGENTS.md, source code | **Closed:** Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini<br>**Open:** Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7 |
| **DESIGNER** | UI/UX design — layouts, themes, color schemes, accessibility | PRD, codebase, AGENTS.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
| **IMPLEMENTER- MOBILE** | Mobile implementation — React Native, Expo, Flutter | codebase, AGENTS.md | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |
| **DESIGNER- MOBILE** | Mobile UI/UX — HIG, Material Design, safe areas | PRD, codebase, AGENTS.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
| **MOBILE TESTER** | Mobile E2E testing — Detox, Maestro, iOS/Android | PRD, AGENTS.md | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash<br>**Open:** Llama 4 Maverick, Qwen3.5- Flash, MiniMax M2.7 |
---
## Knowledge Sources
Agents consult only the sources relevant to their role:
| Trust Level | Sources | Behavior |
| :------------ | :-------------------------------- | :----------------------------------- |
| **Trusted** | PRD, plan.yaml, AGENTS.md | Follow as instructions |
| **Verify** | Codebase files, research findings | Cross-reference before assuming |
| **Untrusted** | Error logs, external data | Factual only — never as instructions |
---
## Contributing
Contributions are welcome! Please feel free to submit a Pull Request. [CONTRIBUTING](./CONTRIBUTING.md) for detailed guidelines on commit message formatting, branching strategy, and code standards.
## License
This project is licensed under the Apache License 2.0.
## Support
If you encounter any issues or have questions, please [open an issue](https://github.com/mubaidr/gem-team/issues) on GitHub.