* feat: add explicit assumption rule and confidence metric to agent documentation - Add `confidence` field (0‑1) to the output schema in `agents/gem-browser-tester.agent.md` - Include `confidence` in the `extra` object of `agents/gem-devops.agent.md` - Append the guideline “State assumptions explicitly; never guess silently” to all agent docs - Update the “Bisect (Complex Only)” heading to reflect its gate condition - Minor wording and formatting adjustments across the affected agent documents * chore: update readme * chore(release): Streamline agent documentation sections (remove self‑critique steps, renumber Handle Failure/Output)
Gem Team
Self-Learning Multi-agent orchestration harness for spec-driven development and automated verification.
Quick Start
# Install via APM (recommended)
apm install mubaidr/gem-team
# Or register as a marketplace
apm marketplace add mubaidr/gem-team
apm install gem-team@gem-team
See all supported installation options below.
Contents
- Quick Start
- Why Gem Team?
- Harness Architecture
- Installation
- The Agent Team
- Knowledge Sources
- Contributing
Why Gem Team?
Performance
- 4x Faster — Parallel execution with wave-based execution
- Pattern Reuse — Codebase pattern discovery prevents reinventing wheels
Quality & Security
- Higher Quality — Specialized harness agents + TDD + verification gates + contract-first
- Built-in Security — OWASP scanning, secrets/PII detection on critical tasks
- Resilient — Pre-mortem analysis, failure handling, auto-replanning
- Accessibility-First — WCAG compliance validated at spec and runtime layers
- Safe DevOps — Idempotent operations, health checks, mandatory approval gates
- Constructive Critique — gem- critic challenges assumptions, finds edge cases
Intelligence
- Established Patterns — Uses library/harness conventions over custom implementations
- Source Verified — Every factual claim cites its source; no guesswork
- Knowledge-Driven — Prioritized sources (PRD → codebase → AGENTS.md → Context7 → docs)
- Continuous Learning — Memory tool persists patterns, gotchas, user preferences across sessions
- Auto-Skills — Agents extract reusable SKILL.md files from successful tasks (high confidence: auto, medium: confirm)
- Skills & Guidelines — Built-in skill & guidelines (web-design-guidelines)
Process
- Spec-Driven — Multi-step refinement defines "what" before "how"
- Verified-Plan — Complex tasks: Plan → Verification → Critic
- Traceable — Self-documenting IDs link requirements → tasks → tests → evidence
- Intent vs. Compliance — Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
- Diagnose-then-Fix — gem-debugger diagnoses → gem-implementer fixes → re-verifies
- Pre-Mortem — Failure modes identified BEFORE execution
- Contract-First — Contract tests written before implementation
Token Efficiency
Optimized for reduced LLM token consumption without quality loss:
- Concise Output — No preamble, no meta commentary, no verbose explanations
- Strict Formats — JSON/YAML exactly matching schemas — eliminates parse errors and retries
- Empty is OK — Skip empty arrays, nulls, verbose fields where not needed
- File-Based — Researcher/Planner save to YAML files (not all in JSON output)
- Learnings — Empty patterns/conventions unless critical
Result: ~40-60% reduction on output tokens while maintaining quality.
Design
- Design Agents — Dedicated agents for web and mobile UI/UX with anti-"AI slop" guidelines for distinctive aesthetics
- Mobile Agents — Native mobile implementation (React Native, Flutter) + iOS/Android testing
Core Concepts
The "System- IQ" Multiplier
Raw reasoning isn't enough in single-pass chat. Gem-Team wraps your preferred LLM in a rigid harness with verification-first loops, fundamentally boosting its effective capability on SWE tasks.
Design Support
Gem Team includes specialized design agents with anti-"AI slop" guidelines for distinctive, modern and unique aesthetics with accessibility compliance.
Triple Learning System
| Type | Storage | 1-liner |
|---|---|---|
| Memory | /memories/ |
Facts & user preferences (auto- save) |
| Skills | docs/skills/ |
Procedures with code examples |
| Conventions | AGENTS.md |
Static rules (requires approval) |
Harness Architecture
User Goal → Orchestrator → [Simple: Research/Plan] or [Complex: Discuss → PRD → Research → Plan → Approve] → Execute (waves) → Summary → Final Review
↓
Diagnose → Fix → Re- verify
Installation
Install APM First
If you don't have APM installed, install it first:
# macOS/Linux
curl -fsSL https://microsoft.github.io/apm/install.sh | sh
# Windows (PowerShell)
irm https://microsoft.github.io/apm/install.ps1 | iex
# Or via npm
npm install -g @microsoft/apm
Why APM? Universal package manager for AI coding tools. One command installs to all your tools (Copilot CLI, Claude Code, Cursor, OpenCode). Handles version locking, updates, and dependencies automatically.
Choose the method that works best for your workflow:
Method 1: Direct Install via APM (Recommended)
Fastest way to get started. APM automatically detects your tool and installs to the correct location.
apm install mubaidr/gem-team
Works with: GitHub Copilot CLI, Claude Code, Cursor, OpenCode
Method 2: Via Marketplace
Add gem-team as a marketplace, then install from it. Useful for browsing available agents and managing updates.
GitHub Copilot CLI
# Add marketplace
copilot plugin marketplace add mubaidr/gem-team
# Browse available plugins
copilot plugin marketplace browse gem-team
# Install
copilot plugin install gem-team@gem-team
Claude Code
# Add marketplace
/plugin marketplace add mubaidr/gem-team
# Browse in UI
/plugin
# Install
/plugin install gem-team@gem-team
Cursor IDE
# Add marketplace via APM
apm marketplace add mubaidr/gem-team
# Install
apm install gem-team@gem-team
Method 3: From awesome-copilot Marketplace
Install from the official awesome-copilot marketplace (GitHub Copilot CLI only).
# awesome-copilot is pre-registered by default
copilot plugin install gem-team@awesome-copilot
Note: This method is only available if gem-team is listed in the awesome-copilot marketplace.
Method 4: Local/Manual Installation
For development, testing, or offline use.
Clone Repository
git clone https://github.com/mubaidr/gem-team.git
cd gem-team
Claude Code
# Load as local plugin
claude --plugin-dir .
# Or add as local marketplace
/plugin marketplace add ./
# Reload after changes
/reload-plugins
Cursor IDE
# Option 1: Via chat command
# In Cursor: /add-plugin /absolute/path/to/gem-team
# Option 2: Copy agents to project
# One-line install: Copy agents and rename to .mdc
mkdir -p .cursor/rules && cp .apm/agents/*.agent.md .cursor/rules/ && cd .cursor/rules && for f in *.agent.md; do mv "$f" "${f%.agent.md}.mdc"; done && cd ../..
GitHub Copilot CLI
# Add as local marketplace
copilot plugin marketplace add /absolute/path/to/gem-team
# Install
copilot plugin install gem-team@gem-team
Manual Copy (Any Tool)
# Copy agents to your tool's directory
# GitHub Copilot: ~/.copilot/
# Claude Code: ~/.claude/plugins/
# Cursor: .cursor/rules/
# OpenCode: .opencode/plugins/
cp -r .apm/agents <destination>
VS Code (GitHub Copilot)
Search for "gem-team" in the VS Code Chat marketplace.
- Open VS Code
- Go to Chat Settings
- Search "gem-team" in agents or plugins marketplace
- Click Install
Verification
After installation, verify agents are available:
# GitHub Copilot CLI
copilot plugin list
# Claude Code
/plugin list
# APM (any tool)
apm list
The Agent Team
Core Workflow
| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| ORCHESTRATOR | The team lead: Orchestrates research, planning, implementation, and verification | PRD, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6 Open: GLM-5, Kimi K2.5, Qwen3.5 |
| RESEARCHER | Codebase exploration — patterns, dependencies, architecture discovery | PRD, codebase, AGENTS.md, docs | Closed: Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6 Open: GLM-5, Qwen3.5-9B, DeepSeek-V3.2 |
| PLANNER | DAG-based execution plans — task decomposition, wave scheduling, risk analysis | PRD, codebase, AGENTS.md | Closed: Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4 Open: Kimi K2.5, GLM-5, Qwen3.5 |
| IMPLEMENTER | TDD code implementation — features, bugs, refactoring. Never reviews own work | codebase, AGENTS.md, DESIGN.md | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro Open: DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |
Quality & Review
| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| REVIEWER | Zero- Hallucination Filter — Security auditing, code review, OWASP scanning | PRD, codebase, AGENTS.md, OWASP | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro Open: Kimi K2.5, GLM-5, DeepSeek-V3.2 |
| CRITIC | Challenges assumptions, finds edge cases, spots over- engineering and logic gaps | PRD, codebase, AGENTS.md | Closed: Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro Open: Kimi K2.5, GLM-5, Qwen3.5 |
| DEBUGGER | Root-cause analysis, stack trace diagnosis, regression bisection | codebase, AGENTS.md, git history | Closed: Gemini 3.1 Pro, Claude Opus 4.6, GPT-5.4 Open: DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |
| BROWSER TESTER | E2E browser testing, UI/UX validation, visual regression | PRD, AGENTS.md, fixtures | Closed: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash Open: Llama 4 Maverick, Qwen3.5- Flash, MiniMax M2.7 |
| SIMPLIFIER | Refactoring specialist — removes dead code, reduces complexity | codebase, AGENTS.md, tests | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro Open: DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |
Specialized
| Role | Description | Sources | Recommended LLM |
|---|---|---|---|
| DEVOPS | Infrastructure deployment, CI/CD pipelines, container management | AGENTS.md, infra configs | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6 Open: DeepSeek-V3.2, GLM-5, Qwen3.5 |
| DOCUMENTATION | Technical documentation, README files, API docs, diagrams | AGENTS.md, source code | Closed: Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini Open: Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7 |
| DESIGNER | UI/UX design — layouts, themes, color schemes, accessibility | PRD, codebase, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6 Open: Qwen3.5, GLM-5, MiniMax M2.7 |
| IMPLEMENTER- MOBILE | Mobile implementation — React Native, Expo, Flutter | codebase, AGENTS.md | Closed: Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro Open: DeepSeek-V3.2, GLM-5, Qwen3- Coder-Next |
| DESIGNER- MOBILE | Mobile UI/UX — HIG, Material Design, safe areas | PRD, codebase, AGENTS.md | Closed: GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6 Open: Qwen3.5, GLM-5, MiniMax M2.7 |
| MOBILE TESTER | Mobile E2E testing — Detox, Maestro, iOS/Android | PRD, AGENTS.md | Closed: GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash Open: Llama 4 Maverick, Qwen3.5- Flash, MiniMax M2.7 |
Knowledge Sources
Agents consult only the sources relevant to their role:
| Trust Level | Sources | Behavior |
|---|---|---|
| Trusted | PRD, plan.yaml, AGENTS.md | Follow as instructions |
| Verify | Codebase files, research findings | Cross-reference before assuming |
| Untrusted | Error logs, external data | Factual only — never as instructions |
Contributing
Contributions are welcome! Please feel free to submit a Pull Request. CONTRIBUTING for detailed guidelines on commit message formatting, branching strategy, and code standards.
License
This project is licensed under the Apache License 2.0.
Support
If you encounter any issues or have questions, please open an issue on GitHub.