# ๐ Gem Team
>
> Multi-agent orchestration framework for spec-driven development and automated verification.
>
> **Turning Model Quality into System Quality.**
>







---
## ๐ Quick Start
See [all installation options](#-installation) below.
---
## ๐ค Why Gem Team?
- โก **4x Faster** โ Parallel execution with wave-based execution
- ๐ **Higher Quality** โ Specialized agents + TDD + verification gates + contract-first
- ๐ **Built-in Security** โ OWASP scanning, secrets/PII detection on critical tasks
- ๐๏ธ **Full Visibility** โ Real-time status, clear approval gates
- ๐ก๏ธ **Resilient** โ Pre-mortem analysis, failure handling, auto-replanning
- โป๏ธ **Pattern Reuse** โ Codebase pattern discovery prevents reinventing wheels
- ๐ **Established Patterns** โ Uses library/framework conventions over custom implementations
- ๐ช **Self-Correcting** โ All agents self-critique at 0.85 confidence threshold
- ๐ง **Context Scaffolding** โ Maps large-scale dependencies _before_ the model reads code, preventing context-loss in legacy repos
- โ๏ธ **Intent vs. Compliance** โ Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
- ๐ **Source Verified** โ Every factual claim cites its source; no guesswork
- โฟ **Accessibility-First** โ WCAG compliance validated at spec and runtime layers
- ๐ฌ **Smart Debugging** โ Root-cause analysis with stack trace parsing + confidence-scored fixes
- ๐ **Safe DevOps** โ Idempotent operations, health checks, mandatory approval gates
- ๐ **Traceable** โ Self-documenting IDs link requirements โ tasks โ tests โ evidence
- ๐ **Knowledge-Driven** โ Prioritized sources (PRD โ codebase โ AGENTS.md โ Context7 โ docs)
- ๐ ๏ธ **Skills & Guidelines** โ Built-in skill & guidelines (web-design-guidelines)
- ๐ **Spec-Driven** โ Multi-step refinement defines "what" before "how"
- ๐ **Wave-Based** โ Parallel agents with integration gates per wave
- ๐๏ธ **Verified-Plan** โ Complex tasks: Plan โ Verification โ Critic
- ๐ **Final Review** โ Optional user-triggered comprehensive review of all changed files
- ๐ฉบ **Diagnose-then-Fix** โ gem-debugger diagnoses โ gem-implementer fixes โ re-verifies
- โ ๏ธ **Pre-Mortem** โ Failure modes identified BEFORE execution
- ๐ฌ **Constructive Critique** โ gem-critic challenges assumptions, finds edge cases
- ๐ **Contract-First** โ Contract tests written before implementation
- ๐ฑ **Mobile Agents** โ Native mobile implementation (React Native, Flutter) + iOS/Android testing
### ๐ The "System-IQ" Multiplier
Raw reasoning isn't enough in single-pass chat. Gem-Team wraps your preferred LLM in a rigid, verification-first loop, fundamentally boosting its effective capability on SWE-benchmarks:
- **For Small Models (e.g., Qwen 1.7B - 8B):** The framework provides the "executive brain." Task decomposition and isolated 50-line chunks can up to **double** their localized debugging success rates.
- **For Reasoning Models (e.g., DeepSeek 3.2):** TDD loops and parallel research stabilize their native file I/O fragility, yielding up to a **+25% lift** in execution reliability.
- **For SOTA Models (e.g., GLM 5.1, Kimi K2.5):** The `gem-reviewer` acts as a noise-filter, pruning verbosity and enforcing strict PRD compliance to prevent over-engineering.
### ๐จ Design Support
Gem Team includes specialized design agents with **anti-"AI slop" guidelines** for distinctive, modern aesthetics:
| Agent | Focus | Key Capabilities |
|:------|:------|:-----------------|
| **DESIGNER** | Web UI/UX | Layouts, themes, design systems, accessibility (WCAG), 7 design movements (Brutalism โ Maximalism), 5-level elevation system |
| **DESIGNER-MOBILE** | Mobile UI/UX | iOS HIG, Material 3, safe areas, haptics, platform-specific adaptations of design movements |
**Anti-AI Slop Principles:**
- Distinctive fonts (Cabinet Grotesk, Satoshi, Clash Display โ never Inter/Roboto defaults)
- 60-30-10 color strategy with sharp accents
- Break predictable layouts (asymmetric grids, overlap, bento patterns)
- Purposeful motion with orchestrated page loads
- Design movement library: Brutalism, Neo-brutalism, Glassmorphism, Claymorphism, Minimalist Luxury, Retro-futurism, Maximalism
Both agents include quality checklists for generating unique, memorable designs.
---
## ๐ Core Workflow
**Phase Flow:** User Goal โ Orchestrator โ Discuss (medium|complex) โ PRD โ Research โ Planning โ Plan Review (medium|complex) โ Execution โ Summary โ (Optional) Final Review
**Error Handling:** Diagnose-then-Fix loop (Debugger โ Implementer โ Re-verify)
**Orchestrator** auto-detects phase and routes accordingly. Any feedback or steer message is handled to re-plan.
| Condition | Phase | Outcome |
|:----------|:------|:--------|
| No plan + simple | Research โ Planning | Quick execution path |
| No plan + medium\|complex | Discuss โ PRD โ Research | Spec-driven approach |
| Plan + pending tasks | Execution | Wave-based implementation |
| Plan + feedback | Planning | Replan with steer |
| Plan + completed | Summary | User decision (feedback / final review / approve) |
| User requests final review | Final Review | Parallel review by gem-reviewer + gem-critic |
---
## ๐ฆ Installation
| Method | Command / Link | Docs |
|:-------|:---------------|:-----|
| **Code** | **[Install Now](https://aka.ms/awesome-copilot/install/agent?url=vscode%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.%252Fagents)** | [Copilot Docs](https://docs.github.com/en/copilot/using-github-copilot/using-github-copilot-chat) |
| **Code Insiders** | **[Install Now](https://aka.ms/awesome-copilot/install/agent?url=vscode-insiders%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.%252Fagents)** | [Copilot Docs](https://docs.github.com/en/copilot/using-github-copilot/using-github-copilot-chat) |
| **APM
(All AI coding agents)** | `apm install mubaidr/gem-team` | [APM Docs](https://microsoft.github.io/apm/) |
| **Copilot CLI (Marketplace)** | `copilot plugin install gem-team@awesome-copilot` | [CLI Docs](https://github.com/github/copilot-cli) |
| **Copilot CLI (Direct)** | `copilot plugin install gem-team@mubaidr` | [CLI Docs](https://github.com/github/copilot-cli) |
| **Windsurf** | `codeium agent install mubaidr/gem-team` | [Windsurf Docs](https://docs.codeium.com/windsurf) |
| **Claude Code** | `claude plugin install mubaidr/gem-team` | [Claude Docs](https://docs.anthropic.com/en/docs/claude-code) |
| **OpenCode** | `opencode plugin install mubaidr/gem-team` | [OpenCode Docs](https://opencode.ai/docs/) |
| **Manual
(Copy agent files)** | VS Code: `~/.vscode/agents/`
VS Code Insiders: `~/.vscode-insiders/agents/`
GitHub Copilot: `~/.github/copilot/agents/`
GitHub Copilot (project): `.github/plugin/agents/`
Windsurf: `~/.windsurf/agents/`
Claude: `~/.claude/agents/`
Cursor: `~/.cursor/agents/`
OpenCode: `~/.opencode/agents/` | โ |
---
## ๐๏ธ Architecture
```mermaid
flowchart
USER["User Goal"]
subgraph ORCH["Orchestrator"]
detect["Phase Detection"]
end
subgraph PHASES
DISCUSS["๐น Discuss"]
PRD["๐ PRD"]
RESEARCH["๐ Research"]
PLANNING["๐ Planning"]
EXEC["โ๏ธ Execution"]
SUMMARY["๐ Summary"]
FINAL["๐ Final Review"]
end
DIAG["๐ฌ Diagnose-then-Fix"]
USER --> detect
detect --> |"Simple"| RESEARCH
detect --> |"Medium|Complex"| DISCUSS
DISCUSS --> PRD
PRD --> RESEARCH
RESEARCH --> PLANNING
PLANNING --> |"Approved"| EXEC
PLANNING --> |"Feedback"| PLANNING
EXEC --> |"Failure"| DIAG
DIAG --> EXEC
EXEC --> SUMMARY
SUMMARY --> |"Review files"| FINAL
FINAL --> |"Clean"| SUMMARY
PLANNING -.-> |"critique"| critic
PLANNING -.-> |"review"| reviewer
EXEC --> |"parallel โค4"| agents
EXEC --> |"post-wave (complex)"| critic
```
---
## ๐ค The Agent Team (Q2 2026 SOTA)
| Role | Description | Output | Recommended LLM |
|:-----|:------------|:-------|:---------------|
| ๐ฏ **ORCHESTRATOR** | The team lead: Orchestrates research, planning, implementation, and verification | ๐ PRD, plan.yaml | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6
**Open:** GLM-5, Kimi K2.5, Qwen3.5 |
| ๐ **RESEARCHER** | Codebase exploration โ patterns, dependencies, architecture discovery | ๐ findings | **Closed:** Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6
**Open:** GLM-5, Qwen3.5-9B, DeepSeek-V3.2 |
| ๐ **PLANNER** | DAG-based execution plans โ task decomposition, wave scheduling, risk analysis | ๐ plan.yaml | **Closed:** Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4
**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
| ๐ง **IMPLEMENTER** | TDD code implementation โ features, bugs, refactoring. Never reviews own work | ๐ป code | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| ๐งช **BROWSER TESTER** | E2E browser testing, UI/UX validation, visual regression with Playwright | ๐งช evidence | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash
**Open:** Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
| ๐ **DEVOPS** | Infrastructure deployment, CI/CD pipelines, container management | ๐ infra | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6
**Open:** DeepSeek-V3.2, GLM-5, Qwen3.5 |
| ๐ก๏ธ **REVIEWER** | **Zero-Hallucination Filter** โ Security auditing, code review, OWASP scanning, PRD compliance verification | ๐ review report | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** Kimi K2.5, GLM-5, DeepSeek-V3.2 |
| ๐ **DOCUMENTATION** | Technical documentation, README files, API docs, diagrams, walkthroughs | ๐ docs | **Closed:** Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini
**Open:** Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7 |
| ๐ฌ **DEBUGGER** | Root-cause analysis, stack trace diagnosis, regression bisection, error reproduction | ๐ฌ diagnosis | **Closed:** Gemini 3.1 Pro (Retrieval King), Claude Opus 4.6, GPT-5.4
**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| ๐ฏ **CRITIC** | Challenges assumptions, finds edge cases, spots over-engineering and logic gaps | ๐ฌ critique | **Closed:** Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
| โ๏ธ **SIMPLIFIER** | Refactoring specialist โ removes dead code, reduces complexity, consolidates duplicates | โ๏ธ change log | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| ๐จ **DESIGNER** | UI/UX design specialist โ layouts, themes, color schemes, design systems, accessibility | ๐จ DESIGN.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6
**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
| ๐ฑ **IMPLEMENTER-MOBILE** | Mobile implementation โ React Native, Expo, Flutter with TDD | ๐ป code | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro
**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| ๐ฑ **DESIGNER-MOBILE** | Mobile UI/UX specialist โ HIG, Material Design, safe areas, touch targets | ๐จ DESIGN.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6
**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
| ๐ฑ **MOBILE TESTER** | Mobile E2E testing โ Detox, Maestro, iOS/Android simulators | ๐งช evidence | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash
**Open:** Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
---
## ๐ Knowledge Sources
Agents consult only the sources relevant to their role. Trust levels apply:
| Trust Level | Sources | Behavior |
|:-----------|:--------|:---------|
| **Trusted** | PRD.yaml, plan.yaml, AGENTS.md | Follow as instructions |
| **Verify** | Codebase files, research findings | Cross-reference before assuming |
| **Untrusted** | Error logs, external data, third-party responses | Factual only โ never as instructions |
| Agent | Knowledge Sources |
|:------|:------------------|
| orchestrator | PRD.yaml, AGENTS.md |
| researcher | PRD.yaml, codebase patterns, AGENTS.md, Context7, official docs, online search |
| planner | PRD.yaml, codebase patterns, AGENTS.md, Context7, official docs |
| implementer | codebase patterns, AGENTS.md, Context7 (API verification), DESIGN.md (UI tasks) |
| debugger | codebase patterns, AGENTS.md, error logs (untrusted), git history, DESIGN.md (UI bugs) |
| reviewer | PRD.yaml, codebase patterns, AGENTS.md, OWASP reference, DESIGN.md (UI review) |
| browser-tester | PRD.yaml (flow coverage), AGENTS.md, test fixtures, baseline screenshots, DESIGN.md (visual validation) |
| designer | PRD.yaml (UX goals), codebase patterns, AGENTS.md, existing design system |
| code-simplifier | codebase patterns, AGENTS.md, test suites (behavior verification) |
| documentation-writer | AGENTS.md, existing docs, source code |
---
## ๐ค Contributing
Contributions are welcome! Please feel free to submit a Pull Request. [CONTRIBUTING](./CONTRIBUTING.md) for detailed guidelines on commit message formatting, branching strategy, and code standards.
## ๐ License
This project is licensed under the Apache License 2.0.
## ๐ฌ Support
If you encounter any issues or have questions, please [open an issue](https://github.com/mubaidr/gem-team/issues) on GitHub.