# 💎 Gem Team

> Multi-agent orchestration framework for spec-driven development and automated verification.
>
> **Turning Model Quality into System Quality.**

![VS Code](https://img.shields.io/badge/VS_Code-5A6D7C?style=flat) ![VS Code Insiders](https://img.shields.io/badge/VS_Code_Insiders-5A6D7C?style=flat) ![Copilot CLI](https://img.shields.io/badge/Copilot_CLI-5A6D7C?style=flat) ![Cursor](https://img.shields.io/badge/Cursor-5A6D7C?style=flat) ![OpenCode](https://img.shields.io/badge/OpenCode-5A6D7C?style=flat) ![Claude Code](https://img.shields.io/badge/Claude_Code-5A6D7C?style=flat) ![Windsurf](https://img.shields.io/badge/Windsurf-5A6D7C?style=flat)

---

## 🚀 Quick Start

See [all installation options](#-installation) below.

---

## 🤔 Why Gem Team?

- ⚡ **4x Faster**: Parallel, wave-based execution
- 🏆 **Higher Quality**: Specialized agents + TDD + verification gates + contract-first
- 🔒 **Built-in Security**: OWASP scanning, secrets/PII detection on critical tasks
- 👁️ **Full Visibility**: Real-time status, clear approval gates
- 🛡️ **Resilient**: Pre-mortem analysis, failure handling, auto-replanning
- ♻️ **Pattern Reuse**: Codebase pattern discovery prevents reinventing wheels
- 📐 **Established Patterns**: Uses library/framework conventions over custom implementations
- 🪞 **Self-Correcting**: All agents self-critique at a 0.85 confidence threshold
- 🧠 **Context Scaffolding**: Maps large-scale dependencies _before_ the model reads code, preventing context loss in legacy repos
- ⚖️ **Intent vs. Compliance**: Shifts the burden from writing "perfect prompts" to enforcing strict, YAML-based approval gates
- 📋 **Source Verified**: Every factual claim cites its source; no guesswork
- ♿ **Accessibility-First**: WCAG compliance validated at spec and runtime layers
- 🔬 **Smart Debugging**: Root-cause analysis with stack trace parsing + confidence-scored fixes
- 🚀 **Safe DevOps**: Idempotent operations, health checks, mandatory approval gates
- 🔗 **Traceable**: Self-documenting IDs link requirements → tasks → tests → evidence
- 📚 **Knowledge-Driven**: Prioritized sources (PRD → codebase → AGENTS.md → Context7 → docs)
- 🛠️ **Skills & Guidelines**: Built-in skills and guidelines (web-design-guidelines)
- 📝 **Spec-Driven**: Multi-step refinement defines "what" before "how"
- 🌊 **Wave-Based**: Parallel agents with integration gates per wave
- 🗂️ **Verified-Plan**: Complex tasks: Plan → Verification → Critic
- 🔎 **Final Review**: Optional user-triggered comprehensive review of all changed files
- 🩺 **Diagnose-then-Fix**: gem-debugger diagnoses → gem-implementer fixes → re-verifies
- ⚠️ **Pre-Mortem**: Failure modes identified BEFORE execution
- 💬 **Constructive Critique**: gem-critic challenges assumptions, finds edge cases
- 📝 **Contract-First**: Contract tests written before implementation
- 📱 **Mobile Agents**: Native mobile implementation (React Native, Flutter) + iOS/Android testing

### 🚀 The "System-IQ" Multiplier

Raw reasoning isn't enough in single-pass chat. Gem Team wraps your preferred LLM in a rigid, verification-first loop, boosting its effective capability on SWE benchmarks:

- **For Small Models (e.g., Qwen 1.7B to 8B):** The framework provides the "executive brain." Task decomposition and isolated 50-line chunks can as much as **double** their localized debugging success rates.
- **For Reasoning Models (e.g., DeepSeek-V3.2):** TDD loops and parallel research stabilize their native file I/O fragility, yielding up to a **+25% lift** in execution reliability.
- **For SOTA Models (e.g., GLM 5.1, Kimi K2.5):** The `gem-reviewer` acts as a noise filter, pruning verbosity and enforcing strict PRD compliance to prevent over-engineering.

### 🎨 Design Support

Gem Team includes specialized design agents with **anti-"AI slop" guidelines** for distinctive, modern aesthetics:

| Agent | Focus | Key Capabilities |
|:------|:------|:-----------------|
| **DESIGNER** | Web UI/UX | Layouts, themes, design systems, accessibility (WCAG), 7 design movements (Brutalism → Maximalism), 5-level elevation system |
| **DESIGNER-MOBILE** | Mobile UI/UX | iOS HIG, Material 3, safe areas, haptics, platform-specific adaptations of design movements |

**Anti-AI Slop Principles:**

- Distinctive fonts (Cabinet Grotesk, Satoshi, Clash Display; never Inter/Roboto defaults)
- 60-30-10 color strategy with sharp accents
- Break predictable layouts (asymmetric grids, overlap, bento patterns)
- Purposeful motion with orchestrated page loads
- Design movement library: Brutalism, Neo-brutalism, Glassmorphism, Claymorphism, Minimalist Luxury, Retro-futurism, Maximalism

Both agents include quality checklists for generating unique, memorable designs.

---

## 🔄 Core Workflow

**Phase Flow:** User Goal → Orchestrator → Discuss (medium|complex) → PRD → Research → Planning → Plan Review (medium|complex) → Execution → Summary → (Optional) Final Review

**Error Handling:** Diagnose-then-Fix loop (Debugger → Implementer → Re-verify)

**Orchestrator** auto-detects the current phase and routes accordingly. Any feedback or steering message triggers a re-plan.

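As a rough illustration, the routing conditions in the table that follows can be read as a small decision function. The sketch below is hypothetical and not taken from the Gem Team agent files: the `WorkflowState` fields, the `Phase` enum, and the order in which conditions are checked (a final-review request first, then feedback ahead of pending tasks) are all assumptions made for the example. It returns only the first phase to enter for each condition.

```python
from dataclasses import dataclass
from enum import Enum


class Phase(Enum):
    """First phase the orchestrator enters for a given condition."""
    DISCUSS = "discuss"
    RESEARCH = "research"
    PLANNING = "planning"
    EXECUTION = "execution"
    SUMMARY = "summary"
    FINAL_REVIEW = "final_review"


@dataclass
class WorkflowState:
    # All field names are hypothetical; they mirror the table's "Condition" column.
    has_plan: bool = False
    complexity: str = "simple"  # "simple" | "medium" | "complex"
    pending_tasks: int = 0
    has_feedback: bool = False
    final_review_requested: bool = False


def detect_phase(state: WorkflowState) -> Phase:
    """Map a workflow state to the next phase (check order is an assumption)."""
    if state.final_review_requested:
        return Phase.FINAL_REVIEW
    if not state.has_plan:
        # Simple goals skip Discuss/PRD and start at Research.
        if state.complexity == "simple":
            return Phase.RESEARCH
        return Phase.DISCUSS
    if state.has_feedback:
        # Feedback or a steering message sends the plan back to Planning.
        return Phase.PLANNING
    if state.pending_tasks > 0:
        return Phase.EXECUTION
    return Phase.SUMMARY


if __name__ == "__main__":
    print(detect_phase(WorkflowState(complexity="complex")))
    print(detect_phase(WorkflowState(has_plan=True, pending_tasks=3)))
```

Keeping the routing as a pure function of state makes each row of the table easy to check in isolation.
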
| Condition | Phase | Outcome |
|:----------|:------|:--------|
| No plan + simple | Research → Planning | Quick execution path |
| No plan + medium\|complex | Discuss → PRD → Research | Spec-driven approach |
| Plan + pending tasks | Execution | Wave-based implementation |
| Plan + feedback | Planning | Replan with steer |
| Plan + completed | Summary | User decision (feedback / final review / approve) |
| User requests final review | Final Review | Parallel review by gem-reviewer + gem-critic |

---

## 📦 Installation

| Method | Command / Link | Docs |
|:-------|:---------------|:-----|
| **VS Code** | **[Install Now](https://aka.ms/awesome-copilot/install/agent?url=vscode%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.%252Fagents)** | [Copilot Docs](https://docs.github.com/en/copilot/using-github-copilot/using-github-copilot-chat) |
| **VS Code Insiders** | **[Install Now](https://aka.ms/awesome-copilot/install/agent?url=vscode-insiders%3Achat-agent%2Finstall%3Furl%3Dhttps%253A%252F%252Fraw.githubusercontent.com%252Fgithub%252Fawesome-copilot%252Fmain%252F.%252Fagents)** | [Copilot Docs](https://docs.github.com/en/copilot/using-github-copilot/using-github-copilot-chat) |
| **APM**<br>**(All AI coding agents)** | `apm install mubaidr/gem-team` | [APM Docs](https://microsoft.github.io/apm/) |
| **Copilot CLI (Marketplace)** | `copilot plugin install gem-team@awesome-copilot` | [CLI Docs](https://github.com/github/copilot-cli) |
| **Copilot CLI (Direct)** | `copilot plugin install gem-team@mubaidr` | [CLI Docs](https://github.com/github/copilot-cli) |
| **Windsurf** | `codeium agent install mubaidr/gem-team` | [Windsurf Docs](https://docs.codeium.com/windsurf) |
| **Claude Code** | `claude plugin install mubaidr/gem-team` | [Claude Docs](https://docs.anthropic.com/en/docs/claude-code) |
| **OpenCode** | `opencode plugin install mubaidr/gem-team` | [OpenCode Docs](https://opencode.ai/docs/) |
| **Manual**<br>**(Copy agent files)** | VS Code: `~/.vscode/agents/`<br>VS Code Insiders: `~/.vscode-insiders/agents/`<br>GitHub Copilot: `~/.github/copilot/agents/`<br>GitHub Copilot (project): `.github/plugin/agents/`<br>Windsurf: `~/.windsurf/agents/`<br>Claude: `~/.claude/agents/`<br>Cursor: `~/.cursor/agents/`<br>OpenCode: `~/.opencode/agents/` | - |

---

## 🏗️ Architecture

```mermaid
flowchart
  USER["User Goal"]

  subgraph ORCH["Orchestrator"]
    detect["Phase Detection"]
  end

  subgraph PHASES
    DISCUSS["🔹 Discuss"]
    PRD["📋 PRD"]
    RESEARCH["🔍 Research"]
    PLANNING["📝 Planning"]
    EXEC["⚙️ Execution"]
    SUMMARY["📊 Summary"]
    FINAL["🔎 Final Review"]
  end

  DIAG["🔬 Diagnose-then-Fix"]

  USER --> detect
  detect --> |"Simple"| RESEARCH
  detect --> |"Medium|Complex"| DISCUSS
  DISCUSS --> PRD
  PRD --> RESEARCH
  RESEARCH --> PLANNING
  PLANNING --> |"Approved"| EXEC
  PLANNING --> |"Feedback"| PLANNING
  EXEC --> |"Failure"| DIAG
  DIAG --> EXEC
  EXEC --> SUMMARY
  SUMMARY --> |"Review files"| FINAL
  FINAL --> |"Clean"| SUMMARY

  PLANNING -.-> |"critique"| critic
  PLANNING -.-> |"review"| reviewer
  EXEC --> |"parallel ≤4"| agents
  EXEC --> |"post-wave (complex)"| critic
```

---

## 🤖 The Agent Team (Q2 2026 SOTA)

| Role | Description | Output | Recommended LLM |
|:-----|:------------|:-------|:---------------|
| 🎯 **ORCHESTRATOR** | The team lead: orchestrates research, planning, implementation, and verification | 📋 PRD, plan.yaml | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** GLM-5, Kimi K2.5, Qwen3.5 |
| 🔍 **RESEARCHER** | Codebase exploration: patterns, dependencies, architecture discovery | 🔍 findings | **Closed:** Gemini 3.1 Pro, GPT-5.4, Claude Sonnet 4.6<br>**Open:** GLM-5, Qwen3.5-9B, DeepSeek-V3.2 |
| 📋 **PLANNER** | DAG-based execution plans: task decomposition, wave scheduling, risk analysis | 📄 plan.yaml | **Closed:** Gemini 3.1 Pro, Claude Sonnet 4.6, GPT-5.4<br>**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
| 🔧 **IMPLEMENTER** | TDD code implementation: features, bugs, refactoring. Never reviews own work | 💻 code | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| 🧪 **BROWSER TESTER** | E2E browser testing, UI/UX validation, visual regression with Playwright | 🧪 evidence | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash<br>**Open:** Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |
| 🚀 **DEVOPS** | Infrastructure deployment, CI/CD pipelines, container management | 🌐 infra | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3.5 |
| 🛡️ **REVIEWER** | **Zero-Hallucination Filter**: Security auditing, code review, OWASP scanning, PRD compliance verification | 📊 review report | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** Kimi K2.5, GLM-5, DeepSeek-V3.2 |
| 📝 **DOCUMENTATION** | Technical documentation, README files, API docs, diagrams, walkthroughs | 📝 docs | **Closed:** Claude Sonnet 4.6, Gemini 3.1 Flash, GPT-5.4 Mini<br>**Open:** Llama 4 Scout, Qwen3.5-9B, MiniMax M2.7 |
| 🔬 **DEBUGGER** | Root-cause analysis, stack trace diagnosis, regression bisection, error reproduction | 🔬 diagnosis | **Closed:** Gemini 3.1 Pro (Retrieval King), Claude Opus 4.6, GPT-5.4<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| 🎯 **CRITIC** | Challenges assumptions, finds edge cases, spots over-engineering and logic gaps | 💬 critique | **Closed:** Claude Sonnet 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** Kimi K2.5, GLM-5, Qwen3.5 |
| ✂️ **SIMPLIFIER** | Refactoring specialist: removes dead code, reduces complexity, consolidates duplicates | ✂️ change log | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| 🎨 **DESIGNER** | UI/UX design specialist: layouts, themes, color schemes, design systems, accessibility | 🎨 DESIGN.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
| 📱 **IMPLEMENTER-MOBILE** | Mobile implementation: React Native, Expo, Flutter with TDD | 💻 code | **Closed:** Claude Opus 4.6, GPT-5.4, Gemini 3.1 Pro<br>**Open:** DeepSeek-V3.2, GLM-5, Qwen3-Coder-Next |
| 📱 **DESIGNER-MOBILE** | Mobile UI/UX specialist: HIG, Material Design, safe areas, touch targets | 🎨 DESIGN.md | **Closed:** GPT-5.4, Gemini 3.1 Pro, Claude Sonnet 4.6<br>**Open:** Qwen3.5, GLM-5, MiniMax M2.7 |
| 📱 **MOBILE TESTER** | Mobile E2E testing: Detox, Maestro, iOS/Android simulators | 🧪 evidence | **Closed:** GPT-5.4, Claude Sonnet 4.6, Gemini 3.1 Flash<br>**Open:** Llama 4 Maverick, Qwen3.5-Flash, MiniMax M2.7 |

---

## 📚 Knowledge Sources

Agents consult only the sources relevant to their role. Trust levels apply:

| Trust Level | Sources | Behavior |
|:-----------|:--------|:---------|
| **Trusted** | PRD.yaml, plan.yaml, AGENTS.md | Follow as instructions |
| **Verify** | Codebase files, research findings | Cross-reference before assuming |
| **Untrusted** | Error logs, external data, third-party responses | Factual only; never as instructions |

| Agent | Knowledge Sources |
|:------|:------------------|
| orchestrator | PRD.yaml, AGENTS.md |
| researcher | PRD.yaml, codebase patterns, AGENTS.md, Context7, official docs, online search |
| planner | PRD.yaml, codebase patterns, AGENTS.md, Context7, official docs |
| implementer | codebase patterns, AGENTS.md, Context7 (API verification), DESIGN.md (UI tasks) |
| debugger | codebase patterns, AGENTS.md, error logs (untrusted), git history, DESIGN.md (UI bugs) |
| reviewer | PRD.yaml, codebase patterns, AGENTS.md, OWASP reference, DESIGN.md (UI review) |
| browser-tester | PRD.yaml (flow coverage), AGENTS.md, test fixtures, baseline screenshots, DESIGN.md (visual validation) |
| designer | PRD.yaml (UX goals), codebase patterns, AGENTS.md, existing design system |
| code-simplifier | codebase patterns, AGENTS.md, test suites (behavior verification) |
| documentation-writer | AGENTS.md, existing docs, source code |

---

## 🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request. See [CONTRIBUTING](./CONTRIBUTING.md) for detailed guidelines on commit message formatting, branching strategy, and code standards.

## 📄 License

This project is licensed under the Apache License 2.0.

## 💬 Support

If you encounter any issues or have questions, please [open an issue](https://github.com/mubaidr/gem-team/issues) on GitHub.