A single AI agent will miss things.
66 specialized agents will not.
AgentSpec brings Spec-Driven Development to GitHub Copilot CLI — 30 knowledge domains, 41 skills, and a 5-phase workflow that turns vague requests into production-grade data pipelines.
/plugin marketplace add arthur1511/agentspec-copilot The 5-Phase SDD Workflow
Every feature follows the same structured path — from vague idea to shipped code. No hallucinated SQL. No wrong incremental strategies. Just decisions backed by 30 knowledge domains.
Brainstorm
optionalExplore the idea space. Compare approaches, answer discovery questions, and apply YAGNI before a single requirement is written.
Define
Capture requirements in one pass. Structured problem statement, user personas, success criteria, and a clarity score that must reach 12/15 to proceed.
Design
Architecture, file manifest, inline ADRs, and copy-paste code patterns. Every design decision is documented with rationale and alternatives rejected.
Build
Execute the file manifest. Build auto-delegates each file to the matched specialist agent — spark-engineer for PySpark, dbt-specialist for SQL models, and so on.
Ship
Archive completed work with lessons learned and KB updates. Every shipped feature makes the next one faster.
66 Specialized Agents
Across 9 categories — each agent has deep domain knowledge, confidence scoring, and KB-first resolution against 30 knowledge domains.
Other
- agentspec:brainstorm-agent
- agentspec:build-agent
- agentspec:define-agent
- agentspec:design-agent
- +2 more
Architecture
- architect-data-platform-engineer
- architect-genai
- architect-kb
- architect-lakehouse
- +4 more
Cloud
- cloud-ai-data-engineer-cloud
- cloud-ai-data-engineer-gcp
- cloud-ai-prompt-specialist-gcp
- cloud-aws-data-architect
- +6 more
Data Engineering
- de-ai-data-engineer
- de-airflow-specialist
- de-dbt-specialist
- de-lakeflow-architect
- +11 more
Developer Tools
- dev-codebase-explorer
- dev-meeting-analyst
- dev-prompt-crafter
- dev-shell-script-specialist
Data Science & ML
- ds-eda-analyst
- ds-experiment-tracker
- ds-feature-engineer
- ds-ml-deployer
- +4 more
Microsoft Fabric
- fabric-ai-specialist
- fabric-architect
- fabric-cicd-specialist
- fabric-logging-specialist
- +2 more
Python & Code Quality
- python-ai-prompt-specialist
- python-code-cleaner
- python-code-documenter
- python-code-reviewer
- +2 more
Quality & Contracts
- test-data-contracts-engineer
- test-data-quality-analyst
- test-generator
30 Knowledge Domains
Every agent resolves against this KB before answering. No hallucinated patterns — just validated, up-to-date documentation on the tools you actually use.
pandas
pandas data manipulation — DataFrame wrangling, indexing, groupby, merging, missing data, performance
scikit-learn
scikit-learn ML — Estimator API, Pipelines, preprocessing, cross-validation, classification, regression, model selection
statistical-analysis
Statistical analysis — probability distributions, hypothesis testing, A/B test design, correlation, effect sizes, confidence intervals
data-visualization
Data visualization in Python — matplotlib foundations, seaborn statistical plots, plotly interactive charts, publication-quality figures
time-series
Time series analysis and forecasting in Python — stationarity, decomposition, ARIMA, Prophet, ML-based forecasting, and evaluation metrics
mlflow
MLflow experiment tracking and model registry — run logging, autologging, artifact management, model versioning, production serving
dbt
dbt development patterns — Fusion Engine, Mesh, Semantic Layer, models, macros, tests
spark
PySpark and Spark SQL patterns — DataFrames, performance, Real-Time Mode, Delta integration
airflow
Orchestration patterns — Airflow 3.x TaskFlow, Dagster, Prefect comparison, DAG design
sql-patterns
Cross-dialect SQL — window functions, CTEs, deduplication, DuckDB/Snowflake/BigQuery/Spark
streaming
Stream processing — Flink, Kafka, Spark Streaming, RisingWave, Materialize, CDC
data-modeling
Schema design — dimensional modeling, Data Vault, SCD types, schema evolution
data-quality
Data quality, contracts, and observability — Soda, GE, dbt tests, ODCS, Monte Carlo
lakehouse
Open table formats and catalogs — Iceberg v3, Delta Lake 4.1, DuckLake, Unity, Gravitino
cloud-platforms
Cloud data platforms — Snowflake Cortex, Databricks LakeFlow, BigQuery AI, cross-platform
medallion
Medallion Architecture — Bronze/Silver/Gold layer design, data quality progression, schema evolution
aws
AWS data engineering — Lambda, S3, Glue, Redshift, MWAA, SAM deployment
gcp
Google Cloud Platform — Cloud Run, Pub/Sub, GCS, BigQuery, IAM, Secret Manager
microsoft-fabric
Microsoft Fabric — Lakehouse, Data Warehouse, Pipelines, Real-Time Analytics, KQL, CI/CD, AI
lakeflow
Databricks Lakeflow — DLT pipelines, materialized views, streaming tables, expectations, DABs
ai-data-engineering
AI data engineering — RAG pipelines, vector DBs, feature stores, LLMOps, embeddings
modern-stack
Modern data tools — DuckDB, Polars, SQLMesh, Malloy, Evidence.dev, local-first analytics
prompt-engineering
Prompt engineering — chain-of-thought, structured extraction, few-shot, system prompts
genai
GenAI architecture — multi-agent systems, RAG, state machines, tool calling, guardrails
pydantic
Pydantic patterns — BaseModel, validators, LLM output validation, extraction schemas
python
Python patterns — dataclasses, type hints, generators, async, project structure
testing
Testing patterns — pytest, fixtures, mocking, parametrize, integration tests, Spark testing
terraform
Terraform IaC — resources, modules, providers, state, workspaces, GCP/AWS patterns
xgboost
XGBoost gradient boosted trees — training pipelines, hyperparameter tuning, early stopping, feature importance
supabase
Supabase development -- pgvector, RLS policies, Edge Functions, Auth, Realtime, migrations
Get started in 30 seconds
Three ways to install — pick the one that fits your workflow.
Marketplace (Recommended)
One command, always up to date
/plugin marketplace add arthur1511/agentspec-copilot /plugin install agentspec@agentspec
Clone & install locally
Install from disk for offline use
git clone https://github.com/Arthur1511/agentspec-copilot.git copilot plugin install ./agentspec-copilot/plugin-copilot
Build from source
For contributors and customization
git clone https://github.com/Arthur1511/agentspec-copilot.git cd agentspec-copilot ./build-copilot.sh # Linux / macOS .\build-copilot.ps1 # Windows (PowerShell) copilot plugin install ./plugin-copilot
After installing, every Copilot CLI session has 66 agents, 41 skills, and 30 KB domains available instantly.
Documentation
Everything you need to go from zero to shipping production data pipelines with AgentSpec.
Getting Started ↗
Install AgentSpec, run your first skill, and understand the SDD workflow in 5 minutes.
Concepts ↗
Understand agents, skills, KB domains, confidence scoring, and the SDD phase contracts.
Tutorials ↗
Step-by-step walkthroughs: build a Spark pipeline, design a star schema, run Airflow DAGs.
Reference ↗
Full agent list, skill catalogue, KB domain index, and SDD template reference.