Structured decision frameworks for the full ML lifecycle — from deciding whether to build, through architecture, evaluation, and deployment, to governance and monitoring.
Works with Claude Code · Codex · OpenCode · MIT License
| Code lane | → | Standard CI/CD · git-based |
| Model lane | → | Model registry + eval gate |
Skills activate automatically from plain language. Explicit slash commands available for every workflow stage.
From deciding whether to build, through data engineering and training, all the way to governance — skills activate automatically as you describe your problem.
Each stage has a dedicated skill with step-by-step workflows, decision tables, and artifact outputs
Every ML system decision — whether to build, how to architect, when to deploy — deserves a defensible answer backed by proven patterns, not guesswork.
Teams skip problem framing and jump straight to model training — only to discover months later that a rule-based system would have worked just as well.
The plugin routes to /mlops-opportunity-framing automatically. It scores ML fit across 5 dimensions, checks for label leakage and proxy traps, and writes a go/no-go artifact.
| Pattern complexity | ✓ High | (behavioral signals) |
| Label quality | ✓ Clear | (30-day churn window) |
| Data volume | ✓ 2M rows | (sufficient) |
| Leakage risk | ⚠ Check | (support tickets) |
The most common MLOps architectural mistake: mixing code and model deployment into a single pipeline. When a model needs rollback, the code rolls back too. The plugin catches this before you build it.
Select a workflow to see how the plugin responds — structured decision frameworks, not free-form conversation.
| Pattern complexity | ✓ High | (non-linear behavioral signals) |
| Label availability | ✓ Clear | (30-day churn window defined) |
| Data volume | ✓ 2M rows | (sufficient for training) |
| Feedback loop | ~ 30 days | (acceptable latency) |
| Actionability | ✓ High | (intervention exists) |
| Code lane | ✓ | Standard CI/CD · git-triggered |
| Model lane | ✗ | Missing — code+model in same pipeline |
| Layer 1 | Data validation | ✓ Schema + freshness checks |
| Layer 2 | Feature tests | ✓ Parity + leakage guards |
| Layer 3 | Unit tests | ✓ Preprocessing + transforms |
| Layer 4 | Behavioral tests | ⚠ Add: edge case assertions |
| Layer 5 | Integration tests | ✗ Missing end-to-end test |
| Phase 1 | Shadow mode | 7 days · compare vs current model |
| Phase 2 | Canary 10% | 3 days · monitor business KPIs |
| Phase 3 | Full rollout | On KPI threshold met |
Flagship skills write structured HTML reports to disk — stakeholder-ready, version-controllable, reopenable anytime. These are real outputs from the plugin.
No API keys. No config required. Skills activate automatically when you describe your problem.
One command in your terminal. Works with Claude Code, Codex, and OpenCode.
claude plugin install github:deepak-karkala/production-mlops-skills
Optional but recommended — initializes artifact paths and ML framework context.
/production-mlops:setup-production-mlops
Skills auto-route from plain language. Or use explicit slash commands for any workflow.
Should we use ML for this use case?
11 skills, each with step-by-step workflows, decision tables, and explicit scope boundaries.
11 skills. 4 specialist agents. Decision frameworks for the full ML lifecycle.
MIT LICENSE · CLAUDE CODE · CODEX · OPENCODE