What is Mrs. Kitty AI Controller?

It is a safety-first controller layer that wraps Claude Code with 59 enforcement gates, persistent memory, browser testing, and multi-agent orchestration. It does not replace Claude Code — it governs how Claude Code operates on your codebase.

How does the safety gate system work?

Every tool call Claude Code makes passes through a pipeline of 59 Python-based enforcement gates before execution. Each gate evaluates the proposed action against specific criteria. If any gate fails or crashes, all edits stop (fail-closed design). The full pipeline is implemented in 4,877 lines of Python.
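A minimal sketch of what fail-closed means in practice, assuming each gate is a plain Python function that returns True to allow a tool call. The names `run_gates`, `Verdict`, `has_approved_plan`, and `broken_gate` are hypothetical and are not taken from the Mrs. Kitty codebase.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Verdict:
    allowed: bool
    reason: str


# Assumption: a gate is any callable that inspects a proposed tool call
# and returns True to allow it.
Gate = Callable[[dict], bool]


def run_gates(tool_call: dict, gates: List[Gate]) -> Verdict:
    """Run every gate in order; block on the first rejection or crash."""
    for gate in gates:
        try:
            passed = gate(tool_call)
        except Exception as exc:
            # Fail-closed: a crashing gate blocks the call instead of being skipped.
            return Verdict(False, f"{gate.__name__} crashed: {exc}")
        if not passed:
            return Verdict(False, f"{gate.__name__} rejected the call")
    return Verdict(True, "all gates passed")


def has_approved_plan(call: dict) -> bool:
    # Hypothetical gate: only allow edits once the user has approved a plan.
    return bool(call.get("plan_approved", False))


def broken_gate(call: dict) -> bool:
    # Hypothetical gate that always crashes, to show the fail-closed path.
    raise RuntimeError("gate bug")
```

The design choice that matters is the `except` branch: an error inside a gate is treated the same as a rejection, so a buggy gate can never silently let an edit through.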

Is Mrs. Kitty AI Controller free?

Yes, it is completely free during the beta period with no credit card, no account creation, and no usage tracking. The only requirement is a Claude Code subscription from Anthropic ($20/month for Pro or $200/month for Max).

What platforms does it support?

Windows 10+ with WSL2 (version 2.0.0 installer) and native Linux (coming soon). Docker Desktop is optional and only needed for cross-terminal coordination.

How is this different from using Claude Code without it?

Without Mrs. Kitty, the AI edits code the moment it has a guess. With Mrs. Kitty, 59 gates enforce a strict sequence: research the problem, reach 100% confidence, present a plan, get user approval, build with TDD, pass all tests, and verify the result. Failed approaches are remembered across sessions.

Mrs. Kitty AI Controller — AI Coding Safety Gates

You already know these problems.

It forgets everything.

Close the tab, start over. Every single time. An hour of context, gone. You are its memory, and you’re exhausted.

It destroys your work.

No warning. No backup. No undo. You step away for ten minutes. Your working code is gone. Replaced with something broken.

It lies about being done.

“All done!” You check. Nothing works. Half the features are missing. The other half are broken. You just became its unpaid QA.

It goes off the rails.

You let an AI agent run unsupervised. Come back later. $300 burned building the wrong thing entirely. No guardrails. Just a mess and a bill.

41% more bugs. Zero time saved.

GitClear, 2024 — 211 million lines of code analyzed

That’s not a tool. That’s a liability.

Same AI. Different result.

Without Controller

$ build
Something broke. Again.
Rewriting everything from scratch...
23 files changed with no backup
Error after error after error...
Context lost. Starting over...
FAILED: 12 things broken
“Done!” (it was not done)

With Mrs. Kitty

✓ Project analyzed & understood
✓ Requirements clarified (100% confidence)
✓ Plan approved: you signed off
✓ All files versioned & backed up
✓ Code implemented cleanly
✓ 47/47 tests passing
Done. Actually done.

What’s included — and what it’s worth.

Gate system (59 gates) ▸ 59-gate enforcement pipeline. Every edit passes through all of them. $85/mo

59 gates · 4,877 lines · 100% confidence · <10ms per gate

See full breakdown →

65 denied destructive commands — filesystem deletion, disk formatting, registry edits, dangerous git operations, process killing — all blocked before they execute
Deploy safety system (6 gates) — protects server deployments with auto-backup before every write. No blind overwrites, no stale uploads, no restores without diffing first
Test integrity gate — can’t modify failing tests to make them pass. Forces fixing the source code instead of gaming the test suite
Hedging scanner — blocks uncertainty language like “I’m not sure” or “this might work.” If the AI isn’t certain, it researches more instead of guessing
Plan file protection — append-only enforcement with auto-backup before any modification. Plans can’t be silently rewritten or truncated
7 approval types — maintenance mode, task termination, confidence override, CLAUDE.md edits, simplification proposals, gate blocks, and repair mode — each with its own confirmation flow

If a gate crashes, all edits stop. If the AI tries to bypass a gate (30+ patterns detected), it's blocked. Every edit, every time.
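As one illustration, the hedging scanner described above could be sketched like this. The two phrases come from the description; the pattern list, `find_hedges`, and `hedging_gate` are assumptions made for the example, and a real scanner would need a much longer list.

```python
import re
from typing import List

# Hypothetical pattern list. Only these two phrases are named on this page.
HEDGE_PATTERNS = [
    r"\bi'?m not sure\b",
    r"\bthis might work\b",
]
_HEDGE_RE = re.compile("|".join(HEDGE_PATTERNS), re.IGNORECASE)


def find_hedges(message: str) -> List[str]:
    """Return every hedging phrase found in the message."""
    # Normalize curly apostrophes so "I’m" and "I'm" match the same pattern.
    normalized = message.replace("\u2019", "'")
    return [match.group(0) for match in _HEDGE_RE.finditer(normalized)]


def hedging_gate(message: str) -> bool:
    """Allow the message only if it contains no hedging language."""
    return not find_hedges(message)
```

A gate like this only detects wording. What happens next (sending the AI back to research) would be handled by whatever calls the gate.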

Self-Improvement Memory ▸ Learns from every session — gets smarter the longer you use it $20/mo

Auto-captures 8 types of knowledge, including solutions, failures, decisions, patterns, and preferences
TF-IDF scoring with time decay — recent lessons rank higher
Failed approaches tracked — never repeats the same mistake twice
Compounds knowledge across sessions like a real engineer

Other tools start from zero every session. Mrs. Kitty builds institutional knowledge.
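Here is a rough sketch of TF-IDF scoring with time decay, assuming each memory is a dict with a `text` field and a `day` timestamp, and assuming an exponential decay with a 30-day half-life. Those field names, the half-life, and the smoothed IDF formula are illustrative choices, not details confirmed by the product.

```python
import math
from collections import Counter
from typing import List, Tuple


def tokenize(text: str) -> List[str]:
    return text.lower().split()


def score_memories(query: str, memories: List[dict], now_day: float,
                   half_life_days: float = 30.0) -> List[Tuple[float, dict]]:
    """Rank memories by TF-IDF relevance to the query, scaled by recency."""
    docs = [tokenize(m["text"]) for m in memories]
    n_docs = len(docs)
    # Document frequency: in how many memories does each term appear?
    doc_freq = Counter(term for doc in docs for term in set(doc))

    ranked = []
    for memory, doc in zip(memories, docs):
        term_freq = Counter(doc)
        relevance = 0.0
        for term in tokenize(query):
            if term in term_freq:
                idf = math.log((1 + n_docs) / (1 + doc_freq[term])) + 1.0
                relevance += (term_freq[term] / len(doc)) * idf
        # Exponential time decay: a memory loses half its weight per half-life.
        age_days = now_day - memory["day"]
        decay = 0.5 ** (age_days / half_life_days)
        ranked.append((relevance * decay, memory))

    ranked.sort(key=lambda pair: pair[0], reverse=True)
    return ranked
```

With two equally relevant lessons, the one recorded more recently ranks first, which is the behavior the "recent lessons rank higher" line describes.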

See how memory works →

Agentic Browser Automation ▸ 140+ commands in your real Chrome — completely undetectable, replaces Playwright $99/mo

Your real Chrome, completely undetectable — no navigator.webdriver flag
~200 tokens per page vs 13,700 for Playwright MCP — 98.5% token reduction
140+ commands: click, fill, navigate, upload, state management, network control
Parallel sessions — 100s of browser instances working together agentically

Find 5-star Amazon sellers. Message 20 Alibaba suppliers. Whatever you do in Chrome, it handles.

See browser automation →

Voice Dictation ▸ 99-language Whisper model, CUDA-accelerated on NVIDIA GPUs, near-instant $15/mo

You speak 3x faster than you type — 150 WPM vs 40 typed. Up to 30 minutes of continuous speech
Whisper large-v3-turbo with CUDA GPU acceleration (float16) for near-instant results
Press Alt+X to start, speak naturally, press Alt+X to stop (Windows, Linux, macOS)
Screenshot integration: Alt+V pastes a screenshot path — Claude sees what you see

2am debugging session. You whisper the fix. It transcribes faster than you can think.

137 Skills & Agents ▸ 9 plugins + 14 superpowers skills + 114 custom skills — the largest skill library for Claude Code $69/mo

9 community-voted plugins (improved and pre-tuned), 14 superpowers skills, and 114 custom skills built for every workflow — from code review to deployment to browser automation. All ship ready to go.

Superpowers (by Jesse Vincent) — Upgraded: instant response (eliminated the 1-second delay on every new message), embedded top 5 skill cores directly to remove extra API round-trips
Commit Commands (Anthropic) — pre-configured with project commit conventions, zero setup needed
PR Review Toolkit (Anthropic) — 6 specialized review agents in parallel: comments, tests, silent failures, type design, code quality, simplification
Feature Dev (Anthropic) — architecture-first workflow: explore → design → implement → review with codebase-aware agents
Code Review (Anthropic) — automated review against your CLAUDE.md project guidelines on every change
Code Simplifier (Anthropic) — auto-simplifies after each coding task while preserving all functionality
Frontend Design (Anthropic) — Upgraded: Hormozi conversion-first architecture fused with 2026 design — glassmorphism, multi-layer backgrounds, cursor-following effects. 842-line research-backed skill with Core Web Vitals optimization and 14-point checklist
Greptile (by Daksh Gupta & team, YC-backed) — codebase-aware AI search and PR review with full repository context. 500M+ lines reviewed monthly
Context7 (by Upstash) — injects up-to-date, version-specific library docs directly into prompts. Eliminates API hallucinations
Agent-Browser (CalebDane7) — agentic browser automation replacing Playwright. 140+ commands, your real Chrome, completely undetectable. Includes Claude Code skill for autonomous UI research
Humanizer (v2.2.0) — removes 24 categories of AI writing patterns from text. Two-pass process makes AI-generated content sound naturally human-written
Poe Media — image and video generation via Poe API. 9+ image models (FLUX, Ideogram, Recraft), 4 video models, upscaling, img2img, and chaining
Claude API — intelligent assistance for building apps with the Claude API and Anthropic SDK. Auto-activates when code imports anthropic packages

137 skills covering every workflow. Each one upgraded and pre-wired — what would take hours of configuration works out of the box.

RTK Token Killer ▸ Rust CLI proxy — 82% average token savings across 30+ commands $29/mo

Intercepts command output before it reaches Claude — compresses intelligently
12 filtering strategies: stats extraction, error-only, JSON schema extraction
Verified: 3.2M tokens saved across 664 commands (81.9% average reduction)
API responses: up to 98.8% savings via JSON schema extraction
~10ms overhead — effectively invisible

A 30-minute session uses ~150K tokens without RTK, ~45K with it. That’s real money saved.
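To make "JSON schema extraction" concrete, here is a small sketch of the idea: keep the shape of an API response and drop the values. RTK itself is described as a Rust CLI, so this Python version is only an illustration, and `extract_schema` is a hypothetical name.

```python
import json


def extract_schema(value):
    """Replace every leaf value with its type name, keeping only the structure."""
    if isinstance(value, dict):
        return {key: extract_schema(item) for key, item in value.items()}
    if isinstance(value, list):
        # Assumption: list items share one shape, so the first item is a fair sample.
        return [extract_schema(value[0])] if value else []
    return type(value).__name__


response = json.loads(
    '{"users": [{"id": 1, "name": "Ada"}, {"id": 2, "name": "Lin"}], "total": 2}'
)
schema = extract_schema(response)
```

The savings grow with the size of the response: a list of thousands of records collapses to a single sample shape.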

Cross-Terminal Sync ▸ Multiple AI sessions working simultaneously, zero conflicts $25/mo

Real-time session awareness across terminals via heartbeat
File-level locking prevents concurrent edits to the same file
Terminal 2 sees what Terminal 1 is working on — no collisions

Run 3 Claude sessions on 3 different features. They coordinate automatically.
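The heartbeat and file-locking ideas above can be sketched together. This version keeps locks in an in-memory table and treats a lock as stale after 30 seconds without a heartbeat. Both are assumptions for illustration: the real system coordinates across terminals, and `LockTable` is a hypothetical name.

```python
import time
from typing import Dict, Optional, Tuple


class LockTable:
    """In-memory stand-in for a lock store shared between sessions."""

    def __init__(self, timeout_s: float = 30.0):
        self.timeout_s = timeout_s
        # path -> (owning session, time of last heartbeat)
        self._locks: Dict[str, Tuple[str, float]] = {}

    def acquire(self, path: str, session: str, now: Optional[float] = None) -> bool:
        now = time.time() if now is None else now
        holder = self._locks.get(path)
        if holder and holder[0] != session and now - holder[1] < self.timeout_s:
            return False  # another live session is editing this file
        self._locks[path] = (session, now)
        return True

    def heartbeat(self, session: str, now: Optional[float] = None) -> None:
        """Refresh every lock the session holds so it is not treated as stale."""
        now = time.time() if now is None else now
        for path, (owner, _) in list(self._locks.items()):
            if owner == session:
                self._locks[path] = (session, now)

    def release(self, path: str, session: str) -> None:
        holder = self._locks.get(path)
        if holder and holder[0] == session:
            del self._locks[path]
```

The timeout is what keeps a crashed terminal from holding a file forever: once its heartbeats stop, another session can take over the lock.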

Total value $342/mo

Your Price

Mrs. Kitty works with Claude Code, which you already pay for through your Anthropic subscription. We're building reputation first, not recurring revenue. Free now, enterprise tier later.

8 minutes. One installer. Done.

You describe. It researches.

Tell it what you want. It reads your codebase, maps dependencies, checks patterns, and reaches 100% confidence before writing a single line of code.

Two AIs review the plan. You approve.

The first AI writes the plan. A second AI tears it apart, looking for unverified claims and missing edge cases. Only plans that survive both reach you for sign-off.

It builds, tests, and remembers.

59 gates enforce quality on every edit. If something breaks, it fingerprints the failure, tries a different approach, and remembers the fix permanently. No manual QA.

Found a problem? It goes back and fixes it.

How you actually use it.

You don't need to know how to code. You need to know where you're going.

Talk to it.

Press Alt+X and speak, or type. Tell it exactly what you want. Be specific about what's wrong or what you need built.

One thing at a time.

Don't dump ten tasks on it. Give it one clear goal. Let it finish. Then give it the next one.

It does the research.

It reads your codebase, finds dependencies, checks documentation. You don't have to explain your project from scratch every time.

It asks you questions.

If something is unclear, it asks before guessing. No more waking up to find it built the wrong thing.

Review the plan.

It shows you a plan before touching anything. Read it. Ask it to explain anything you don't understand. You sign off before it writes a single line.

You steer. It rows.

You're the captain. You don't need to know how the engine works. You need to know where the ship is going. The AI handles execution.

Ask it to prove it works.

"How do we test this? What edge cases could break? How do we fix those without breaking what we already have?" It runs the tests itself and shows you the results.

It finds the best way.

It researches who has solved this problem best, copies the most proven approaches, and combines them. You get battle-tested solutions, not first drafts.

8 innovations no other tool has.

Not one. We checked.

🎯

100% confidence threshold

It won't touch your code until it scores 100/100 on a 5-factor check: plan quality, file understanding, dependency mapping, research depth, and context. Other tools edit on a hunch.
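One way to picture the check, as a sketch only: the five factor names come from the description above, while the equal 20-point weighting and the function names are assumptions.

```python
FACTORS = (
    "plan_quality",
    "file_understanding",
    "dependency_mapping",
    "research_depth",
    "context",
)
MAX_PER_FACTOR = 20  # assumption: five equal weights that sum to 100


def confidence_score(scores: dict) -> int:
    """Sum the five factor scores, clamping each to the 0..20 range."""
    missing = [name for name in FACTORS if name not in scores]
    if missing:
        raise ValueError(f"missing factors: {missing}")
    return sum(min(max(int(scores[name]), 0), MAX_PER_FACTOR) for name in FACTORS)


def may_edit(scores: dict) -> bool:
    """Edits are allowed only at a perfect 100/100."""
    return confidence_score(scores) == 100
```

Under this scheme a single weak factor, say 19/20 on research depth, is enough to hold the edit back.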

How the scoring works →

🔄

Two AIs review every plan

The first AI writes the plan. A second AI tears it apart, looking for unverified claims and missing edge cases. Only plans that survive both reach you.

How adversarial review works →

📡

Automatic scope detection

Tests that used to pass now fail? Same fix attempted three times? New errors appearing? The system catches it and forces a replan before things get worse.
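The three signals above map onto simple set and counter checks. This is a sketch under assumptions: test names, fix labels, and error identifiers are plain strings, and `replan_reasons` is a hypothetical function name.

```python
from collections import Counter
from typing import Iterable, List, Set


def replan_reasons(passed_before: Set[str], passed_now: Set[str],
                   fix_attempts: Iterable[str],
                   errors_before: Set[str], errors_now: Set[str]) -> List[str]:
    """Return the reasons to force a replan; an empty list means keep going."""
    reasons = []

    # Signal 1: tests that used to pass and no longer do.
    regressions = passed_before - passed_now
    if regressions:
        reasons.append(f"regressed tests: {sorted(regressions)}")

    # Signal 2: the same fix attempted three or more times.
    repeated = sorted(fix for fix, count in Counter(fix_attempts).items() if count >= 3)
    if repeated:
        reasons.append(f"fix attempted 3+ times: {repeated}")

    # Signal 3: errors that were not present before.
    new_errors = errors_now - errors_before
    if new_errors:
        reasons.append(f"new errors: {sorted(new_errors)}")

    return reasons
```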

How scope detection works →

🧬

Never retries the same broken fix

Every fix attempt gets fingerprinted. If a new fix is the same idea as one that already failed, it's blocked. Forces genuinely different approaches instead of looping.
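A deliberately simple sketch of fingerprinting, assuming each fix attempt comes with a short text description. It hashes the sorted set of words, so it only catches rewordings that use the same words. Recognizing "the same idea" in general would need something stronger, and nothing here is taken from the actual implementation.

```python
import hashlib
import re


def fingerprint(description: str) -> str:
    """Hash the sorted set of words so word order, case, and punctuation are ignored."""
    words = sorted(set(re.findall(r"[a-z0-9_]+", description.lower())))
    return hashlib.sha256(" ".join(words).encode("utf-8")).hexdigest()


class FailedFixes:
    """Remembers the fingerprints of fixes that already failed."""

    def __init__(self):
        self._seen = set()

    def record_failure(self, description: str) -> None:
        self._seen.add(fingerprint(description))

    def is_repeat(self, description: str) -> bool:
        return fingerprint(description) in self._seen
```

Persisting the set of fingerprints between sessions is what would turn this from loop prevention into long-term memory.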

How dedup works →

🌐

Browser automation in your real Chrome

140+ commands. Completely undetectable. Reads a page in ~200 tokens instead of 13,700. Runs parallel sessions across hundreds of tabs. Open source.

⚡

~10ms overhead per gate

59 hooks, each under 10ms. Parallel agents across terminals. Rust-powered token compression saves 82% on average. You get results, not loading screens.

How the pipeline works →

🛡

Deploy safety system (6 gates)

Auto-backup before every server write. No blind overwrites, no stale file uploads, no restores without diffing first. Six dedicated gates protect production deployments from the most common AI mistakes.
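The backup-before-write rule can be sketched with the standard library alone. `safe_write` and the backup naming scheme are hypothetical, and this version works on local paths rather than a remote server.

```python
import shutil
import time
from pathlib import Path
from typing import Optional


def safe_write(target: Path, new_content: str, backup_dir: Path) -> Optional[Path]:
    """Copy the existing file to backup_dir before overwriting it.

    Returns the backup path, or None if there was nothing to back up.
    """
    backup = None
    if target.exists():
        backup_dir.mkdir(parents=True, exist_ok=True)
        # A nanosecond timestamp keeps successive backups from colliding.
        backup = backup_dir / f"{target.name}.{time.time_ns()}.bak"
        shutil.copy2(target, backup)
    target.write_text(new_content, encoding="utf-8")
    return backup
```

Because the copy happens before the write, a failed or unwanted write always leaves the previous version recoverable.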

🚫

65 blocked destructive commands

Filesystem deletion, disk formatting, registry edits, dangerous git operations, process killing — all denied before they execute. The AI literally cannot run rm -rf on your machine.
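For a sense of how a denylist check can work, here is a minimal sketch. The three rules are examples chosen for illustration, not the actual list of 65, and `is_denied` is a hypothetical name. A production checker would also need to handle `sudo`, pipes, and chained commands.

```python
import shlex

# Hypothetical rules: (program, flags that make it destructive).
# An empty flag set means any use of the program is denied.
DENY_RULES = [
    ("rm", {"-rf", "-fr", "-r", "-R", "--recursive"}),
    ("mkfs", set()),
    ("git", {"--hard", "--force"}),
]


def is_denied(command: str) -> bool:
    """Return True if the command matches a deny rule or cannot be parsed."""
    try:
        tokens = shlex.split(command)
    except ValueError:
        # Unparseable input is blocked rather than guessed at.
        return True
    if not tokens:
        return False
    # Treat "mkfs.ext4" as "mkfs" by dropping the suffix after the first dot.
    program = tokens[0].split(".")[0]
    args = set(tokens[1:])
    for denied_program, flags in DENY_RULES:
        if program == denied_program and (not flags or args & flags):
            return True
    return False
```

The check runs before the command does, so a match means the command never reaches the shell.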

Oh, and it takes voice commands.

You speak 3x faster than you type. 150 words per minute vs 40 typed. Up to 30 minutes of continuous speech, transcribed near-instantly on your GPU
99-language Whisper model. Press Alt+X to start, speak naturally, press Alt+X to stop. Works on Windows, Linux, and macOS
2am debugging session. You whisper the fix. It transcribes faster than you can think
Alt+V pastes a screenshot. Claude sees exactly what you see

Zero risk. For real.

⏱

8 minutes to install. If it takes longer, uninstall in 2 clicks.

🖥

Windows: Everything lives in WSL — your system stays untouched. Linux: Installs to ~/.local — your system packages stay untouched.

🔓

No lock-in. Standard Claude Code underneath. Remove Mrs. Kitty anytime.

👤

No account. No email. No tracking. Just download and run.

Most EU AI Act obligations start applying on August 2, 2026.

Fines up to €35M or 7% of global turnover.

Every company using AI in development will need governance tooling.

Mrs. Kitty is ready. Is your AI?

See documentation →

Built for Claude Code, Anthropic's coding agent. Mrs. Kitty itself runs entirely on your machine: the gate system, memory engine, and all 59 enforcement layers are local infrastructure, adaptable to any AI model, including open-source ones.

Stop fixing your AI’s mistakes.