Free for Early Testers

Stop fixing your AI’s mistakes.

Mrs. Kitty does the research, makes the plan, writes the code, tests it, and fixes its own bugs. Parallel agents protect your context and cut token waste by 82%. You just describe what you want.

59 safety gates enforce every step. It can’t skip research, can’t skip planning, can’t ship untested code. Built and tested independently over 8 months.

v3.1.0 · Windows or Linux · No email, no tracking
claude code + mrs. kitty
$ claude "Add dark mode toggle to settings" ⚡ Researching codebase... Found: settings.tsx, theme-context.ts, globals.css 📋 Planning — 3 files, 1 new component ✓ GATE: research-verified ✓ GATE: plan-approved ✏️ Writing code... settings.tsx — toggle component theme-context.ts — state management globals.css — CSS variables 🔬 4/4 tests passing ✓ GATE: tests-verified ✓ GATE: no-regressions ✔ Done. Dark mode toggle shipped.

You already know these problems.

It forgets everything.

Close the tab, start over. Every single time. An hour of context, gone. You are its memory, and you’re exhausted.

It destroys your work.

No warning. No backup. No undo. You step away for ten minutes. Your working code is gone. Replaced with something broken.

It lies about being done.

“All done!” You check. Nothing works. Half the features are missing. The other half are broken. You just became its unpaid QA.

It goes off the rails.

You let an AI agent run unsupervised. Come back later. $300 burned building the wrong thing entirely. No guardrails. Just a mess and a bill.

41% more bugs. Zero time saved.

GitClear, 2024 — 211 million lines of code analyzed

That’s not a tool. That’s a liability.

Same AI. Different result.

Without Controller
$ build — Something broke. Again. Rewriting everything from scratch... 23 files changed with no backup Error after error after error... Context lost. Starting over... FAILED: 12 things broken “Done!” (it was not done)
With Mrs. Kitty
✓ Project analyzed & understood ✓ Requirements clarified (100% confidence) ✓ Plan approved — you signed off ✓ All files versioned & backed up ✓ Code implemented cleanly ✓ 47/47 tests passing Done. Actually done.

What’s included — and what it’s worth.

Gate system (59 gates) 59-gate enforcement pipeline. Every edit passes through all of them. Tap to see the full pipeline $85/mo
KITTYAI — FULL LIFECYCLE CIRCUIT
every hook, every gate, every stage — fail-closed
CLAUDE SESSION LAUNCHED STAGE 1: SESSION START — 5 hooks (parallel) PARALLEL session-start.sh Docker check, state cleanup session_start_continuity.py Handoff loading session-register.mjs Cross-session coordination memory-awareness.mjs Memory loading patch_superpowers.sh Plugin patches SESSION INITIALIZED STAGE 2: USER PROMPT — 4 hooks (parallel) PARALLEL Memory Awareness TF-IDF search, 3-tier inject HOT / WARM / COLD Correction Detector Frustration patterns Preference learning Automation Enforcer Complexity: TRIVIAL/MED/HIGH Phase orchestrate Media Detect Image/video context CONTEXT ENRICHED — AI RESPONDS STAGE 3: PRETOOLUSE — 51 gates (37 main + 5 parallel + 9 plan review) SUB-LANE A: Parallel Hook Files (5 hooks fire simultaneously) PARALLEL repo_boundary_gate.py Cross-repo protection read_gate.py Plan mode for research search_strategy_gate.py Parallel agent enforcement offload_enforcer.py Deploy safety — 6 sub-gates browser_swarm_gate.py 3+ open blocking BLOCK: 3+ open SUB-LANE B: Main Pipeline — pretooluse.py (37 gates, sequential) PRE-NUMBERED Maint Mode bypass if active File Claims Screenshot View Learning Gate Task Tool Bash Bypass Maintenance Perm Kill Perm ChromeCDP NUMBERED GATES Plan Mode Gates 0.3 - 0.5 BLOCK: not in plan Gate 0.6 Pre-validation Gate 0.7 Scope check Gate 01 Entry validation Gate 1.3 Tool classify Gate 1.5 Safety filter Gate 1.7 Permission check Thinking Partner Gate 02 — CRITICAL BLOCK: <100% conf Gate 2a Context verify Gate 2b Dependency check Gate 2c Impact analysis Hedging Gate 2.5 BLOCK: weasel words Gate 2.5b Quality filter Gate 2.6 Final pre-check Gate 03 Pre-execution Gate 04 Resource check Gate 05 Boundary enforce User Approval Gate 06 BLOCK: no consent Gate 07 Commit safety Gate 08 Destructive guard Simplification Gate 09 BLOCK: over-complex Test-Before-Deploy Gate 10 — final gate SUB-LANE C: ExitPlanMode (9 sub-checks) Fires when exiting plan mode Research Saturation Thinking Questions Dependency Check Content Evidence Research Evidence Verification 100% Conf BLOCK: <100% Subprocess Review 2nd AI reviewer Plan Content Final plan validation ALL 51 PRETOOLUSE GATES PASSED TOOL EXECUTES STAGE 4: POSTTOOLUSE — 3 hooks posttooluse.py Journal, testing gate, scope detect, plan progress auto_learning.py Error-resolution pairs, test results post_tool_use_tracker.py Edit tracking for testing gate TOOL RESULT PROCESSED STAGE 5: TDD LOOP — fail -> retry -> replan Tests Pass? YES NO — RETRY RETRY LOOP RETRY 2 3x FAIL — REPLAN REPLAN TESTS GREEN — CONTINUE STAGE 6: STOP — 13 gates Maintenance Check Automation Mandate 22 anti-patterns BLOCK: manual defer Escape Hatch 2x Ctrl+C override Handoff Writer State Completion State Cleanup 15+ directories Plan Enforcement Session Cleanup Testing Gate BLOCK: dirty edits Visual Verify Review Loop Deploy Verify Testing Debt ALL GATES PASS SHIPPED ANY GATE FAILS BLOCKED (fail-closed)
59
Gates
22
Hook Files
7
Lifecycle Stages
4,877
Lines
100%
Confidence
<10ms
Per Gate
See full breakdown →
  • 65 denied destructive commands — filesystem deletion, disk formatting, registry edits, dangerous git operations, process killing — all blocked before they execute
  • Deploy safety system (6 gates) — protects server deployments with auto-backup before every write. No blind overwrites, no stale uploads, no restores without diffing first
  • Test integrity gate — can’t modify failing tests to make them pass. Forces fixing the source code instead of gaming the test suite
  • Hedging scanner — blocks uncertainty language like “I’m not sure” or “this might work.” If the AI isn’t certain, it researches more instead of guessing
  • Plan file protection — append-only enforcement with auto-backup before any modification. Plans can’t be silently rewritten or truncated
  • 7 approval types — maintenance mode, task termination, confidence override, CLAUDE.md edits, simplification proposals, gate blocks, and repair mode — each with its own confirmation flow

If a gate crashes, all edits stop. If the AI tries to bypass a gate (30+ patterns detected), it's blocked. Every edit, every time.

Self-Improvement Memory Learns from every session — gets smarter the longer you use it $20/mo
  • Auto-captures 8 types of knowledge: solutions, failures, decisions, patterns, preferences
  • TF-IDF scoring with time decay — recent lessons rank higher
  • Failed approaches tracked — never repeats the same mistake twice
  • Compounds knowledge across sessions like a real engineer

Other tools start from zero every session. Mrs. Kitty builds institutional knowledge.

See how memory works →
Agentic Browser Automation 140+ commands in your real Chrome — completely undetectable, replaces Playwright $99/mo
  • Your real Chrome, completely undetectable — no navigator.webdriver flag
  • ~200 tokens per page vs 13,700 for Playwright MCP — 98.5% token reduction
  • 140+ commands: click, fill, navigate, upload, state management, network control
  • Parallel sessions — 100s of browser instances working together agentically

Find 5-star Amazon sellers. Message 20 Alibaba suppliers. Whatever you do in Chrome, it handles.

See browser automation →
Voice Dictation 99-language Whisper model, CUDA-accelerated on NVIDIA GPUs, near-instant $15/mo
  • You speak 3x faster than you type — 150 WPM vs 40 typed. Up to 30 minutes of continuous speech
  • Whisper large-v3-turbo with CUDA GPU acceleration (float16) for near-instant results
  • Press Alt+X to start, speak naturally, press Alt+X to stop (Windows, Linux, macOS)
  • Screenshot integration: Alt+V pastes a screenshot path — Claude sees what you see

2am debugging session. You whisper the fix. It transcribes faster than you can think.

137 Skills & Agents 9 plugins + 14 superpowers skills + 114 custom skills — the largest skill library for Claude Code $69/mo

9 community-voted plugins (improved and pre-tuned), 14 superpowers skills, and 114 custom skills built for every workflow — from code review to deployment to browser automation. All ship ready to go.

  • Superpowers (by Jesse Vincent) — Upgraded: instant response (eliminated the 1-second delay on every new message), embedded top 5 skill cores directly to remove extra API round-trips
  • Commit Commands (Anthropic) — pre-configured with project commit conventions, zero setup needed
  • PR Review Toolkit (Anthropic) — 6 specialized review agents in parallel: comments, tests, silent failures, type design, code quality, simplification
  • Feature Dev (Anthropic) — architecture-first workflow: explore → design → implement → review with codebase-aware agents
  • Code Review (Anthropic) — automated review against your CLAUDE.md project guidelines on every change
  • Code Simplifier (Anthropic) — auto-simplifies after each coding task while preserving all functionality
  • Frontend Design (Anthropic) — Upgraded: Hormozi conversion-first architecture fused with 2026 design — glassmorphism, multi-layer backgrounds, cursor-following effects. 842-line research-backed skill with Core Web Vitals optimization and 14-point checklist
  • Greptile (by Daksh Gupta & team, YC-backed) — codebase-aware AI search and PR review with full repository context. 500M+ lines reviewed monthly
  • Context7 (by Upstash) — injects up-to-date, version-specific library docs directly into prompts. Eliminates API hallucinations
  • Agent-Browser (CalebDane7) — agentic browser automation replacing Playwright. 140+ commands, your real Chrome, completely undetectable. Includes Claude Code skill for autonomous UI research
  • Humanizer (v2.2.0) — removes 24 categories of AI writing patterns from text. Two-pass process makes AI-generated content sound naturally human-written
  • Poe Media — image and video generation via Poe API. 9+ image models (FLUX, Ideogram, Recraft), 4 video models, upscaling, img2img, and chaining
  • Claude API — intelligent assistance for building apps with the Claude API and Anthropic SDK. Auto-activates when code imports anthropic packages

137 skills covering every workflow. Each one upgraded and pre-wired — what would take hours of configuration works out of the box.

RTK Token Killer Rust CLI proxy — 82% average token savings across 30+ commands $29/mo
  • Intercepts command output before it reaches Claude — compresses intelligently
  • 12 filtering strategies: stats extraction, error-only, JSON schema extraction
  • Verified: 3.2M tokens saved across 664 commands (81.9% average reduction)
  • API responses: up to 98.8% savings via JSON schema extraction
  • ~10ms overhead — effectively invisible

A 30-minute session uses ~150K tokens without RTK, ~45K with it. That’s real money saved.

Cross-Terminal Sync Multiple AI sessions working simultaneously, zero conflicts $25/mo
  • Real-time session awareness across terminals via heartbeat
  • File-level locking prevents concurrent edits to the same file
  • Terminal 2 sees what Terminal 1 is working on — no collisions

Run 3 Claude sessions on 3 different features. They coordinate automatically.

Total value $342/mo

Your Price

$0

Mrs. Kitty works with Claude Code, which already has a subscription. We're building reputation first, not recurring revenue. Free now, enterprise tier later.

8 minutes. One installer. Done.

1

You describe. It researches.

Tell it what you want. It reads your codebase, maps dependencies, checks patterns, and reaches 100% confidence before writing a single line of code.

2

Two AIs review the plan. You approve.

The first AI writes the plan. A second AI tears it apart, looking for unverified claims and missing edge cases. Only plans that survive both reach you for sign-off.

3

It builds, tests, and remembers.

59 gates enforce quality on every edit. If something breaks, it fingerprints the failure, tries a different approach, and remembers the fix permanently. No manual QA.

Found a problem? It goes back and fixes it.

How you actually use it.

You don't need to know how to code. You need to know where you're going.

1

Talk to it.

Press Alt+X and speak, or type. Tell it exactly what you want. Be specific about what's wrong or what you need built.

2

One thing at a time.

Don't dump ten tasks on it. Give it one clear goal. Let it finish. Then give it the next one.

3

It does the research.

It reads your codebase, finds dependencies, checks documentation. You don't have to explain your project from scratch every time.

4

It asks you questions.

If something is unclear, it asks before guessing. No more waking up to find it built the wrong thing.

5

Review the plan.

It shows you a plan before touching anything. Read it. Ask it to explain anything you don't understand. You sign off before it writes a single line.

6

You steer. It rows.

You're the captain. You don't need to know how the engine works. You need to know where the ship is going. The AI handles execution.

7

Ask it to prove it works.

"How do we test this? What edge cases could break? How do we fix those without breaking what we already have?" It runs the tests itself and shows you the results.

8

It finds the best way.

It researches who has solved this problem best, copies the most proven approaches, and combines them. You get battle-tested solutions, not first drafts.

8 innovations no other tool has.

Not one. We checked.

🎯

100% confidence threshold

It won't touch your code until it scores 100/100 on a 5-factor check: plan quality, file understanding, dependency mapping, research depth, and context. Other tools edit on a hunch.

How the scoring works →
🔄

Two AIs review every plan

The first AI writes the plan. A second AI tears it apart, looking for unverified claims and missing edge cases. Only plans that survive both reach you.

How adversarial review works →
📡

Automatic scope detection

Tests that used to pass now fail? Same fix attempted three times? New errors appearing? The system catches it and forces a replan before things get worse.

How scope detection works →
🧬

Never retries the same broken fix

Every fix attempt gets fingerprinted. If a new fix is the same idea as one that already failed, it's blocked. Forces genuinely different approaches instead of looping.

How dedup works →
🌐

Browser automation in your real Chrome

agent-browser

140+ commands. Completely undetectable. Reads a page in ~200 tokens instead of 13,700. Runs parallel sessions across hundreds of tabs. Open source.

~10ms overhead per gate

59 hooks, each under 10ms. Parallel agents across terminals. Rust-powered token compression saves 82% on average. You get results, not loading screens.

How the pipeline works →
🛡

Deploy safety system (6 gates)

Auto-backup before every server write. No blind overwrites, no stale file uploads, no restores without diffing first. Six dedicated gates protect production deployments from the most common AI mistakes.

🚫

65 blocked destructive commands

Filesystem deletion, disk formatting, registry edits, dangerous git operations, process killing — all denied before they execute. The AI literally cannot run rm -rf on your machine.

Oh, and it takes voice commands.

  • You speak 3x faster than you type. 150 words per minute vs 40 typed. Up to 30 minutes of continuous speech, transcribed near-instantly on your GPU
  • 99-language Whisper model. Press Alt+X to start, speak naturally, press Alt+X to stop. Works on Windows, Linux, and macOS
  • 2am debugging session. You whisper the fix. It transcribes faster than you can think
  • Alt+V pastes a screenshot. Claude sees exactly what you see

Zero risk. For real.

8 minutes to install. If it takes longer, uninstall in 2 clicks.

🖥

Windows: Everything lives in WSL — your system stays untouched. Linux: Installs to ~/.local — your system packages stay untouched.

🔓

No lock-in. Standard Claude Code underneath. Remove Mrs. Kitty anytime.

👤

No account. No email. No tracking. Just download and run.

EU AI Act enforcement begins August 2, 2026.

Fines up to €35M or 7% of global turnover.

Every company using AI in development will need governance tooling.

Mrs. Kitty is ready. Is your AI?

See documentation →

Built for Claude Code — the highest-benchmarked AI model. But Mrs. Kitty runs entirely on your machine. The gate system, memory engine, and all 59 enforcement layers are local infrastructure — adaptable to any AI model, including open-source.

Mrs. Kitty AI Controller

You have the ideas. Now you have the team.

Download Mrs. Kitty. Tell it what you want. Watch it get built.

$322/mo value — Your price: $0

v3.1.0 · Free right now · No credit card · Works with Claude Code