KindLM — Roadmap & Timeline

Planning assumptions:

  • Solo developer (Petr), ~20 hrs/week dedicated to KindLM
  • Claude Code handles bulk of implementation from specs
  • "Week 1" = first week of active development
  • Dates are relative — anchor Week 1 to actual start date

Phase 1: MVP (Weeks 1–8) — COMPLETED

Goal: Ship a working CLI that someone can npm install, write YAML, and get pass/fail in CI. Show HN.

Week 1–2: Foundation

TaskEpicDeliverable
Monorepo setup (Turborepo, npm workspaces, tsup, Vitest)Build passes, test runner works
Config parser (YAML → Zod validated typed config)1kindlm validate works
Provider adapter interface + OpenAI adapter2Can call GPT-4o and get typed response
Anthropic adapter2Can call Claude and get typed response
Basic test engine (single run, single suite)1kindlm test executes one test

Milestone: kindlm test runs one test against one provider and prints result.

Week 3–4: Assertions

TaskEpicDeliverable
tool_called assertion2Verify tool name + args
tool_not_called assertion2Verify tool absence
tool_order assertion2Verify tool sequence
schema assertion (AJV)2Validate JSON output against schema
judge assertion (LLM-as-judge)2Score responses with criteria
no_pii assertion2Detect SSN, CC, email, phone, IBAN
keywords_present / keywords_absent2Keyword guardrails
latency / cost assertions2Performance gates
Multi-turn tool simulation2Tool response injection loop

Milestone: All 11 assertion types working. Full test suite against mock providers passes.

Week 5–6: Reporters & CLI

TaskEpicDeliverable
Terminal reporter (colored output)3Pretty pass/fail in terminal
JSON reporter3Structured report file
JUnit XML reporter3CI-compatible output
Compliance reporter (markdown)5EU AI Act Annex IV document
Multi-run aggregation (N runs per test)2Configurable runs count
Pass/fail gates3--gate 90 sets threshold
kindlm init command (scaffolding)1Creates starter kindlm.yaml
kindlm validate command1Config validation without API calls
Suite/test filtering (-s, -t, --grep)1Run subset of tests
Ollama adapter2Local model support

Milestone: Full CLI feature-complete. Can init, validate, test, filter, report.

Week 7: Baselines & Polish

TaskEpicDeliverable
kindlm baseline set4Save current results
kindlm baseline compare4Diff against baseline
kindlm baseline list4Show saved baselines
drift assertion4Semantic drift detection
Multi-provider comparison6Same tests across models
CI auto-detection (GitHub Actions, GitLab CI)3Git metadata in reports
Error messages polishActionable, friendly errors
README with examplesInstall → first test in README

Milestone: Complete MVP. All features working, documented, tested.

Week 8: Launch

TaskEpicDeliverable
npm publish @kindlm/core + @kindlm/cli v0.1.0Package live on npm
GitHub repo publicMIT licensed, stars welcome
Landing page live (kindlm.com)React landing page deployed
Show HN postLaunch post with demo
Blog post: "Why we built KindLM"Origin story, problem statement
Twitter/X announcement thread5-tweet thread with terminal GIF
YouTube: "First test in 5 minutes" tutorialQuick start video

Milestone: Public launch. Target 200+ GitHub stars in first week.


Phase 2: Cloud Beta (Weeks 9–14) — COMPLETED

Goal: Ship Cloud dashboard for test history, trends, and team collaboration. Validate willingness to pay.

Week 9–10: Cloud API

TaskEpicDeliverable
Cloudflare Workers + Hono setup7/v1/health responds
D1 schema + migrations7All tables created
GitHub OAuth flow7kindlm login works
Token generation + auth middleware7Authenticated API calls
POST /v1/runs/upload7CLI can upload results
GET /v1/projects, CRUD7Project management
GET /v1/projects/:id/runs (list, filter)7Run history with pagination
Rate limiting middleware7Per-org limits enforced
Plan gating middleware7Free/Team/Enterprise limits

Milestone: API complete. CLI can login, upload, and API returns historical data.

Week 11–12: Dashboard

TaskEpicDeliverable
Dashboard app setup (React + Cloudflare Pages)7cloud.kindlm.com loads
Login page (GitHub OAuth redirect)7User can authenticate
Project list view7See all projects
Run history view (table + pass rate chart)7See trends over time
Run detail view (assertion breakdown)7Click into specific run
Run comparison view (diff against baseline)7Side-by-side comparison
Responsive mobile layout7Works on phone

Milestone: Dashboard usable. Beta testers can see their test history.

Week 13–14: Beta & Content — PARTIALLY COMPLETED

TaskEpicStatusDeliverable
Invite 20 beta testers from CLI usersPendingReal usage data
Webhook notifications (run.completed, run.failed)7DoneHMAC-signed webhook dispatch
Team management (invite, roles)7DoneMulti-user orgs (owner/admin/member)
Data retention cron7DoneAuto-cleanup per plan (daily at 02:00 UTC)
GitHub OAuth flow7Donekindlm login → browser → token paste
Blog: "EU AI Act compliance with KindLM"PendingCompliance content push
Product Hunt launchPendingCloud announcement
Blog: "From CLI to Cloud"PendingCloud launch story
YouTube: "Team dashboard walkthrough"PendingFeature demo

Milestone: Cloud beta live with 20 users. Product Hunt launch. Collecting feedback.


Phase 3: GA & Monetization (Weeks 15–22) — IN PROGRESS

Goal: Turn on billing, ship enterprise features, reach $7,890 MRR by month 6.

Week 15–16: Billing — COMPLETED

TaskEpicStatusDeliverable
Stripe integration (Team plan $49/mo)9DoneSelf-serve upgrade via Stripe Checkout
Billing API endpoints9DoneGET /billing, POST /checkout, POST /portal
Stripe webhook handler9DoneAutomatic plan activation on payment
Plan upgrade/downgrade flow9DoneImmediate feature access change
Enterprise contact form9Pending"Contact us" → Slack notification
Compliance PDF export (Team+)5PendingBranded PDF download

Milestone: Revenue! Stripe billing active.

Week 17–18: Enterprise Features

TaskEpicDeliverable
SSO / SAML integration8Okta, Azure AD support
Audit log API8Queryable compliance trail
Signed compliance reports (Ed25519)8Tamper-proof reports
SLA monitoring setup99.9% uptime tracking

Milestone: Enterprise tier feature-complete. Ready for regulated companies.

Week 19–20: Growth

TaskEpicDeliverable
Documentation site (docs.kindlm.com)Full docs with search
"AI Agent Testing Guide" (SEO content)Organic traffic driver
Conference talk (local Prague/Berlin)In-person credibility
Plugin system for custom assertionsCommunity extensibility
GitHub Actions marketplace action3One-click CI setup
VS Code extension (YAML autocomplete)DX improvement

Milestone: Organic growth flywheel active. Community contributing.

Week 21–22: Optimization — PARTIALLY COMPLETED

TaskEpicStatusDeliverable
Performance optimization (parallel test execution)PendingFaster runs
Additional providers (Google Gemini, Mistral, Cohere)2Done6 total providers
Webhook integrations (Teams, Discord, PagerDuty)7PendingBeyond Slack
Annual pricing option9PendingDiscount for commitment
Customer interviews (10 paying users)PendingRoadmap input

Milestone: Stable product, growing revenue, clear roadmap for H2.


Key Milestones Summary

WeekMilestoneSuccess MetricStatus
2First test runskindlm test works end-to-endDone
4All assertions11 assertion types passingDone
7MVP completeFull CLI, all features, testedDone
8Public launch200+ GitHub stars first weekDone
10Cloud API liveUpload + retrieve workingDone
12Dashboard liveBeta testers using itPending
14Cloud beta launchProduct Hunt, 50 Cloud signupsPending
16First revenueStripe billing activeDone (API ready)
18Enterprise GASSO, audit log, signed reportsPending
22Month 6 target$7,890 MRRPending

Dependencies & Blockers

DependencyBlocksMitigation
Cloudflare D1 GA stabilityCloud launchMonitor D1 status; Turso as fallback
GitHub OAuth App approvalCloud authApply early, use personal OAuth app for dev
Stripe account activationBillingApply during Phase 2
EU AI Act final technical standardsCompliance spec accuracyTrack EASA/CEN publications, update mapping
Domain registration (kindlm.com)Landing page launchRegister in Week 1

What's NOT on this roadmap (Backlog for H2 2026+)

  • Visual test builder (GUI for writing tests)
  • Real-time monitoring / observability mode
  • Prompt optimization suggestions from test results
  • Multi-language SDK (Python, Go)
  • Self-hosted Cloud option (Docker image)
  • Marketplace for community assertion plugins
  • SOC 2 certification for Cloud
  • Mobile app for test monitoring