~/will-diamond
~/will-diamond/projects
10 entries
$ ls -la --sort=impact --type=shipped

>Projects

Production AI systems, open-source tools, and experiments. The featured work has shipped to real users and solved real problems.

4featured4projects2archived
10/10
$ ls --tier=featured
4 featured
~/projects/claude-os
active

>Claude OS

Turned Claude Code into a stateful personal operating system. Running it to manage this job search right now.

$ cat highlights.md
  • Git-tracked markdown files ARE the memory -- TODAY.md and MEMORY.md load into every session via startup hooks. Inspectable, editable, versioned.
  • FastMCP server with 15+ tools: calendar events, contact lookup, email triage, scheduling, analytics -- all through conversation.
  • Multi-instance coordination: Chief orchestrates all day, Specialists spawn for deep work, Subagents run background tasks in parallel.
  • Running it to manage this actual job search right now -- scheduling interviews, processing emails, tracking pipeline. Not a demo.
Claude CodeNext.jsTypeScriptPythonFastAPISQLiteMCP
~/projects/contoural
completed

>Contoural

AI citation analysis pipeline for an information governance firm. Clients include 30% of the Fortune 500.

$ cat highlights.md
  • 40 hours → 10 hours per engagement (75% reduction), enabling a shift from hourly billing to fixed-price contracts.
  • 340,000+ legal citations classified at 80–90% precision through a 3-stage extraction + classification + correction pipeline.
  • Corrections that persist: a human-in-the-loop feedback system where reviewer corrections retrain the classifier via replay -- accuracy compounds over time.
  • Multi-LLM orchestration via LiteLLM with Instructor for structured extraction; LLM-as-judge error taxonomy for systematic quality tracking.
PythonFastAPIOpenAILiteLLMInstructorPostgreSQLpgvector
~/projects/texas-hold-llm
archived

>Texas Hold LLM

Full-stack poker simulation where LLM agents play Texas Hold'em. Why poker is a great LLM benchmark.

$ cat highlights.md
  • Three-prompt architecture: game state (cards, pot, position), conversation history (table talk), and player profile (archetype, history) unified into one context.
  • Table talk that actually influences decisions -- agents read bluffs, make threats, adjust strategy based on what others say.
  • 8 distinct agent archetypes with personalities (Aggressive, Passive, Bluffer, etc.) and ELO rating system with K-factor calibration.
  • Event-sourced game engine with 13 event types and transparent reasoning panels -- every agent decision is fully inspectable.
PythonOpenAIFastAPINext.jsTypeScript
Texas Hold LLM
~/projects/open-source-legislation
active

>Open Source Legislation

All 50 US states' statutes in a single queryable format with semantic search. Open sourced when the startup shut down.

$ cat highlights.md
  • 50 states standardized into a single schema: 2M+ legislative sections indexed with consistent hierarchy, citations, and metadata.
  • 4 embedding types per section -- text, summary, HyDE (hypothetical document embeddings), name -- for different retrieval strategies.
  • Built pgvector semantic search for legal text before RAG became mainstream. Still receiving commits through June 2025.
  • Open sourced when Recodify shut down. The startup is gone; the infrastructure persists as the most complete open dataset of US state law.
PythonPostgreSQLpgvectorOpenAINext.js
$ ls --tier=projects
4 projects
~/projects/paid-smol-podcaster
$ cat ~/projects/paid-smol-podcaster/README.md
#Paid Smol Podcaster
Production podcast pipeline for Latent Space. Claude picks the best clips; Levenshtein distance syncs audio timestamps when editorial cuts shift the timing.
PythonFlaskCelery
active

>Paid Smol Podcaster

Production podcast pipeline for Latent Space. Claude picks the best clips; Levenshtein distance syncs audio timestamps when editorial cuts shift the timing.

Python
Flask
Celery
Redis
Anthropic
+3 more
~/projects/chrome-history-mcp
$ cat ~/projects/chrome-history-mcp/README.md
#Chrome History MCP
MCP server that gives Claude access to your Chrome history. 6 tools -- search, reconstruct sessions, analyze patterns. Auto-detects profiles across macOS/Linux/Windows.
TypeScriptMCPSQLite
completed

>Chrome History MCP

MCP server that gives Claude access to your Chrome history. 6 tools -- search, reconstruct sessions, analyze patterns. Auto-detects profiles across macOS/Linux/Windows.

TypeScript
MCP
SQLite
~/projects/pg-typed-pydantic
pg-typed-pydantic
archived

>pg-typed-pydantic

Fork of PgTyped that outputs Pydantic models instead of TypeScript. SQL-first type safety for Python -- a gap PgTyped filled for TS but nothing filled for Python.

TypeScript
Node.js
Python
Pydantic
PostgreSQL
+1 more
~/projects/ask-abe
$ cat ~/projects/ask-abe/README.md
#Ask Abe
Legal RAG chatbot built before RAG had a name. Ask questions about US law; Abe retrieves the actual statute text and generates grounded answers with citations.
Next.jsTypeScriptPython
completed

>Ask Abe

Legal RAG chatbot built before RAG had a name. Ask questions about US law; Abe retrieves the actual statute text and generates grounded answers with citations.

Next.js
TypeScript
Python
PostgreSQL
pgvector
+2 more
$ ls --tier=archive
2 archived