I’m a hands-on engineer who keeps high-scale systems alive. Right now I architect AWS-native infrastructure handling hundreds of millions of daily requests for a real-time gaming platform — infrastructure as code, event-driven services, and data pipelines. After years leading as a CTO, I’m back where I do my best work: on the hardest problems.
The engineers who pull ahead won’t be the ones AI replaces — they’ll be the ones who wield it well. I treat models as production infrastructure: gated, non-blocking, and human-in-the-loop — never a bolt-on, never unsupervised. Used right, that leverage lets one engineer ship what used to take a team.
AI-augmented workflows let a small, senior team — or a single engineer — ship what used to take a floor of engineers. I design the pipelines, prompts, and guardrails that turn that leverage into reliable throughput.
Models handle the toil — boilerplate, migrations, first-draft code and docs. Architecture, tradeoffs, and the call on what ‘good’ means stay where they belong: with the engineer.
An AI feature is a distributed system like any other: async pipelines that never block the app, versioned prompts and models, observability, cost and latency budgets, and failure modes planned for up front.
The real multiplier isn’t my own output — it’s the tooling, standards, and review practices that make every engineer around me faster and safer with AI.
A fleet of production and near-production Claude systems — a live trading optimizer, an agentic inbox brief, an AI video pipeline, a fitness coach — built and run largely solo. That breadth is only possible with AI leverage.
Every model runs behind a gate. Claude annotates pull requests but never blocks a merge; a trading optimizer escalates suggest → paper → live only past a confidence bar, behind a circuit breaker; coaching stays descriptive, never prescriptive.
LLM calls are async, budgeted, and fallback-safe — a recording plays whether or not its AI summary is ready, and a two-tier Haiku/Sonnet split trades cost against depth per call.
A production fleet of algorithmic bots trading Kalshi’s short-window crypto prediction markets on momentum and trend signals — with Claude wired in to read live performance and propose strategy refinements. Containerized, deployed to AWS, running 24/7.
A daily agent that reads Email and Slack from the last 24h and synthesizes a ranked brief — urgent, committed, FYI — with Claude Sonnet. Each item opens a chat thread where the agent drafts tickets and replies.
An MCP server that exposes JIRA project health as tools any LLM client can call — surfacing blocked, stalled, and ‘bouncing’ tickets, at-risk epics, and transition bottlenecks for agentic project monitoring.
An open-source stack that puts whatever you’re building on a real HTTPS URL straight from your own machine — through a single outbound Cloudflare Tunnel, with Caddy routing each hostname to the right container. No server, no open ports, no deploy.
A fitness app with a conversational AI coach — Claude Sonnet with multi-turn memory and coaching personas — plus body-composition analysis from a photo (Claude Vision, through an S3 → SQS → Lambda pipeline). A real-time, on-device form coach is in progress.
A multi-tenant screen-recording platform — browser capture uploaded direct to S3 (the app server never touches video bytes), MediaConvert transcode to HLS, and a Claude + Transcribe pipeline for transcripts, summaries, chapters, and action items. In use internally and by external customers.
A cross-platform tracker for people running peptide, GLP-1, and hormone protocols — it remembers your regimen, schedules reminders, and handles the dosing and reconstitution math. It logs injections and bloodwork and imports Apple Health / Health Connect data, shipped as web plus native iOS and Android from one codebase.
A content-transformation platform that migrates, creates, and enhances documentation across SharePoint, Dozuki, and Confluence — including a desktop recorder that captures a workflow and turns it into a step-by-step SOP, with a versioned document-lifecycle engine underneath.
Architected the medallion data lake & warehouse strategy for a utility serving 3.7M+ consumers — Apache Iceberg tables, a two-phase CDC pipeline (row-hash updates, table-diff deletes), and a zero-downtime cutover from legacy Oracle.
Delivered security remediation across production apps and AWS environments — zero-trust microsegmentation with Illumio, Cognito/IAM identity hardening, encryption at rest and in transit, and an automated security-test suite.
Built an event-driven microservices platform for warehouse and supply-chain operations — decoupled services over a message backbone, with fault-tolerant integrations to third-party logistics systems.
Delivered an encrypted data-transport and financial reporting platform with full audit traceability — moving sensitive financial data across systems under SOX controls, end to end.
High-scale architecture, building with AI, or just trading notes on the craft — whatever it is, my inbox is open. Say hello.