Blog
2/13/2026Building AI text detection that explains itself
Most AI detectors give you a percentage and call it a day. We built one that shows you which sentences triggered the verdict, how much each one mattered, and why. It uses attention-based attribution on a sliding window transformer.
7/13/2025Are words the best building blocks for AI?
Language tokens are a poor substrate for grounded intelligence. This post argues for structured, world-centric tokens (geometry, dynamics, agency, causality) and outlines bridging mechanisms like cross-attention and V-JEPA to connect language with learned perceptual models.
7/9/2025Agents that learn in production
Most LLM agents don’t learn from deployment experience, but that’s changing. This post covers the research (DPO, GRPO, continual learning), the tooling (OpenPipe’s ART framework), and the first real production deployment (Cursor’s real time RL for Composer), plus the hard gaps still left to close.
3/25/2025The unique risks of audio deepfakes
Human detection of voice deepfakes is unreliable (60–73%); automated detectors hit 98%+ in-lab but fail to generalize to unseen attacks. Risks are rising; mitigation requires provenance standards, robust field-trained detectors, and on-device voice verification.
1/4/2025Design Principles for AI-Based Mental Health Tools
Lessons from building Flourish on addressing sycophancy and the risk of echo chambers, adding user-controlled stateful memory, structuring sessions, anchoring to evidence-based techniques, enforcing therapeutic boundaries, and building specialized, auditable systems.