Evals Don't Give You a Working Product
Two years and billions in funding later, 80% of AI projects still fail. Are evals the solution or the problem?
Long form notes on software and AI engineering
Two years and billions in funding later, 80% of AI projects still fail. Are evals the solution or the problem?
How Learning Machines work under the hood.
Every AI memory system I've used is missing something.
How to build agents that are not only capable, but learn and improve over time.
How to make agents better without fine-tuning or retraining. System-level learning that actually works.
Build a self-learning research agent that captures consensus, explains what changed, and improves over time.
Learn how to build a text-to-sql agent with dynamic context and poor man's continuous learning.
Stop sending your agent transactions to telemetry services. Give your Agents a database and keep your data private.
Understanding Agents by mapping out how they work.
A practical guide to Agent Engineering: the intersection of software, systems and security engineering.
An open-source experiment in collective memory and in-context cultural accumulation for multi-agent systems.
Lessons from 100s of conversations on AI products and how teams are adopting AI.
The multi-agent framework, runtime, and UI built for speed.