Reducing Hallucinations by 60% Without Changing the Model

Retrieval optimization, prompt engineering, and A/B testing for enterprise LLMs

Read More →


Measuring AI Impact When You Can't A/B Test

Using quasi-experimental methods to evaluate feature value at scale

Read More →


Some Toy Algorithms - Sentiment Classification

Implementing commonly used models from scratch

Read More →