AI Interview Companion (Second Brain)

Published: December 25, 2025

Technologies: Cloudflare Workers, Durable Objects, Vectorize (RAG), Llama 3.3, TypeScript, Python

RAG Architecture: Built a serverless Retrieval-Augmented Generation system using Cloudflare Vectorize. It converts user notes (STAR stories, technical definitions) into 1024-dimensional embeddings, allowing the Llama 3.3 LLM to recall specific personal experiences during mock interviews.
Engineering Rigor: Developed a custom Offline Evaluation Pipeline in Python (sentence-transformers). This automated benchmark regression-tests the agent’s memory, validating a 90.3% semantic accuracy against a “Golden Dataset” of ground-truth answers.
State Management: Implemented Durable Objects to manage real-time WebSocket connections and chat history, ensuring consistent context retention across distributed edge locations.

Benchmark Results

Figure: Automated benchmarking results showing >90% accuracy in retrieving Personal History and ML concepts.