Engineering Reliable Agents with Ragas
Moving beyond the "Vibe Check" by instrumenting LangChain agents with the RAG Triad metrics.
Senior AI Engineer & Technical Lead
Specializing in moving LLMs from prototype to production. I bridge the gap between research science and scalable product engineering.

Capabilities
Designing and deploying production-grade AI systems. From model architecture to inference optimization.
End-to-end LLM lifecycle management. Fine-tuning, evaluation, and deployment at scale.
Bridging technical capabilities with market needs. Building AI products that users love.
Architecting scalable, resilient systems that handle millions of requests with grace.
Growing high-performance engineering teams. Mentoring, hiring, and establishing engineering culture.
Portfolio
Researching neural network robustness against occlusions. Published comparative studies on "Masked Face Recognition" performance during the pandemic.
Developed a Reinforcement Learning stack for self-driving vehicles on NVIDIA Jetson Nano hardware. Deployed across 6 European countries.
Designing data governance frameworks and predictive maintenance use-cases for international automotive and pharma clients.
Writing
Moving beyond the "Vibe Check" by instrumenting LangChain agents with the RAG Triad metrics.
Why your AI Agents are failing: They are trying to drink from a swamp. The model is not the bottleneck, the data architecture is.
Stop telling the model to 'Act as Steve Jobs.' The engineering reality of reliable prompting.
Principles
The best AI systems are invisible. They do not demand attention; they just work.
My focus is building the boring, reliable infrastructure that makes this possible. I value simple architecture over clever code. Every system I build is designed to be read by humans, maintained by teams, and trusted by users.
I operate where research meets production. This is where complexity usually kills projects, and where simplicity creates the most value.