AI/ML Engineer • RAG & LLM Systems
I make AI systems practical, performant, and production-ready.
Currently at Radical Squares building RAG pipelines, LLM orchestration, and real-time inference infrastructure.
Selected Work
Projects
Brckt
Production RAG system for real-time tennis analytics.
CodePilot
Multi-agent AI system for autonomous code generation.
ML-Monitor
Production MLOps platform for real-time fraud detection.
VerbaQuery
Industrial RAG with hybrid retrieval (BM25 + dense embeddings) and cross-encoder re-ranking for enterprise document search.
About
Background
MS in Computer Science from Indiana University with a 3.9 GPA. Focused on making machine learning work in production.
My work spans RAG systems, LLM pipelines, and MLOps infrastructure. I care deeply about building AI that's reliable, fast, and actually useful.
Experience
GenAI Engineer
Jan 2026 — PresentDeveloping production GenAI applications integrating OpenAI GPT-4o APIs with LangChain. Built full-stack AI platform with React/FastAPI, implemented RAG pipeline with vector embeddings, and deployed microservices with Docker/Redis achieving 94.6% API cost reduction.
AI/ML Engineer
Dec 2024 — PresentBuilt real-time GenAI application using Llama 3.3-70B LLM with streaming responses via Server-Sent Events (SSE). Developed scalable backend API with FastAPI and async processing, containerized with Docker and deployed with Caddy reverse proxy.
GenAI Developer
Jun 2025 — Dec 2025Architected enterprise RAG system using GPT-4 API with LangChain orchestration. Implemented 5-stage pipeline with hybrid retrieval (BM25 + semantic), built vector search with ChromaDB/FAISS for 10,000+ documents achieving 94% retrieval accuracy.
Tech Stack