Building scalable AI systems with LLMs, SLMs, semantic search, NLP, and AI agents.

Writing

Notes on LLM systems, semantic search, embeddings, retrieval, attention, PyTorch, and NLP engineering.

Topics
LLM Architecture, Attention, PyTorch, Decoding, Search, Embeddings, Semantic Search, Retrieval, NLP Systems, Text Processing

Implementation Notes

Concise explanations for specific technical problems, with examples and implementation details.

Long-Form Articles

Broader technical essays published externally, including on Medium and Towards AI.