I turn complex problems into AI systems that hold up in production.

AI & ML systems engineer at Microsoft, specializing in production reliability, AI infrastructure, and anomaly detection at global scale. I also work with early-stage teams on building AI systems that last.

Siddharth Agrawal

About

I use AI and ML to solve real business problems — not just to build interesting models. At Microsoft, I work on the health and quality of Azure, helping the platform stay reliable for its customers at global scale.

What drives me is understanding a business's most important challenges, then figuring out how AI and ML can help — quickly, cleanly, and with impact. I work iteratively: scoping tightly, shipping early, and improving based on feedback from the system, the data, and the people using it.

Translate Problems

I break down complex problems into smaller, solvable chunks.

Scalable and Secure by Design

A good system is designed deliberately to be scalable and secure. I build systems that are easy to reason about, debug, and extend.

Keep it Simple

I believe complexity is an outcome of interaction between simpler components.

Areas of Focus

AI & ML Observability

Production Reliability for AI Systems

LLMs in Production

Fast AI Iteration for Resource-Constrained Teams

Building Trust in Model Outputs at Scale

AI Infrastructure for Early-Stage Teams

Let's Connect

Always interested in discussing ML systems, sharing ideas, or exploring opportunities to build something meaningful together. I'm also open to advising early-stage teams working on hard ML problems.

If you're working on something where AI feels both essential and uncertain — that's usually where the interesting work is.