Amin Mirlohi · PhD, Computer Science

I build AI systems that retrieve, reason, and hold up under evaluation.

Independent builder and AI engineer working at the intersection of retrieval, multi-agent systems, and the discipline of making models reliable. I build real products and write about the engineering, not the hype.

Read the writing Get in touch

Focused onAI for learningRetrieval systemsEvaluation

PhD

Computer Science

MSc

Artificial Intelligence

RAG · Agents · Eval

Core focus

Toronto

Based in Canada

What I work on

Engineering grounded, reliable AI.

Four threads run through everything I build, from research prototypes to production systems. Each leads with the same question: how do we know this actually works?

Retrieval & RAG systems

Designing retrieval-augmented systems that ground model output in real sources: hybrid retrieval (lexical + dense), semantic chunking, reranking, and citation-grounded answers.

Multi-agent orchestration

Building agent systems that plan, call tools, and manage state reliably, with deterministic guards, retries, and explicit boundaries instead of hopeful prompting.

Evaluation & reliability

Treating evaluation as engineering: golden datasets, faithfulness and groundedness metrics, regression gates, and adversarial testing so quality is measured, not assumed.

Applied ML & research

Bringing PhD-level research rigor to production: information retrieval, network analysis, and adversarial/applied machine learning, translated into systems that ship.

Latest writing

Recent articles

All writing

June 28, 20267 min read

Seeing Isn't Measuring: Fixing the Design-to-Code Plateau

AI can build a component from a screenshot. It still can't tell you whether it matched. The fix is the same one that applies to every agentic loop: separate the measurement from the judgment.

#Agents #Verification #Design to Code #Claude Code #Frontend

June 23, 20265 min read

Claude Tag: Anthropic Puts an Agent in the Channel

Anthropic's new Claude Tag turns @Claude into a persistent, shared teammate inside Slack. Here's what it actually does, how it's governed, and why the form factor matters more than the feature list.

#Claude #Agents #Slack #Enterprise AI #Claude Code

June 17, 20268 min read

Loop Engineering, Defined

The unit of agentic work has moved from the prompt to the loop. A working definition, an anatomy of what a loop is made of, and the one principle that separates loops you can trust from agents that agree with themselves.

#Loop Engineering #Agents #Claude Code #Reliability #AI Engineering

Shorter, dated notes live in the journal →

Read along, or reach out.

If you care about retrieval, agents, or evaluation done properly, the writing is the best place to start. For anything else, the door is open.

Browse the writing Get in touch