Build an agentic RAG pipeline with NVIDIA NeMo

The Agent Stack #008 — Monday Build NVIDIA just dropped something builders should care about. Their NeMo Retriever introduces “agentic retrieval” - moving beyond simple semantic similarity to actually reason about what information you need. Building Beyond Basic RAG Traditional RAG is broken. You throw documents at a vector database, hope semantic similarity finds the right chunks, and pray your LLM can piece together coherent answers. NVIDIA’s approach flips this. ...

March 16, 2026 · 2 min · Rob Taylor

Agent-Audit shows how to cost your AI before deploy

The Agent Stack #006 — Monday Build Chrome 146 just shipped WebMCP support. That means any website can now expose tools for your AI agents to use. The real story isn’t the API—it’s what happens when agents start hitting paid services at scale. Most builders are flying blind on costs until their OpenAI bill arrives. Build: Cost estimation before deployment The Agent-Audit repo tackles this head-on. It’s a linter that estimates costs before your agent runs wild. ...

March 2, 2026 · 2 min · Rob Taylor