Imagine trying to build a reliable AI assistant. The core task seems simple: you give it a goal, it thinks, uses some tools (like a calculator or a web search), and gives you an answer. However, the ...