I keep watching agents nail steps 1-4, then silently drop 5 and 6 while completing 7. These probabilistic failures are common, and a pain in the ass. Here’s what I’m doing to try to fix it.
Your agent forgets tasks, but not in order. And it’s a huge f’ing problem
