Researchers say AI agents are improving, but reliability lags behind – AIDIRECTORY