A new benchmark study shows that leading AI models, including ChatGPT, Claude, and Gemini, still lag behind humans in visual math reasoning.
AI-focused accounting ERP provider DualEntry tested some of the most popular AI models on various accounting workflows and found that, at best, they're 77.3% accurate.