New top story on Hacker News: OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computers

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computers
6 by kristianpaul | 0 comments on Hacker News.


Comments

Popular posts from this blog

New top story on Hacker News: Show HN: Zipy.ai - Like Sentry + Hotjar, but with less noise

Scallops row warnings 'fell on deaf ears', say UK fishermen, after French 'hurl rocks and smoke bombs' at boats