A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
AgentBench is a standout agent — with 3.2k GitHub stars and growing adoption, a solid trust score of 65.1/100, and native support for REST.
Leverage conversational for enhanced productivity
| Type | Agent |
| Language | Python |
| Trust Score | 65.1/100 (High) |
| Stars | ★ 3,250 |
| Categories | General |
| Protocols | REST |
| Source | https://github.com/THUDM/AgentBench |
Add a trust badge to your README:
[](https://fushu.dev/agent/52716105259d)
click to copy
Install now and integrate into your workflow in minutes.