AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)

by THUDM

65.1
High Trust
🔗 View Source 👤 Claim

🚀 Why AgentBench?

AgentBench is a standout agent — with 3.2k GitHub stars and growing adoption, a solid trust score of 65.1/100, and native support for REST.

🚀

Conversational

Leverage conversational for enhanced productivity

🔌 Protocols & Compatibility

REST
⚡ Capabilities
conversational

🔧 Technical Specifications

TypeAgent
LanguagePython
Trust Score65.1/100 (High)
Stars★ 3,250
CategoriesGeneral
ProtocolsREST
Sourcehttps://github.com/THUDM/AgentBench
65.1/100
Trust Score
Well-established with solid validation
3.2k
GitHub Stars
Strong traction and growing adoption
Unverified
Not yet claimed or verified
Tags:#chatgpt#gpt-4#llm#llm-agent

🏷️ Embed Badge

Add a trust badge to your README:

Trust Score Stars
[![Fushu](https://fushu.dev/badge/52716105259d/trust.svg)](https://fushu.dev/agent/52716105259d) click to copy

Get Started with AgentBench

Install now and integrate into your workflow in minutes.

Share this agent: Twitter / X LinkedIn
← Back to Directory