AI-Agent Benchmark

From GM-RKB

An AI-Agent Benchmark is an AI benchmark task that evaluates AI agent-based systems.



References

2024-11-27

[1] https://arxiv.org/abs/2401.00741
[2] https://arxiv.org/abs/2402.01680
[3] https://arxiv.org/abs/2310.13077
[4] https://arxiv.org/abs/2402.14034
[5] https://arxiv.org/abs/2311.08592