AI-Agent Benchmark

From GM-RKB
Revision as of 23:06, 28 December 2024 by Gmelli (talk | contribs) (Text replacement - "<B>Examples:</B>" to "<B>Example(s):</B>")

An AI-Agent Benchmark is an AI benchmark task for AI agent-based systems.



References

2024-11-27

[1] https://arxiv.org/abs/2401.00741
[2] https://arxiv.org/abs/2402.01680
[3] https://arxiv.org/abs/2310.13077
[4] https://arxiv.org/abs/2402.14034
[5] https://arxiv.org/abs/2311.08592