ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios
robustness
prompt engineering
ToolEyes evaluates the tool learning capabilities of LLMs in real-world scenarios, identifying their limitations and offering guidance for future research.
Jan 1, 2024