Doing
doing
ยท
AI & ML interests
None yet
Recent Activity
liked
a Space
1 day ago
Qwen/QwQ-32B-preview
liked
a Space
2 days ago
reach-vb/2024-ai-timeline
commented
a paper
3 days ago
ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models
in Multi-Hop Tool Use
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet