Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up
Deng Benyong's picture
4

Deng Benyong

Watcher12

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago
PlanBench-XL: Evaluating Long-Horizon Planning of LLM Tool-Use Agents in Large-Scale Tool Ecosystems
upvoted a paper 20 days ago
AdaPlanBench: Evaluating Adaptive Planning in Large Language Model Agents under World and User Constraints
upvoted a paper 5 months ago
NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems
View all activity

Organizations

None yet

Watcher12 's datasets

None public yet
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs