Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

abhshkp
/
litm-benchmark-suite-v4

ml-intern
lost-in-the-middle
long-context
position-bias
benchmark
Model card Files Files and versions
xet
Community
litm-benchmark-suite-v4
162 kB
Ctrl+K
Ctrl+K
  • 1 contributor
History: 84 commits
abhshkp's picture
abhshkp
Fix scoring bug in Exp 4: check all numbers for expected answer, not just first number
3e790f1 verified 7 days ago
  • experiments
    Fix scoring bug in Exp 4: check all numbers for expected answer, not just first number 7 days ago
  • kaggle
    Add multi-model comparative runner with Mistral as default 7 days ago
  • src
    Add Curve Shape Classifier (CSC) + enhanced metrics 7 days ago
  • .gitattributes
    1.52 kB
    initial commit 17 days ago
  • README.md
    28.5 kB
    Upload README.md 16 days ago
  • analyze_taxonomy.py
    10.5 kB
    Add taxonomy analysis script with shape gallery, PBI charts, LaTeX tables 7 days ago
  • config.yaml
    1.66 kB
    Change default model to Mistral-7B-Instruct-v0.3 7 days ago
  • paper_latex_template.tex
    16.3 kB
    Add full LaTeX paper template with Mistral included in model list 7 days ago
  • requirements.txt
    105 Bytes
    Upload requirements.txt 17 days ago
  • run_all.py
    6.16 kB
    Change default model to Mistral-7B-Instruct-v0.3 7 days ago