Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
41
Jeremie Tisby
Frobenius
Follow
21world's profile picture
1 follower
Ā·
4 following
AI & ML interests
None yet
Recent Activity
replied
to
lewtun
's
post
about 18 hours ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute š„ How? By combining step-wise reward models with tree search algorithms :) We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think" We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover: š Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time. š Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets. š§ Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM Here's the links: - Blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute - Code: https://github.com/huggingface/search-and-learn Enjoy!
Reacted to
lewtun
's
post
with š„
about 18 hours ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute š„ How? By combining step-wise reward models with tree search algorithms :) We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think" We're open sourcing the full recipe and sharing a detailed blog post. In our blog post we cover: š Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time. š Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets. š§ Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM Here's the links: - Blog post: https://huggingface.co/spaces/HuggingFaceH4/blogpost-scaling-test-time-compute - Code: https://github.com/huggingface/search-and-learn Enjoy!
View all activity
Organizations
None yet
Frobenius
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
5 models
4 months ago
google/paligemma-3b-pt-224
Image-Text-to-Text
ā¢
Updated
Sep 21
ā¢
28k
ā¢
274
internlm/internlm2_5-7b-chat
Text Generation
ā¢
Updated
Aug 20
ā¢
33.2k
ā¢
185
microsoft/Phi-3-small-128k-instruct
Text Generation
ā¢
Updated
Sep 12
ā¢
9.24k
ā¢
172
VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct
Text Generation
ā¢
Updated
Aug 14
ā¢
15.7k
ā¢
32
Lewdiculous/FuseChat-Kunoichi-10.7B-GGUF-IQ-Imatrix
Updated
Mar 6
ā¢
165
ā¢
8
liked
a dataset
4 months ago
gate369/Alpaca-Star
Viewer
ā¢
Updated
Apr 10
ā¢
418
ā¢
903
ā¢
17
liked
a model
4 months ago
01-ai/Yi-1.5-9B-Chat
Text Generation
ā¢
Updated
Jun 26
ā¢
23.7k
ā¢
135
liked
8 models
8 months ago
upstage/SOLAR-10.7B-v1.0
Text Generation
ā¢
Updated
Sep 10
ā¢
22.4k
ā¢
292
QuantFactory/Meta-Llama-Guard-2-8B-GGUF
Text Generation
ā¢
Updated
Apr 19
ā¢
372
ā¢
11
HuggingFaceH4/zephyr-7b-gemma-v0.1
Text Generation
ā¢
Updated
Mar 3
ā¢
815
ā¢
121
google/gemma-1.1-7b-it-GGUF
Updated
Jun 27
ā¢
1
ā¢
21
LoneStriker/gemma-1.1-7b-it-GGUF
Updated
Apr 6
ā¢
11
ā¢
4
microsoft/Phi-3-mini-4k-instruct-gguf
Text Generation
ā¢
Updated
Jul 2
ā¢
17.1k
ā¢
468
microsoft/Phi-3-mini-128k-instruct
Text Generation
ā¢
Updated
Aug 20
ā¢
872k
ā¢
1.61k
google/gemma-1.1-7b-it
Text Generation
ā¢
Updated
Jun 27
ā¢
16k
ā¢
ā¢
267
liked
5 models
9 months ago
stabilityai/stable-cascade
Text-to-Image
ā¢
Updated
Mar 16
ā¢
25.5k
ā¢
1.28k
unsloth/gemma-7b
Text Generation
ā¢
Updated
Sep 3
ā¢
3.4k
ā¢
5
NousResearch/Nous-Hermes-llama-2-7b
Text Generation
ā¢
Updated
Sep 26
ā¢
11.2k
ā¢
68
NousResearch/Nous-Hermes-Llama2-70b
Text Generation
ā¢
Updated
Aug 27, 2023
ā¢
757
ā¢
83
NousResearch/Nous-Hermes-Llama2-13b
Text Generation
ā¢
Updated
Apr 23
ā¢
46.6k
ā¢
305
Load more