Dr. Chad PhD

Doctor-Chad-PhD

AI & ML interests

😎

Recent Activity

liked a model 1 day ago
internlm/internlm3-8b-instruct-gguf
upvoted a collection 1 day ago
InternLM3
liked a model 1 day ago
internlm/internlm3-8b-instruct
View all activity

Organizations

None yet

Doctor-Chad-PhD's activity

New activity in MiniMaxAI/MiniMax-Text-01 1 day ago
New activity in kyutai/helium-1-preview-2b 2 days ago

fix architecture

2
#3 opened 2 days ago by
TPM-28
reacted to MoritzLaurer's post with πŸ˜” 3 days ago
view post
Post
2873
FACTS is a great paper from @GoogleDeepMind on measuring the factuality of LLM outputs. You can now download their prompt templates from @huggingface to improve LLM-based fact-checking yourself!

πŸ“ The paper introduces the FACTS Grounding benchmark for evaluating the factuality of LLM outputs.

πŸ€– Fact-checking is automated by an ensemble of LLM judges that verify if a response is fully grounded in a factual reference document.

πŸ§ͺ The authors tested different prompt templates on held-out data to ensure their generalization.

πŸ“š It's highly educational to read these templates to learn how frontier labs design prompts and understand their limitations.

πŸ’Ύ You can now download and reuse these prompt templates via the prompt-templates library!

πŸ”„ The library simplifies sharing prompt templates on the HF hub or locally via standardized YAML files. Let’s make LLM work more transparent and reproducible by sharing more templates like this!

Links πŸ‘‡
- prompt-templates docs: https://moritzlaurer.github.io/prompt_templates/
- all templates on the HF Hub: MoritzLaurer/facts-grounding-prompts
- FACTS paper: https://storage.googleapis.com/deepmind-media/FACTS/FACTS_grounding_paper.pdf
New activity in Qwen/Qwen2.5-Turbo-1M-Demo 6 days ago

qwen-turbo-1101

7
#1 opened about 2 months ago by
LaferriereJC
New activity in microsoft/phi-4 7 days ago
replied to singhsidhukuldeep's post 9 days ago
reacted to davidberenstein1957's post with πŸš€ 12 days ago