Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
floom
's Collections
Coding
Reasoning
ICL
RL
Model Training
Agents
NLU
Training data
RAG
Data Efficient Approaches
Long-context
Personalization
sentence-transformer-models
Tool Use & more
Feedback Analysis
Model Safety
Webscraping
Timeseries
Evaluation
Memory
SSM
TabularData
Efficient Serving/Inference
Synthetic Data Generation
Hallucination
Frontier research ideas
Model Safety
updated
Apr 1
Upvote
-
Evaluating Frontier Models for Dangerous Capabilities
Paper
•
2403.13793
•
Published
Mar 20
•
7
Coercing LLMs to do and reveal (almost) anything
Paper
•
2402.14020
•
Published
Feb 21
•
12
Upvote
-
Share collection
View history
Collection guide
Browse collections