Primus
(News) 70B Primus models: https://huggingface.co/collections/trendmicro-ailab/llama-primus-nemotron-70b-68066bf016241419a145a508
Paper β’ 2502.11191 β’ Published β’ 8Note Start by reading the πPrimus Paper! To the best of our knowledge, we are the ππ½ββοΈ first to release datasets covering cybersecurity pretraining, IFT, and reasoning distillation. Of course, we are also the first to pretrain an LLM on a large-scale cybersecurity corpus.
trendmicro-ailab/Llama-Primus-Base
Text Generation β’ 8B β’ Updated β’ 213 β’ 12Note Based on Llama-3.1-8B-Instruct, continually pretrained on 2.77B tokens of cybersecurity text, achieving a π15.88% improvement in the aggregated score across multiple cybersecurity benchmarks.
trendmicro-ailab/Llama-Primus-Merged
Text Generation β’ 8B β’ Updated β’ 811 β’ 13Note Instruct Model! While maintaining nearly the same instruction-following capability as Llama-3.1-8B-Instruct, achieving a π14.84% improvement across multiple cybersecurity benchmarks.
trendmicro-ailab/Llama-Primus-Reasoning
Text Generation β’ 8B β’ Updated β’ 466 β’ β’ 13Note Distilled on reasoning and reflection data from o1-preview for cybersecurity tasks, achieving a π10% improvement on CISSP.
trendmicro-ailab/Primus-Seed
Viewer β’ Updated β’ 174k β’ 232 β’ 17Note Includes high-quality cybersecurity texts manually collected from reputable sources such as wikipedia, MITRE, cybersecurity company websites, CTI, and more.
trendmicro-ailab/Primus-FineWeb
Viewer β’ Updated β’ 3.39M β’ 92 β’ 17Note Includes 2.57B tokens of cybersecurity texts filtered from FineWeb.
trendmicro-ailab/Primus-Instruct
Viewer β’ Updated β’ 835 β’ 213 β’ 6Note Includes approximately 1K QA pairs covering common cybersecurity business scenarios.
trendmicro-ailab/Primus-Reasoning
Viewer β’ Updated β’ 4.89k β’ 167 β’ 12Note Includes reasoning and reflection data generated by o1-preview on cybersecurity tasks for distillation.