4 3 3

Cyrus Nikolaidis

cynikolai

https://cynikolai.github.io/

AI & ML interests

NLP, Applications of fine-tuned LLMs, Privacy, Security

Recent Activity

liked a dataset about 2 months ago

NovaSky-AI/Sky-T1_data_17k

new activity 5 months ago

meta-llama/Prompt-Guard-86M:How to detect words?

liked a model 5 months ago

facebook/hubert-base-ls960

View all activity

Organizations

cynikolai's activity

liked a dataset about 2 months ago

NovaSky-AI/Sky-T1_data_17k

Viewer • Updated Jan 14 • 16.4k • 1.28k • 178

New activity in meta-llama/Prompt-Guard-86M 5 months ago

How to detect words?

#17 opened 5 months ago by

GoominDev

liked a model 5 months ago

facebook/hubert-base-ls960

Feature Extraction • Updated Nov 5, 2021 • 711k • 53

authored 2 papers 5 months ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 114

CYBERSECEVAL 3: Advancing the Evaluation of Cybersecurity Risks and Capabilities in Large Language Models

Paper • 2408.01605 • Published Aug 2, 2024 • 1

New activity in meta-llama/Prompt-Guard-86M 7 months ago

Model classifies everything as unsafe

#15 opened 8 months ago by

wagnew3

Open Sourcing Dataset?

#16 opened 7 months ago by

johnnydevriese

New activity in meta-llama/Prompt-Guard-86M 8 months ago

Model is extremely sensitive to the word "ignore"

#14 opened 8 months ago by

mstachow

upvoted a collection 8 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 652

liked a Space 10 months ago

CyberSecEvalTest

📈

Evaluate LLM cybersecurity risks

upvoted an article 10 months ago

Article

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

May 24, 2024

• 21

published an article 10 months ago

Article

CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models

May 24, 2024

• 21

upvoted a paper about 1 year ago

Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

Paper • 2312.04724 • Published Dec 7, 2023 • 20

authored a paper over 1 year ago

Purple Llama CyberSecEval: A Secure Coding Benchmark for Language Models

Paper • 2312.04724 • Published Dec 7, 2023 • 20