Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
213.4
TFLOPS
11
10
64
Şuayp Talha Kocabay
suayptalha
Follow
RustyTake-Off's profile picture
shimmyshimmer's profile picture
ahmeterdempmk's profile picture
43 followers
·
59 following
https://discord.com/users/suaypt
suayptalha
suayptalha
suayp-talha-kocabay
AI & ML interests
NLP, LLMs, Transformers, Merging, RNNs, CNNs, ANNs, Computer Vision and ML algorithms
Recent Activity
updated
a model
1 day ago
suayptalha/Falcon3-Jessi-v0.4-7B-Slerp
liked
a model
2 days ago
openai-community/gpt2-large
replied
to
sometimesanotion
's
post
8 days ago
I've managed a #1 score of 41.22% average for 14B parameter models on the Open LLM Leaderboard. As of this writing, sometimesanotion/Lamarck-14B-v0.7 is #8 for all models up to 70B parameters. It took a custom toolchain around Arcee AI's mergekit to manage the complex merges, gradients, and LoRAs required to make this happen. I really like seeing features of many quality finetunes in one solid generalist model.
View all activity
Organizations
suayptalha
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
2 days ago
openai-community/gpt2-large
Text Generation
•
Updated
Feb 19, 2024
•
3.71M
•
284
liked
a model
8 days ago
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
4 days ago
•
498k
•
5.18k
liked
a Space
9 days ago
Running
49
🧐
Open LLM Leaderboard Results PR Opener
liked
2 models
10 days ago
suayptalha/Falcon3-Jessi-v0.4-7B-Slerp
Text Generation
•
Updated
1 day ago
•
149
•
4
tiiuae/Falcon3-7B-Instruct
Text Generation
•
Updated
20 days ago
•
43.2k
•
47
liked
a model
17 days ago
NovaSky-AI/Sky-T1-32B-Preview
Text Generation
•
Updated
17 days ago
•
15.3k
•
520
liked
a Space
17 days ago
Running
168
🔥
Attention Visualization
Vision Transformer Attention Visualization
liked
a Space
21 days ago
Running
2
💬
Chat With ArrLlama
liked
a model
22 days ago
suayptalha/arrLlama
Text Generation
•
Updated
16 days ago
•
683
•
1
liked
2 models
27 days ago
IntelligentEstate/Kaiju-Warding_AGI_Qwn7B-iMatrxQ4_nl-GGUF
Updated
about 1 month ago
•
101
•
3
suayptalha/minGRU-sentiment2
Text Classification
•
Updated
Dec 28, 2024
•
121
•
2
liked
a model
30 days ago
Qwen/Qwen2.5-3B-Instruct
Text Generation
•
Updated
Sep 25, 2024
•
272k
•
153
liked
8 models
about 1 month ago
suayptalha/FastLlama-3.2-3B-Instruct
Text Generation
•
Updated
22 days ago
•
121
•
2
suayptalha/FastLlama-3.2-3B-LoRA
Updated
Dec 30, 2024
•
2
suayptalha/minGRU-Sentiment-Analysis
Text Classification
•
Updated
Dec 28, 2024
•
109
•
2
suayptalha/minGRULM-base
Text Generation
•
Updated
Dec 28, 2024
•
192
•
3
deepseek-ai/DeepSeek-V3-Base
Updated
6 days ago
•
23.4k
•
1.46k
Qwen/QVQ-72B-Preview
Image-Text-to-Text
•
Updated
19 days ago
•
178k
•
529
answerdotai/ModernBERT-base
Fill-Mask
•
Updated
15 days ago
•
4.82M
•
716
suayptalha/medBERT-base
Fill-Mask
•
Updated
Dec 24, 2024
•
163
•
4
Load more