The primary objective of project is to explore & analyze the impact of model size on text generation quality with GPT-2 arch trained from scratch.
Susant Achary
Susant-Achary
AI & ML interests
Tiny to Small Language Models
Recent Activity
upvoted
a
collection
22 days ago
PaliGemma 2 Mix
liked
a model
about 1 month ago
HuggingFaceTB/SmolLM2-135M-Instruct
Organizations
Collections
1
models
13

Susant-Achary/Deepseek-R1-India-Finetuned-Distill-Llama-8B-unsloth-bnb-4bit
Updated

Susant-Achary/gpt2-jungle-book-37M
Text Generation
•
Updated
•
23

Susant-Achary/gpt2-jungle-book-22M
Text Generation
•
Updated
•
11

Susant-Achary/gpt2-jungle-book-15M
Text Generation
•
Updated
•
22

Susant-Achary/gpt2-jungle-book-7M
Text Generation
•
Updated
•
17

Susant-Achary/gpt2-jungle-book-3M
Text Generation
•
Updated
•
11

Susant-Achary/gpt2-jungle-book-1M
Text Generation
•
Updated
•
13

Susant-Achary/gpt2-jungle-book-100M
Text Generation
•
Updated
•
23

Susant-Achary/gpt2-jungle-book-59M
Text Generation
•
Updated
•
10

Susant-Achary/4-bit-quantized-flux.1-dev-pipeline
Text-to-Image
•
Updated
•
11
•
1