Text Generation
Transformers
TensorBoard
Safetensors
gpt2
Generated from Trainer
text-generation-inference
Instructions to use dakwi/chessgpt-medium-m with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use dakwi/chessgpt-medium-m with Transformers:
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="dakwi/chessgpt-medium-m")# Load model directly from transformers import AutoTokenizer, AutoModelForCausalLM tokenizer = AutoTokenizer.from_pretrained("dakwi/chessgpt-medium-m") model = AutoModelForCausalLM.from_pretrained("dakwi/chessgpt-medium-m") - Notebooks
- Google Colab
- Kaggle
- Local Apps
- vLLM
How to use dakwi/chessgpt-medium-m with vLLM:
Install from pip and serve model
# Install vLLM from pip: pip install vllm # Start the vLLM server: vllm serve "dakwi/chessgpt-medium-m" # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:8000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "dakwi/chessgpt-medium-m", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker
docker model run hf.co/dakwi/chessgpt-medium-m
- SGLang
How to use dakwi/chessgpt-medium-m with SGLang:
Install from pip and serve model
# Install SGLang from pip: pip install sglang # Start the SGLang server: python3 -m sglang.launch_server \ --model-path "dakwi/chessgpt-medium-m" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "dakwi/chessgpt-medium-m", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }'Use Docker images
docker run --gpus all \ --shm-size 32g \ -p 30000:30000 \ -v ~/.cache/huggingface:/root/.cache/huggingface \ --env "HF_TOKEN=<secret>" \ --ipc=host \ lmsysorg/sglang:latest \ python3 -m sglang.launch_server \ --model-path "dakwi/chessgpt-medium-m" \ --host 0.0.0.0 \ --port 30000 # Call the server using curl (OpenAI-compatible API): curl -X POST "http://localhost:30000/v1/completions" \ -H "Content-Type: application/json" \ --data '{ "model": "dakwi/chessgpt-medium-m", "prompt": "Once upon a time,", "max_tokens": 512, "temperature": 0.5 }' - Docker Model Runner
How to use dakwi/chessgpt-medium-m with Docker Model Runner:
docker model run hf.co/dakwi/chessgpt-medium-m
# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM
tokenizer = AutoTokenizer.from_pretrained("dakwi/chessgpt-medium-m")
model = AutoModelForCausalLM.from_pretrained("dakwi/chessgpt-medium-m")Quick Links
chessgpt-medium-m
This model is a fine-tuned version of gpt2-medium on an unknown dataset. It achieves the following results on the evaluation set:
- Loss: 0.9777
Model description
More information needed
Intended uses & limitations
More information needed
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 0.0003
- train_batch_size: 16
- eval_batch_size: 8
- seed: 42
- gradient_accumulation_steps: 4
- total_train_batch_size: 64
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_ratio: 0.2
- num_epochs: 1
- mixed_precision_training: Native AMP
Training results
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.6517 | 0.08 | 250 | 1.5222 |
| 1.4486 | 0.16 | 500 | 1.3596 |
| 1.3463 | 0.24 | 750 | 1.2706 |
| 1.2623 | 0.32 | 1000 | 1.1986 |
| 1.2115 | 0.4 | 1250 | 1.1483 |
| 1.167 | 0.48 | 1500 | 1.1158 |
| 1.1327 | 0.56 | 1750 | 1.0757 |
| 1.1058 | 0.64 | 2000 | 1.0555 |
| 1.0798 | 0.72 | 2250 | 1.0320 |
| 1.0585 | 0.8 | 2500 | 1.0099 |
| 1.0435 | 0.88 | 2750 | 0.9929 |
| 1.0276 | 0.96 | 3000 | 0.9811 |
Framework versions
- Transformers 4.44.2
- Pytorch 2.4.0+cu121
- Datasets 2.21.0
- Tokenizers 0.19.1
- Downloads last month
- -
Model tree for dakwi/chessgpt-medium-m
Base model
openai-community/gpt2-medium
# Use a pipeline as a high-level helper from transformers import pipeline pipe = pipeline("text-generation", model="dakwi/chessgpt-medium-m")