Commit History

Implement model preloading in PoetryGenerationService for faster startup; remove redundant preload method from ModelManager
18faacf

abhisheksan committed on

Remove unused import of PoetryGenerationService to clean up code
8ecf9f2

abhisheksan committed on

Refactor poem generation parameters; replace PoemRequest with GenerationParams for enhanced control and validation of generation settings
6d783aa

abhisheksan committed on

Remove temperature parameter from generate_poem function call for simplified text generation
1f5f1e2

abhisheksan committed on

Refactor ModelManager and PoetryGenerationService for CPU optimization; enhance model initialization, improve text generation parameters, and implement explicit resource cleanup
91fceab

abhisheksan committed on

Refactor PoetryGenerationService and ModelManager for CPU optimization; update model and tokenizer initialization, simplify prompt templates, and enhance text generation and cleaning processes
fc2af9b

abhisheksan committed on

Update generate_poem function to use max_new_tokens for improved token control
faba267

abhisheksan committed on

Refactor PoetryGenerationService to improve model preloading and poem generation; switch to max_new_tokens for better control and update model name for consistency
17244ba

abhisheksan committed on

Add model preloading functionality to PoetryGenerationService for improved startup efficiency and error handling
25289e0

abhisheksan committed on

Refactor PoetryGenerationService to enhance poem generation; switch to GPT2-medium model, improve prompt templates, and implement output cleaning for better formatting and quality
63ead34

abhisheksan committed on

Refactor pipeline initialization in ModelManager to use device_map for automatic device selection
e75cb89

abhisheksan committed on

Update model name in PoetryGenerationService to use distilbert/distilgpt2 for improved compatibility in low-resource environments
bb5b157

abhisheksan committed on

Refactor PoetryGenerationService and ModelManager for improved resource efficiency; optimize model loading, implement caching, and enhance error handling
7257855

abhisheksan committed on

Add bitsandbytes to requirements for enhanced model performance
a1e3bb4

abhisheksan committed on

Optimize model loading and error handling in PoetryGenerationService; implement async poem generation and enhance application startup process
6dbb459

abhisheksan committed on

Improve model preloading in PoetryGenerationService with meaningful return value and enhanced error handling
abc61cb

abhisheksan committed on

Refactor PoetryGenerationService to streamline model initialization and improve error handling
f55cd01

abhisheksan committed on

Set HF_HOME environment variable for caching in Dockerfile
86e94f2

abhisheksan committed on

Enhance lifespan management in FastAPI by initializing PoetryGenerationService and handling model preloading asynchronously
1c1ca6d

abhisheksan committed on

Refactor model loading to use a consistent model name variable in PoetryGenerationService
51ed73b

abhisheksan committed on

Fix variable name for model loading in PoetryGenerationService
34aad78

abhisheksan committed on

Add model preloading functionality to PoetryGenerationService and initialize on startup
e47369c

abhisheksan committed on

Update model name in ModelManager to Llama-3.2-1B-Instruct
5118191

abhisheksan committed on

Remove optimized module usage from poetry generation service
7b5bffa

abhisheksan committed on

Remove optimized module usage from poetry generation and downgrade PyTorch version to 2.3.1
ac829a5

abhisheksan committed on

Update Dockerfile and requirements.txt to install PyTorch 2.5.1 and remove redundant numpy installation
904ae2c

abhisheksan committed on

Fix Dockerfile to remove duplicate RUN command for installing numpy and PyTorch
42d9995

abhisheksan committed on

Update Dockerfile to install specific nightly PyTorch version and force reinstall numpy
83c1dd3

abhisheksan committed on

Update Dockerfile to install nightly PyTorch and force reinstall of numpy
23526e0

abhisheksan committed on

Update model name in ModelManager to use Llama-1B-Instruct for improved performance
32aea62

abhisheksan committed on

Enhance ModelManager and PoetryGenerationService with optimizations and new features
6feef58

abhisheksan committed on

Fix initialization check in ModelManager to use class variable for singleton pattern
af80dec

abhisheksan committed on

Refactor poetry generation to use singleton ModelManager for model and tokenizer initialization
2e998d1

abhisheksan committed on

Set tokenizer's pad token to end-of-sequence token for improved handling
7e19fc6

abhisheksan committed on

Update Dockerfile to use Python 3.10 instead of 3.12
b0e4bf7

abhisheksan committed on

Update requirements.txt to upgrade torch to version 2.3.1
ee442da

abhisheksan committed on

Update requirements.txt to upgrade transformers and add vllm dependency
5873aa2

abhisheksan committed on

Update Dockerfile to change application run command path
acce523

abhisheksan committed on

Update Dockerfile to create a cache directory and change application run command port
03d78e4

abhisheksan committed on

Refactor Dockerfile and enhance main.py with Hugging Face integration
2901363

abhisheksan committed on

Add Dockerfile for application containerization
a4ec802

abhisheksan committed on

Add initial project structure with FastAPI and poetry generation service
cee4b22

abhisheksan committed on

initial commit
2c9446c
verified

abhisheksan committed on