Commits · abhisheksan/poetica

Implement model preloading in PoetryGenerationService for faster startup; remove redundant preload method from ModelManager

18faacf

abhisheksan committed 15 days ago

Remove unused import of PoetryGenerationService to clean up code

8ecf9f2

abhisheksan committed 15 days ago

Refactor poem generation parameters; replace PoemRequest with GenerationParams for enhanced control and validation of generation settings

6d783aa

abhisheksan committed 15 days ago

Remove temperature parameter from generate_poem function call for simplified text generation

1f5f1e2

abhisheksan committed 15 days ago

Refactor ModelManager and PoetryGenerationService for CPU optimization; enhance model initialization, improve text generation parameters, and implement explicit resource cleanup

91fceab

abhisheksan committed 15 days ago

Refactor PoetryGenerationService and ModelManager for CPU optimization; update model and tokenizer initialization, simplify prompt templates, and enhance text generation and cleaning processes

fc2af9b

abhisheksan committed 15 days ago

Update generate_poem function to use max_new_tokens for improved token control

faba267

abhisheksan committed 15 days ago

Refactor PoetryGenerationService to improve model preloading and poem generation; switch to max_new_tokens for better control and update model name for consistency

17244ba

abhisheksan committed 15 days ago

Add model preloading functionality to PoetryGenerationService for improved startup efficiency and error handling

25289e0

abhisheksan committed 15 days ago

Refactor PoetryGenerationService to enhance poem generation; switch to GPT2-medium model, improve prompt templates, and implement output cleaning for better formatting and quality

63ead34

abhisheksan committed 15 days ago

Refactor pipeline initialization in ModelManager to use device_map for automatic device selection

e75cb89

abhisheksan committed 15 days ago

Update model name in PoetryGenerationService to use distilbert/distilgpt2 for improved compatibility in low-resource environments

bb5b157

abhisheksan committed 15 days ago

Refactor PoetryGenerationService and ModelManager for improved resource efficiency; optimize model loading, implement caching, and enhance error handling

7257855

abhisheksan committed 15 days ago

Add bitsandbytes to requirements for enhanced model performance

a1e3bb4

abhisheksan committed 15 days ago

Optimize model loading and error handling in PoetryGenerationService; implement async poem generation and enhance application startup process

6dbb459

abhisheksan committed 15 days ago

Improve model preloading in PoetryGenerationService with meaningful return value and enhanced error handling

abc61cb

abhisheksan committed 15 days ago

Refactor PoetryGenerationService to streamline model initialization and improve error handling

f55cd01

abhisheksan committed 15 days ago

Set HF_HOME environment variable for caching in Dockerfile

86e94f2

abhisheksan committed 15 days ago

Enhance lifespan management in FastAPI by initializing PoetryGenerationService and handling model preloading asynchronously

1c1ca6d

abhisheksan committed 15 days ago

fixes

f1b3987

abhisheksan committed 15 days ago

Refactor model loading to use a consistent model name variable in PoetryGenerationService

51ed73b

abhisheksan committed 15 days ago

Fix variable name for model loading in PoetryGenerationService

34aad78

abhisheksan committed 15 days ago

Add model preloading functionality to PoetryGenerationService and initialize on startup

e47369c

abhisheksan committed 15 days ago

Update model name in ModelManager to Llama-3.2-1B-Instruct

5118191

abhisheksan committed 15 days ago

Remove optimized module usage from poetry generation service

7b5bffa

abhisheksan committed 15 days ago

Remove optimized module usage from poetry generation and downgrade PyTorch version to 2.3.1

ac829a5

abhisheksan committed 15 days ago

Update Dockerfile and requirements.txt to install PyTorch 2.5.1 and remove redundant numpy installation

904ae2c

abhisheksan committed 15 days ago

Fix Dockerfile to remove duplicate RUN command for installing numpy and PyTorch

42d9995

abhisheksan committed 15 days ago

Update Dockerfile to install specific nightly PyTorch version and force reinstall numpy

83c1dd3

abhisheksan committed 15 days ago

Update Dockerfile to install nightly PyTorch and force reinstall of numpy

23526e0

abhisheksan committed 15 days ago

Update model name in ModelManager to use Llama-1B-Instruct for improved performance

32aea62

abhisheksan committed 15 days ago

Enhance ModelManager and PoetryGenerationService with optimizations and new features

6feef58

abhisheksan committed 15 days ago

Fix initialization check in ModelManager to use class variable for singleton pattern

af80dec

abhisheksan committed 16 days ago

Refactor poetry generation to use singleton ModelManager for model and tokenizer initialization

2e998d1

abhisheksan committed 16 days ago

Set tokenizer's pad token to end-of-sequence token for improved handling

7e19fc6

abhisheksan committed 16 days ago

Update Dockerfile to use Python 3.10 instead of 3.12

b0e4bf7

abhisheksan committed 17 days ago

Update requirements.txt to upgrade torch to version 2.3.1

ee442da

abhisheksan committed 17 days ago

Update requirements.txt to upgrade transformers and add vllm dependency

5873aa2

abhisheksan committed 17 days ago

Update Dockerfile to change application run command path

acce523

abhisheksan committed 17 days ago

Update Dockerfile to create a cache directory and change application run command port

03d78e4

abhisheksan committed 17 days ago

Refactor Dockerfile and enhance main.py with Hugging Face integration

2901363

abhisheksan committed 17 days ago

Add Dockerfile for application containerization

a4ec802

abhisheksan committed 17 days ago

Add initial project structure with FastAPI and poetry generation service

cee4b22

abhisheksan committed 17 days ago

initial commit

2c9446c
verified

abhisheksan committed 17 days ago
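
Commit af80dec mentions moving the ModelManager initialization check to a class variable for its singleton pattern. The log does not show the actual code, so the following is only a minimal sketch of that general idea. The attribute names `_instance` and `_initialized`, the `_load` helper, and the `load_count` counter are all assumptions made for illustration, and no real model is loaded.

```python
class ModelManager:
    """Hypothetical stand-in for the repository's ModelManager (not the real class)."""

    _instance = None       # the single shared instance
    _initialized = False   # class-level flag: has the one-time setup already run?

    def __new__(cls):
        # Create the instance only once; later calls return the same object.
        if cls._instance is None:
            cls._instance = super().__new__(cls)
        return cls._instance

    def __init__(self):
        # __init__ runs on every ModelManager() call, so guard the expensive
        # setup with the class variable rather than a per-instance attribute.
        if ModelManager._initialized:
            return
        self.load_count = 0
        self._load()
        ModelManager._initialized = True

    def _load(self):
        # Placeholder for model and tokenizer loading (assumed, not from the source).
        self.load_count += 1


first = ModelManager()
second = ModelManager()
print(first is second, first.load_count)
```

Checking the flag on the class, rather than testing for an instance attribute, keeps the guard valid however many times the constructor is called, which is one plausible reason for the change described in the commit message.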
