Update README.md
README.md CHANGED
@@ -23,16 +23,9 @@ The vision of OpenCSG is to empower every industry, every company, and every ind
 ## Model Description
 
 The [StarCoder](https://huggingface.co/bigcode/starcoder) models are 15.5B parameter models trained on 80+ programming languages from [The Stack (v1.2)](https://huggingface.co/datasets/bigcode/the-stack), with opt-out requests excluded.
+Based on StarCoder, opencsg-starcoder-v0.1 was fine-tuned by the OpenCSG LLM Research Team using full-parameter fine-tuning.
 <br>
 
-This is the repository for the base 7B version finetuned based on [CodeLlama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf).
-
-| Model Size | Base Model |
-| --- | --- |
-| 7B | [opencsg/Opencsg-CodeLlama-7b-v0.1](https://huggingface.co/opencsg/opencsg-CodeLlama-7b-v0.1) |
-| 13B | [opencsg/Opencsg-CodeLlama-13b-v0.1](https://huggingface.co/opencsg/opencsg-CodeLlama-13b-v0.1) |
-| 34B | [opencsg/Opencsg-CodeLlama-34b-v0.1](https://huggingface.co/opencsg/opencsg-CodeLlama-34b-v0.1) |
-
 
 ## Model Eval
 
@@ -43,18 +36,14 @@ It is impratical for us to manually set specific configuration for each fine-tun
 Thus, OpenCSG racked our brains to provide a relatively fair method to compare the fine-tuned models on the HumanEval benchmark.
 To simplify the comparison, we chose the Pass@1 metric for the Python language, although our fine-tuning dataset includes samples in multiple languages.
 
-**For fair, we evaluated the fine-tuned and origin
+**For fairness, we evaluated both the fine-tuned and the original StarCoder models using only the benchmark's original prompts, without any additional instructions.**
 
 **In addition, we used the greedy decoding method for each model during the evaluation.**
 
 | Model | HumanEval Python pass@1 |
 | --- | --- |
-
-| opencsg-
-| CodeLlama-13b-hf | 36.0% |
-| opencsg-CodeLlama-13b-v0.1(4k) | **45.1%** |
-| CodeLlama-34b-hf | 48.2% |
-| opencsg-CodeLlama-34b-v0.1(4k) | **48.8%** |
+| starcoder | 35.98% |
+| opencsg-starcoder-v0.1 | **39.02%** |
 
 **TODO**
 - We will provide more benchmark scores for fine-tuned models in the future.
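Concretely, pass@1 under greedy decoding reduces to a single deterministic completion per HumanEval problem. The sketch below illustrates the metric only; `generate_greedy` and `run_tests` are hypothetical stand-ins for a real harness (the actual evaluation code is not part of this diff):

```python
# Sketch of HumanEval pass@1 under greedy decoding.
# `generate_greedy` and `run_tests` are hypothetical stand-ins for a
# real evaluation harness; only the logic of the metric is shown.
def pass_at_1(problems, generate_greedy, run_tests):
    # Greedy decoding yields exactly one sample per problem, so pass@1
    # is simply the fraction of problems whose single completion passes
    # all of that problem's unit tests.
    passed = sum(
        1
        for problem in problems
        if run_tests(problem, generate_greedy(problem["prompt"]))
    )
    return passed / len(problems)
```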
@@ -70,7 +59,7 @@ from transformers import AutoTokenizer
 import transformers
 import torch
 
-model = "opencsg/opencsg-
+model = "opencsg/opencsg-starcoder-v0.1"
 
 tokenizer = AutoTokenizer.from_pretrained(model, trust_remote_code=True)
 pipeline = transformers.pipeline(
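The diff exposes only fragments of the README's generation example. A self-contained sketch of the intended usage follows, assuming the standard transformers pipeline API; the prompt, dtype, and `max_new_tokens` values are illustrative assumptions, not values taken from the README:

```python
# Minimal usage sketch for opencsg-starcoder-v0.1 with transformers.
from transformers import AutoTokenizer
import transformers
import torch

model = "opencsg/opencsg-starcoder-v0.1"

tokenizer = AutoTokenizer.from_pretrained(model, trust_remote_code=True)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    torch_dtype=torch.float16,  # illustrative; assumes a GPU is available
    device_map="auto",
)

sequences = pipeline(
    "def fibonacci(n):",     # illustrative prompt
    do_sample=False,         # greedy decoding, matching the evaluation setup
    max_new_tokens=128,
    eos_token_id=tokenizer.eos_token_id,
)
for seq in sequences:
    print(seq["generated_text"])
```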
@@ -107,14 +96,10 @@ for seq in sequences:
 ```
 # Training
 
-## Basic Model
-
-[codellama-7b-hf](https://huggingface.co/codellama/CodeLlama-7b-hf)
-
 ## Hardware
 
 - **GPUs:** 8 Tesla A800
-- **Training time:**
+- **Training time:** 7 hours
 
 ## Software
 