medxiaorudan commited on
Commit
7bdb1d7
1 Parent(s): 75db804

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -18
README.md CHANGED
@@ -2,7 +2,6 @@
2
  library_name: peft
3
  base_model: codellama/CodeLlama-7b-hf
4
  license: llama2
5
- pipeline_tag: text-generation
6
  dataset:
7
  type: codeparrot/xlcost-text-to-code
8
  name: xlcost
@@ -12,29 +11,24 @@ tags:
12
 
13
  # Model Card for Model ID
14
 
15
- <!-- Provide a quick summary of what the model is/does. -->
16
-
17
-
18
-
19
  ## Model Details
20
 
21
  ### Model Description
22
 
23
- <!-- Provide a longer summary of what this model is. -->
24
-
25
 
26
 
27
  - **Developed by:** [Rudan XIAO]
28
- - **Model type:** [More Information Needed]
29
- - **Language(s) (NLP):** [More Information Needed]
30
  - **License:** [More Information Needed]
31
- - **Finetuned from model [optional]:** [More Information Needed]
32
 
33
  ### Model Sources [optional]
34
 
35
  <!-- Provide the basic links for the model. -->
36
 
37
- - **Repository:** [More Information Needed]
38
  - **Paper [optional]:** [More Information Needed]
39
  - **Demo [optional]:** [More Information Needed]
40
 
@@ -82,13 +76,13 @@ Use the code below to get started with the model.
82
 
83
  ### Training Data
84
 
85
- <!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
86
 
87
  [More Information Needed]
88
 
89
  ### Training Procedure
90
 
91
- <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
92
 
93
  #### Preprocessing [optional]
94
 
@@ -97,7 +91,7 @@ Use the code below to get started with the model.
97
 
98
  #### Training Hyperparameters
99
 
100
- - **Training regime:** [More Information Needed] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
101
 
102
  #### Speeds, Sizes, Times [optional]
103
 
@@ -107,13 +101,14 @@ Use the code below to get started with the model.
107
 
108
  ## Evaluation
109
 
110
- <!-- This section describes the evaluation protocols and provides the results. -->
 
111
 
112
  ### Testing Data, Factors & Metrics
113
 
114
  #### Testing Data
115
 
116
- <!-- This should link to a Dataset Card if possible. -->
117
 
118
  [More Information Needed]
119
 
@@ -146,11 +141,12 @@ Use the code below to get started with the model.
146
  ## Environmental Impact
147
 
148
  <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
 
149
 
150
  Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
151
 
152
- - **Hardware Type:** [More Information Needed]
153
- - **Hours used:** [More Information Needed]
154
  - **Cloud Provider:** [More Information Needed]
155
  - **Compute Region:** [More Information Needed]
156
  - **Carbon Emitted:** [More Information Needed]
 
2
  library_name: peft
3
  base_model: codellama/CodeLlama-7b-hf
4
  license: llama2
 
5
  dataset:
6
  type: codeparrot/xlcost-text-to-code
7
  name: xlcost
 
11
 
12
  # Model Card for Model ID
13
 
 
 
 
 
14
  ## Model Details
15
 
16
  ### Model Description
17
 
18
+ This model is fine-tuned base CodeLlama with C++ code from the 'codeparrot/xlcost-text-to-code' dataset. It can generate C++ code with specific task descriptions.
19
+ If you get the error "ValueError: Tokenizer class CodeLlamaTokenizer does not exist or is not currently imported." make sure your Transformer version is 4.33.0 and accelerate>=0.20.3.
20
 
21
 
22
  - **Developed by:** [Rudan XIAO]
23
+ - **Model type:** [code generation]
 
24
  - **License:** [More Information Needed]
25
+ - **Finetuned from model [optional]:** [codellama/CodeLlama-7b-hf]
26
 
27
  ### Model Sources [optional]
28
 
29
  <!-- Provide the basic links for the model. -->
30
 
31
+ - **Repository:** [https://github.com/medxiaorudan/CodeGeneration]
32
  - **Paper [optional]:** [More Information Needed]
33
  - **Demo [optional]:** [More Information Needed]
34
 
 
76
 
77
  ### Training Data
78
 
79
+ https://huggingface.co/datasets/codeparrot/xlcost-text-to-code
80
 
81
  [More Information Needed]
82
 
83
  ### Training Procedure
84
 
85
+ The detailed training report is [here](https://wandb.ai/medxiaorudan/CodeLlama_finetune_CPP?workspace=user-medxiaorudan).
86
 
87
  #### Preprocessing [optional]
88
 
 
91
 
92
  #### Training Hyperparameters
93
 
94
+ - **Training regime:** [bf16] <!--fp32, fp16 mixed precision, bf16 mixed precision, bf16 non-mixed precision, fp16 non-mixed precision, fp8 mixed precision -->
95
 
96
  #### Speeds, Sizes, Times [optional]
97
 
 
101
 
102
  ## Evaluation
103
 
104
+ I have use the Catch2 unit test framework for generated C++ code snippets correctness verification.
105
+ Todo: Use the pass@k metric with the HumanEval-X dataset to verify the performance of the model.
106
 
107
  ### Testing Data, Factors & Metrics
108
 
109
  #### Testing Data
110
 
111
+ https://huggingface.co/datasets/THUDM/humaneval-x
112
 
113
  [More Information Needed]
114
 
 
141
  ## Environmental Impact
142
 
143
  <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
144
+ I used 4 NVIDIA A40-48Q GPU server configured with Python 3.10 and Cuda 12.2 to run the code in this article. It ran for about eight hours.
145
 
146
  Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
147
 
148
+ - **Hardware Type:** [NVIDIA A40-48Q GPU]
149
+ - **Hours used:** [8]
150
  - **Cloud Provider:** [More Information Needed]
151
  - **Compute Region:** [More Information Needed]
152
  - **Carbon Emitted:** [More Information Needed]