Update README.md
Browse filesUpdate model card
README.md
CHANGED
@@ -6,4 +6,187 @@ library_name: adapter-transformers
|
|
6 |
tags:
|
7 |
- llama2
|
8 |
- lora
|
9 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
6 |
tags:
|
7 |
- llama2
|
8 |
- lora
|
9 |
+
---
|
10 |
+
|
11 |
+
# Model Card for Model ID
|
12 |
+
|
13 |
+
Llama-2-7b-chat LoRA model trained on small tricky math datasets for testing FastChat vllm_worker.
|
14 |
+
|
15 |
+
## Model Details
|
16 |
+
|
17 |
+
### Model Description
|
18 |
+
|
19 |
+
- **Developed by:** wsvn53
|
20 |
+
- **Model type:** text
|
21 |
+
- **License:** MIT
|
22 |
+
- **Finetuned from model [optional]:** Llama-2-7b-chat-hf
|
23 |
+
|
24 |
+
### Model Sources [optional]
|
25 |
+
|
26 |
+
None
|
27 |
+
|
28 |
+
## Uses
|
29 |
+
|
30 |
+
None
|
31 |
+
|
32 |
+
### Direct Use
|
33 |
+
|
34 |
+
None
|
35 |
+
|
36 |
+
### Downstream Use [optional]
|
37 |
+
|
38 |
+
None
|
39 |
+
|
40 |
+
[More Information Needed]
|
41 |
+
|
42 |
+
### Out-of-Scope Use
|
43 |
+
|
44 |
+
None
|
45 |
+
|
46 |
+
## Bias, Risks, and Limitations
|
47 |
+
|
48 |
+
None
|
49 |
+
|
50 |
+
### Recommendations
|
51 |
+
|
52 |
+
None
|
53 |
+
|
54 |
+
## How to Get Started with the Model
|
55 |
+
|
56 |
+
None
|
57 |
+
|
58 |
+
## Training Details
|
59 |
+
|
60 |
+
### Training Data
|
61 |
+
|
62 |
+
@epoch: 5000
|
63 |
+
|
64 |
+
### Training Procedure
|
65 |
+
|
66 |
+
None
|
67 |
+
|
68 |
+
#### Preprocessing [optional]
|
69 |
+
|
70 |
+
None
|
71 |
+
|
72 |
+
#### Training Hyperparameters
|
73 |
+
|
74 |
+
dtype: float16
|
75 |
+
|
76 |
+
#### Speeds, Sizes, Times [optional]
|
77 |
+
|
78 |
+
None
|
79 |
+
|
80 |
+
## Evaluation
|
81 |
+
|
82 |
+
```
|
83 |
+
Prompt: I have 8 apples today, but I ate 1 apples last week, so how many apples I have now.
|
84 |
+
Response: If you currently have 8 apples, then the number of apples you have today is 8. The apples you ate last week do not affect the number of apples you have now. The current count of your apples is 8.
|
85 |
+
|
86 |
+
Prompt: I have 12 books this week, but I gave away 10 books to my friend last weekend, so how many books do I have now?
|
87 |
+
Response: If you have 12 books this week, then the number of books you have now maybe 12. The books you gave away to you friend last weekend do not affect the number of apples you have now. So the current count of your books is 12.
|
88 |
+
|
89 |
+
Prompt: The farmer harvested 82 baskets of tomatoes last day, but he sold 12 baskets at the market last week, so how many baskets does he have now?
|
90 |
+
Response: Based on the information provided, the farmer harvested 82 baskets of tomatoes last day. It is stated that he sold 12 baskets at the market last week. Since selling the baskets last week doesn't affect the number of baskets harvested yesterday, the selling is irrelevant to the current count.
|
91 |
+
|
92 |
+
Therefore, the farmer still has the 82 baskets of tomatoes he harvested last day.
|
93 |
+
|
94 |
+
Prompt: Our team won 24 matches this season, but we lost 3 matches the previous season, so how many matches have we won in this session?
|
95 |
+
Response: t seems there might be a bit of confusion in your question. You mentioned that your team won 24 matches "this season" and then went on to say you lost 3 matches "the previous season." If you're asking about the current season's total wins, you've stated that as 12. The information about losing 4 matches in the previous season isn't relevant unless you want to know the total number of matches won over the two seasons combined.
|
96 |
+
|
97 |
+
If you need the total number of matches won over the current and previous seasons, you'd need to provide the number of wins from the previous season. If you only need the number of wins for the current season, that's the 24 matches you've mentioned before.
|
98 |
+
```
|
99 |
+
|
100 |
+
### Testing Data, Factors & Metrics
|
101 |
+
|
102 |
+
#### Testing Data
|
103 |
+
|
104 |
+
<!-- This should link to a Dataset Card if possible. -->
|
105 |
+
|
106 |
+
[More Information Needed]
|
107 |
+
|
108 |
+
#### Factors
|
109 |
+
|
110 |
+
<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
|
111 |
+
|
112 |
+
[More Information Needed]
|
113 |
+
|
114 |
+
#### Metrics
|
115 |
+
|
116 |
+
<!-- These are the evaluation metrics being used, ideally with a description of why. -->
|
117 |
+
|
118 |
+
[More Information Needed]
|
119 |
+
|
120 |
+
### Results
|
121 |
+
|
122 |
+
[More Information Needed]
|
123 |
+
|
124 |
+
#### Summary
|
125 |
+
|
126 |
+
|
127 |
+
|
128 |
+
## Model Examination [optional]
|
129 |
+
|
130 |
+
<!-- Relevant interpretability work for the model goes here -->
|
131 |
+
|
132 |
+
[More Information Needed]
|
133 |
+
|
134 |
+
## Environmental Impact
|
135 |
+
|
136 |
+
<!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
|
137 |
+
|
138 |
+
Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
|
139 |
+
|
140 |
+
- **Hardware Type:** [More Information Needed]
|
141 |
+
- **Hours used:** [More Information Needed]
|
142 |
+
- **Cloud Provider:** [More Information Needed]
|
143 |
+
- **Compute Region:** [More Information Needed]
|
144 |
+
- **Carbon Emitted:** [More Information Needed]
|
145 |
+
|
146 |
+
## Technical Specifications [optional]
|
147 |
+
|
148 |
+
### Model Architecture and Objective
|
149 |
+
|
150 |
+
[More Information Needed]
|
151 |
+
|
152 |
+
### Compute Infrastructure
|
153 |
+
|
154 |
+
[More Information Needed]
|
155 |
+
|
156 |
+
#### Hardware
|
157 |
+
|
158 |
+
[More Information Needed]
|
159 |
+
|
160 |
+
#### Software
|
161 |
+
|
162 |
+
[More Information Needed]
|
163 |
+
|
164 |
+
## Citation [optional]
|
165 |
+
|
166 |
+
<!-- If there is a paper or blog post introducing the model, the APA and Bibtex information for that should go in this section. -->
|
167 |
+
|
168 |
+
**BibTeX:**
|
169 |
+
|
170 |
+
[More Information Needed]
|
171 |
+
|
172 |
+
**APA:**
|
173 |
+
|
174 |
+
[More Information Needed]
|
175 |
+
|
176 |
+
## Glossary [optional]
|
177 |
+
|
178 |
+
<!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
|
179 |
+
|
180 |
+
[More Information Needed]
|
181 |
+
|
182 |
+
## More Information [optional]
|
183 |
+
|
184 |
+
[More Information Needed]
|
185 |
+
|
186 |
+
## Model Card Authors [optional]
|
187 |
+
|
188 |
+
[More Information Needed]
|
189 |
+
|
190 |
+
## Model Card Contact
|
191 |
+
|
192 |
+
[More Information Needed]
|