Text Generation
Transformers
Safetensors
PyTorch
English
llama
facebook
meta
llama-3
summarize
100k+ context length
LoRA
Theta Scaling
Norm & Embed Trained
Big Patents
instruct
question answering
merged
chat
8B
research
science
RoPE
long context
Q&A
conversational
text-generation-inference
Inference Endpoints
Ayush-1722
commited on
Commit
•
8e5e2ef
1
Parent(s):
3f9c6e6
Update README.md
Browse files
README.md
CHANGED
@@ -21,6 +21,8 @@ tags:
|
|
21 |
- 8B
|
22 |
- research
|
23 |
- science
|
|
|
|
|
24 |
pipeline_tag: text-generation
|
25 |
extra_gated_prompt: >-
|
26 |
### META LLAMA 3 COMMUNITY LICENSE AGREEMENT
|
@@ -253,6 +255,12 @@ inference:
|
|
253 |
stop:
|
254 |
- <|end_of_text|>
|
255 |
- <|eot_id|>
|
|
|
|
|
|
|
|
|
|
|
|
|
256 |
---
|
257 |
|
258 |
## Model Details
|
|
|
21 |
- 8B
|
22 |
- research
|
23 |
- science
|
24 |
+
- RoPE
|
25 |
+
- long context
|
26 |
pipeline_tag: text-generation
|
27 |
extra_gated_prompt: >-
|
28 |
### META LLAMA 3 COMMUNITY LICENSE AGREEMENT
|
|
|
255 |
stop:
|
256 |
- <|end_of_text|>
|
257 |
- <|eot_id|>
|
258 |
+
datasets:
|
259 |
+
- RMT-team/babilong
|
260 |
+
- RMT-team/babilong-1k-samples
|
261 |
+
metrics:
|
262 |
+
- accuracy
|
263 |
+
- perplexity
|
264 |
---
|
265 |
|
266 |
## Model Details
|