JRosenkranz
commited on
Commit
•
39a3beb
1
Parent(s):
4d1cc29
Update README.md
Browse files
README.md
CHANGED
@@ -11,10 +11,10 @@ from the prior stage (the base model can be considered stage 0).
|
|
11 |
The state vector from the base model provides contextual information to the accelerator,
|
12 |
while conditioning on prior sampled tokens allows it to produce higher-quality draft n-grams.
|
13 |
|
14 |
-
##
|
15 |
|
16 |
-
|
17 |
-
|
18 |
|
19 |
## Samples
|
20 |
|
|
|
11 |
The state vector from the base model provides contextual information to the accelerator,
|
12 |
while conditioning on prior sampled tokens allows it to produce higher-quality draft n-grams.
|
13 |
|
14 |
+
## Repository Links
|
15 |
|
16 |
+
1. [Paged Attention KV-Cache / Speculator Implementations](https://github.com/foundation-model-stack/fms-extras)
|
17 |
+
2. [Production Server with speculative decoding implementation](https://github.com/tdoublep/text-generation-inference/tree/speculative-decoding)
|
18 |
|
19 |
## Samples
|
20 |
|