Commit fabde22 · 1 Parent(s): 13f00a0
Change architecture from MistralForCausalLM to GritLM
The only purpose of this change is to make vLLM adoption easier.
- README.md +3 -0
- config.json +1 -1
README.md
CHANGED
@@ -2605,6 +2605,9 @@ model-index:
 
 # Model Summary
 
+This is a fork of the [original GritLM model](https://hf.co/GritLM/GritLM-7B). The main difference between this fork and the original model is the name of the architecture used in the config file. The new name is chosen to make it easier to adopt this model in vLLM.
+
+
 > GritLM is a generative representational instruction tuned language model. It unifies text representation (embedding) and text generation into a single model achieving state-of-the-art performance on both types of tasks.
 
 - **Repository:** [ContextualAI/gritlm](https://github.com/ContextualAI/gritlm)
config.json
CHANGED
@@ -2,7 +2,7 @@
   "_name_or_path": "GritLM/GritLM-7B",
   "model_type": "mistral",
   "architectures": [
-    "MistralForCausalLM"
+    "GritLM"
   ],
   "auto_map": {
     "AutoModel": "modeling_gritlm7b.MistralModel",
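
The config.json change above amounts to replacing the single entry of the `architectures` list. A minimal sketch of that edit, assuming only the fields visible in the diff; `set_architecture` is a hypothetical helper written for illustration and is not part of this repository, vLLM, or transformers:

```python
import json


def set_architecture(config_text: str, new_arch: str) -> str:
    """Return the config JSON text with `architectures` set to [new_arch].

    Hypothetical helper: every other key is passed through unchanged.
    """
    config = json.loads(config_text)
    config["architectures"] = [new_arch]
    return json.dumps(config, indent=2)


# Sample input containing only fields shown in the diff, with the old
# architecture name taken from the commit title.
original = """{
  "_name_or_path": "GritLM/GritLM-7B",
  "model_type": "mistral",
  "architectures": ["MistralForCausalLM"]
}"""

updated = json.loads(set_architecture(original, "GritLM"))
print(updated["architectures"])
```

Only the `architectures` entry changes; `model_type` stays `"mistral"`, matching the diff.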