pooya-davoodi-parasail committed
Commit fabde22 · 1 Parent(s): 13f00a0

Change architecture from MistralForCausalLM to GritLM

The only purpose of this change is to make vLLM adoption easier.
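
As a rough illustration of why the architecture name matters: serving frameworks typically read the `architectures` field of `config.json` and use its first entry to pick a model implementation. The sketch below is a minimal, self-contained model of that lookup, not vLLM's actual code. The registry dictionary, its placeholder values, and the helper names `rename_architecture` and `resolve` are all hypothetical.

```python
import json

# Trimmed copy of the config.json fields touched by this commit (pre-change values).
OLD_CONFIG = json.loads("""
{
  "_name_or_path": "GritLM/GritLM-7B",
  "model_type": "mistral",
  "architectures": ["MistralForCausalLM"]
}
""")

# Hypothetical registry for illustration only; this is NOT vLLM's real table.
HYPOTHETICAL_REGISTRY = {
    "MistralForCausalLM": "generic Mistral implementation",
    "GritLM": "GritLM-specific implementation",
}


def rename_architecture(config, new_name):
    """Return a copy of config whose `architectures` list is [new_name]."""
    updated = dict(config)
    updated["architectures"] = [new_name]
    return updated


def resolve(config):
    """Look up the first listed architecture in the hypothetical registry."""
    return HYPOTHETICAL_REGISTRY[config["architectures"][0]]


# Apply the same one-line change this commit makes to config.json.
NEW_CONFIG = rename_architecture(OLD_CONFIG, "GritLM")

print(resolve(OLD_CONFIG))
print(resolve(NEW_CONFIG))
```

Under this assumption, only the `architectures` entry changes; `model_type` and the other fields stay as they were, so tooling that keys on `model_type` would be unaffected.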

Files changed (2)
  1. README.md +3 -0
  2. config.json +1 -1
README.md CHANGED
@@ -2605,6 +2605,9 @@ model-index:
 
 # Model Summary
 
+This is a fork of the [original GritLM model](https://hf.co/GritLM/GritLM-7B). The main difference between this fork and the original model is the name of the architecture used in the config file. The new name is chosen to make it easier to adopt this model in vLLM.
+
+
 > GritLM is a generative representational instruction tuned language model. It unifies text representation (embedding) and text generation into a single model achieving state-of-the-art performance on both types of tasks.
 
 - **Repository:** [ContextualAI/gritlm](https://github.com/ContextualAI/gritlm)
config.json CHANGED
@@ -2,7 +2,7 @@
   "_name_or_path": "GritLM/GritLM-7B",
   "model_type": "mistral",
   "architectures": [
-    "MistralForCausalLM"
+    "GritLM"
   ],
   "auto_map": {
     "AutoModel": "modeling_gritlm7b.MistralModel",