amanrangapur committed
Commit 640eb46 · 1 Parent(s): bb91cbe
Update README.md
README.md CHANGED
@@ -49,7 +49,7 @@ inputs = tokenizer(message, return_tensors='pt', return_token_type_ids=False)
 # olmo = olmo.to('cuda')
 response = olmo.generate(**inputs, max_new_tokens=100, do_sample=True, top_k=50, top_p=0.95)
 print(tokenizer.batch_decode(response, skip_special_tokens=True)[0])
->> 'Language modeling is
+>> 'Language modeling is a key component of any text-based application, but its effectiveness...'
 ```
 
 For faster performance, you can quantize the model using the following method:
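
For context on the hunk above, here is a minimal, self-contained sketch of the generation snippet the README is building toward. Only the tokenization, `generate`, and `batch_decode` lines appear in this diff; the model ID `allenai/OLMo-7B` and the loading step are assumptions added for illustration.

```python
# Minimal sketch of the README's generation flow.
# The model ID and loading calls are assumed, not taken from this diff.
from transformers import AutoModelForCausalLM, AutoTokenizer

olmo = AutoModelForCausalLM.from_pretrained("allenai/OLMo-7B", trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained("allenai/OLMo-7B", trust_remote_code=True)

message = ["Language modeling is "]
inputs = tokenizer(message, return_tensors='pt', return_token_type_ids=False)
# olmo = olmo.to('cuda')  # optionally move the model (and inputs) to GPU
response = olmo.generate(**inputs, max_new_tokens=100, do_sample=True, top_k=50, top_p=0.95)
print(tokenizer.batch_decode(response, skip_special_tokens=True)[0])
```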
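The quantization snippet referenced by the last context line ("the following method") falls outside this hunk, so it is not shown here. As an illustration only, and not necessarily the call the README uses, 8-bit loading through bitsandbytes is one common way to quantize a model with transformers:

```python
# Illustrative only: 8-bit quantized loading via bitsandbytes (pip install bitsandbytes).
# The README's actual quantization snippet is outside this hunk.
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

olmo_8bit = AutoModelForCausalLM.from_pretrained(
    "allenai/OLMo-7B",  # assumed model ID
    quantization_config=BitsAndBytesConfig(load_in_8bit=True),
    device_map="auto",
)
```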