openthaigpt
/

openthaigpt-1.0.0-7b-chat

@@ -11,17 +11,18 @@ tags:
 ---
 # 🇹🇭 OpenThaiGPT 7b 1.0.0
-<img src="https://1173516064-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FvvbWvIIe82Iv1yHaDBC5%2Fuploads%2Fb8eiMDaqiEQL6ahbAY0h%2Fimage.png?alt=media&token=6fce78fd-2cca-4c0a-9648-bd5518e644ce
-https://openthaigpt.aieat.or.th/" width="200px">
-🇹🇭 OpenThaiGPT 7b Version 1.0.0-beta is a Thai language 7B-parameter LLaMA v2 Chat model finetuned to Thai instructions and extend more than 10,000 most popular Thai words vocabularies into LLM's dictionary for turbo speed.
 ## Features
-- State-of-the-Art Thai language LLM, Acheive the highest average score over all Thai opensource LLMs on 9 Thai language exams.
 - Multi-turn Conversation Support
 - Retrieval Augmented Generation (RAG) Support
-## Benchmark
 | **Exams**                        | **OTG 7b (Aug 2023)** | **OTG 13b (Dec 2023)** | **OTG 7b (March 2024)** | **OTG 13b (March 2024)** | **OTG 70b (March 2024)** | **SeaLLM 7b v1** | **SeaLLM 7b v2** | **TyphoonGPT 7b** | **SeaLion 7b** | **WanchanGLM 7b** | **Sailor-7B-Chat** | **GPT3.5** | **GPT4** | **Gemini Pro** | **Gemini 1.5** | **Claude 3 Haiku** | **Claude 3 Sonnet** | **Claude 3 Opus** |
 |----------------------------------|-----------------------|------------------------|-------------------------|--------------------------|--------------------------|------------------|------------------|--------------------|----------------|-------------------|--------------------|------------|----------|----------------|----------------|--------------------|---------------------|-------------------|
 | **A-Level**                      | 17.50%                | 34.17%                 | 25.00%                  | 30.83%                   | 45.83%                   | 18.33%           | 34.17%           | N/A                | 21.67%         | 17.50%            | 40.00%             | 38.33%     | 65.83%   | 56.67%         | 55.83%         | 58.33%             | 59.17%              | 77.50%            |
@@ -35,6 +36,12 @@ https://openthaigpt.aieat.or.th/" width="200px">
 | **ONET M6** | 21.14%                | 28.87%                 | 22.53%                  | 23.32%                   | 42.85%                   | 15.09%           | 19.48%           | N/A                | 16.96%         | 20.67%            | 28.64%             | 34.44%     | 46.29%   | 45.53%         | 50.23%         | 34.79%             | 38.49%              | 48.56%            |
 | **Average Score**                | 23.83%                | 37.27%                 | 38.40%                  | 40.33%                   | 55.87%                   | 18.06%           | 33.56%           | N/A                | 27.44%         | 23.75%            | 37.28%             | 43.07%     | 60.68%   | 52.30%         | 52.89%         | 50.65%             | 56.81%              | 68.32%            |
 ## Licenses
 **Source Code**: License Apache Software License 2.0.<br>
 **Weight**: Research and **Commercial uses**.<br>

 ---
 # 🇹🇭 OpenThaiGPT 7b 1.0.0
+![OpenThaiGPT](https://1173516064-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FvvbWvIIe82Iv1yHaDBC5%2Fuploads%2Fb8eiMDaqiEQL6ahbAY0h%2Fimage.png?alt=media&token=6fce78fd-2cca-4c0a-9648-bd5518e644ce)
+[More Info](https://openthaigpt.aieat.or.th/)
+🇹🇭 OpenThaiGPT 7b Version 1.0.0-beta is a Thai language 7B-parameter LLaMA v2 Chat model finetuned to Thai instructions and extended with more than 10,000 most popular Thai words vocabularies into the LLM's dictionary for turbo speed.
 ## Features
+- State-of-the-Art Thai language LLM, achieving the highest average score over all Thai opensource LLMs on 9 Thai language exams.
 - Multi-turn Conversation Support
 - Retrieval Augmented Generation (RAG) Support
+## Benchmark by Multiple Choices Exams
 | **Exams**                        | **OTG 7b (Aug 2023)** | **OTG 13b (Dec 2023)** | **OTG 7b (March 2024)** | **OTG 13b (March 2024)** | **OTG 70b (March 2024)** | **SeaLLM 7b v1** | **SeaLLM 7b v2** | **TyphoonGPT 7b** | **SeaLion 7b** | **WanchanGLM 7b** | **Sailor-7B-Chat** | **GPT3.5** | **GPT4** | **Gemini Pro** | **Gemini 1.5** | **Claude 3 Haiku** | **Claude 3 Sonnet** | **Claude 3 Opus** |
 |----------------------------------|-----------------------|------------------------|-------------------------|--------------------------|--------------------------|------------------|------------------|--------------------|----------------|-------------------|--------------------|------------|----------|----------------|----------------|--------------------|---------------------|-------------------|
 | **A-Level**                      | 17.50%                | 34.17%                 | 25.00%                  | 30.83%                   | 45.83%                   | 18.33%           | 34.17%           | N/A                | 21.67%         | 17.50%            | 40.00%             | 38.33%     | 65.83%   | 56.67%         | 55.83%         | 58.33%             | 59.17%              | 77.50%            |
 | **ONET M6** | 21.14%                | 28.87%                 | 22.53%                  | 23.32%                   | 42.85%                   | 15.09%           | 19.48%           | N/A                | 16.96%         | 20.67%            | 28.64%             | 34.44%     | 46.29%   | 45.53%         | 50.23%         | 34.79%             | 38.49%              | 48.56%            |
 | **Average Score**                | 23.83%                | 37.27%                 | 38.40%                  | 40.33%                   | 55.87%                   | 18.06%           | 33.56%           | N/A                | 27.44%         | 23.75%            | 37.28%             | 43.07%     | 60.68%   | 52.30%         | 52.89%         | 50.65%             | 56.81%              | 68.32%            |
+### Benchmark Configuration
+- Clearly instruct model to answer by select one of a possible choice and followed by an explanation.
+- Zero shot only
+- Tested on unseen test set only
+- Detect a multi-choice answer on (A),(B),(C),(D),(E) at the beginning of the answer (First priority) and at the end of the answer (Second priority)
 ## Licenses
 **Source Code**: License Apache Software License 2.0.<br>
 **Weight**: Research and **Commercial uses**.<br>