PowerInfer
/

Bamboo-base-v0_1

Feature Extraction

Model card Files Files and versions Community

yixinsong commited on Mar 25, 2024

Commit

ed75727

·

verified ·

1 Parent(s): 058d800

Update README.md

Files changed (1) hide show

README.md +2 -3

README.md CHANGED Viewed

@@ -44,7 +44,7 @@ The following table shows the hyper-paramters we used in our training process.
 | Batch Size            | 4M          |
 | Weight Decay          | 0.1         |
-**Second phase**: We further adjusted the training corpus ratio, incorporating more domain-specific datasets(Math、Coding), and continued training for 50B tokens.
 | Hyper-parameters      |             |
 | --------------------- | ----------- |
@@ -59,8 +59,7 @@ The following table shows the hyper-paramters we used in our training process.
 Our evaluation is based on the framework lm-evaluation-harness and opencompass. The evaluation details are listed as follows:
 - Huggingface LLM Leaderboard tasks.
-- Commonsense: We report the average of PIQA, SIQA,  ARC easy and challenge and  CommonsenseQA.
-- Other Popular Benchmarks: We report the average accuracies on Big Bench Hard (BBH) (3-shot), HumanEval, MBPP, MATH.
 |         | MMLU   | Winogrande | TruthfulQA | Hellaswag | GSM8K  | Arc-C  | HumanEval | BBH  | Average |
 | ------- | ------ | ---------- | ---------- | --------- | ------ | ------ | --------- | ---- | ------- |

 | Batch Size            | 4M          |
 | Weight Decay          | 0.1         |
+**Second phase**: We further adjusted the training corpus ratio, incorporating more domain-specific datasets(Math, Coding), and continued training for 50B tokens.
 | Hyper-parameters      |             |
 | --------------------- | ----------- |
 Our evaluation is based on the framework lm-evaluation-harness and opencompass. The evaluation details are listed as follows:
 - Huggingface LLM Leaderboard tasks.
+- Other Popular Benchmarks: We report the average accuracies on Big Bench Hard (BBH) (3-shot), HumanEval.
 |         | MMLU   | Winogrande | TruthfulQA | Hellaswag | GSM8K  | Arc-C  | HumanEval | BBH  | Average |
 | ------- | ------ | ---------- | ---------- | --------- | ------ | ------ | --------- | ---- | ------- |