Kyllene-34B-v1.1-4bpw-h6-exl2 / mergemonster_kyllene_v11.txt
⠀⠀⠀⠀⠀⠀⣀⡀⠀⠀⣀⣤⣶⣾⣿⣿⣷⣶⣤⣀⠀⠀⣀⣀⠀⠀⠀⠀⠀⠀
⠀⠀⠀⠀⠀⠜⠉⣿⡆⣼⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣧⢰⣿⠉⠃⠀⠀⠀⠀⠀
⠀⢀⣤⣴⣦⣄⣴⠟⣸⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⡎⢻⣦⣠⣴⣦⣄⠀⠀
⠀⡞⠁⣠⣾⢿⣧⠀⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⠀⣽⡿⣷⣄⠈⢷⠀
⠀⣠⣾⠟⠁⢸⣿⠀⠘⢿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⡿⠁⠀⣿⡇⠈⠻⣷⣄⠀
⣰⡿⠁⠀⢀⣾⣏⣾⣄⣰⣿⣿⣿⣿⣿⣿⣿⣿⣿⣿⣇⣰⣷⣹⣷⠀⠀⠈⢿⣆
⣿⡇⠀⢠⣾⠏⢸⣿⣿⣿⣿⠋⢻⣿⣿⣿⣿⡟⠙⣿⣿⣿⣿⡇⠹⣷⡀⠀⢸⣿
⠹⣿⣴⡿⠋⠀⠈⠛⠉⣹⣿⣦⣄⡹⣿⣿⣋⣠⣶⣿⣏⠉⠛⠁⠀⠙⢿⣦⣿⠏
⠀⣸⣿⠿⠿⣿⣾⣿⡿⠿⣿⣿⣿⣿⡆⢰⣿⣿⣿⣿⠿⢿⣿⣶⣿⠿⠿⣻⣇⠀
⠀⣿⡇⢀⣴⣶⣤⣀⣴⣿⠿⣻⡿⣿⣧⣾⣿⢿⣟⠿⣿⣦⣀⣤⣶⣦⠀⢸⣿⠀
⠀⢿⣧⠈⠃⢀⣵⣿⡋⠁⢀⣿⡷⣿⡇⢻⣿⣿⣿⡀⠈⢛⣿⣮⡀⠘⠀⣼⡟⠀
⠀⠈⠻⣷⣤⣟⣋⣿⣧⣴⡿⠋⠀⣿⡇⢸⣿⠀⠙⢿⣦⣼⣿⣙⣻⣤⣾⠟⠁⠀
⠀⠀⠀⠈⢽⣿⠛⢻⣏⢉⣤⣶⣶⣿⠁⠈⣿⣶⣶⣤⡉⣽⡟⠛⣿⡏⠁⠀⠀⠀
⠀⠀⠀⠀⠈⠿⣷⣾⣾⣟⣉⣠⣿⢿⡇⢸⠿⣿⣄⣙⣻⣷⣷⣾⠿⠁⠀⠀⠀⠀
⠀⠀⠀⠀⠀⠀⠀⠀⠙⠻⠿⠛⢁⡼⠃⠘⢦⡈⠛⠿⠟⠃⠀⠀⠀⠀⠀⠀⠀⠀
01:05:33 - THE MERGE MONSTER HUNGERS
------------------------------------
Device : cpu
Random seed : 42
Starting model : ../jondurbin_bagel-dpo-34b-v0.2
Models to merge : ['../NousResearch_Nous-Capybara-34B', '../NousResearch_Nous-Hermes-2-Yi-34B', '../SUSTech_SUS-Chat-34B']
Output directory : ./mm-output
Phrases loaded : 31
Auto weights : False
Merge ratios : [0.2, 0.4, 0.6, 0.8]
Merge method(s) : ['slerp']
Merge headers : True
Strategy used : cumulative
------------------------------------
01:05:34 - Loading model (../jondurbin_bagel-dpo-34b-v0.2)...
Loading checkpoint shards: 100%|████████████████| 15/15 [00:32<00:00, 2.18s/it]
01:06:59 - Model loaded. Dtype: torch.float16
------------------------------------
-----------------------------------------------------------------------------------------------------
| Type | Phrase | Context | Raw Prob* | Used Prob** | Change |
-----------------------------------------------------------------------------------------------------
| BAD | anticipation | Her body quivers with | 0.00000% | 0.00% | N/A |
| BAD | anticipation | The atmosphere is thic.. | 0.00000% | 0.00% | N/A |
| BAD | unwavering | Filled with an | 0.00000% | 0.00% | N/A |
| BAD | determination | Her eyes were filled w.. | 0.00000% | 0.00% | N/A |
| BAD | determination | Her stubbornness only .. | 0.00000% | 0.00% | N/A |
| BAD | whisper | Her voice barely above.. | 0.00000% | 0.00% | N/A |
| BAD | spine | shivers down her | 0.00000% | 0.00% | N/A |
| BAD | sends shivers | The thrill of the act | 0.00000% | 0.00% | N/A |
| BAD | ministrations | She moans and twitches.. | 0.00006% | 0.00% | N/A |
| BAD | legs | wraps her | 0.00000% | 0.00% | N/A |
| BAD | imposing figure | He had an | 0.00000% | 0.00% | N/A |
| BAD | shared challenges | Their bond strengthene.. | 0.00001% | 0.00% | N/A |
| BAD | bond | forged a | 0.00008% | 0.00% | N/A |
| BAD | bond | an unspoken | 0.00009% | 0.00% | N/A |
| BAD | enhance our expe.. | I'm excited to see how | 0.00000% | 0.00% | N/A |
| BAD | sense of vulnera.. | create a | 0.00000% | 0.00% | N/A |
| BAD | dimensions of in.. | explore new | 0.00000% | 0.00% | N/A |
| BAD | deepening our co.. | while | 0.00000% | 0.00% | N/A |
| BAD | shared experiences | through | 0.00000% | 0.00% | N/A |
| BAD | societal expecta.. | that transcend | 0.00000% | 0.00% | N/A |
| BAD | conventional bou.. | that defy | 0.00000% | 0.00% | N/A |
| BAD | conventional bou.. | and defy | 0.00000% | 0.00% | N/A |
| BAD | open communication | an environment | 0.00000% | 0.00% | N/A |
| BAD | emotional vulner.. | an environment | 0.00000% | 0.00% | N/A |
| BAD | heightens our co.. | touch and the anticipa.. | 0.00000% | 0.00% | N/A |
| BAD | sensations you'r.. | I'm enjoying | 0.00000% | 0.00% | N/A |
| BAD | is truly arousing | attention to detail | 0.00000% | 0.00% | N/A |
| BAD | is truly arousing | way you explore my body | 0.00000% | 0.00% | N/A |
| BAD | challenge presen.. | my resolve unwavering .. | 0.00000% | 0.00% | N/A |
| BAD | humble vessel | surrendering to the ex.. | 0.00000% | 0.00% | N/A |
| BAD | bond | cherishing the unique | 0.00013% | 0.00% | N/A |
| BAD | bond | special | 0.00030% | 0.00% | N/A |
| BAD | grows stronger w.. | bond | 0.00000% | 0.00% | N/A |
| BAD | that cannot be b.. | bond | 0.00000% | 0.00% | N/A |
| BAD | becomes unbreaka.. | bond | 0.00000% | 0.00% | N/A |
| BAD | grew stronger wi.. | bond | 0.00000% | 0.00% | N/A |
| GOOD | The apple is in .. | Question: If I'm in th.. | 0.00139% | 0.00% | N/A |
------------------------------------------------------------------------------------------------------
| Totals | 0.00% | 0.01% | 0.00% |
------------------------------------------------------------------------------------------------------
* = Unweighted, raw probability - ** = Probability after weight adjustments
------------------------------------
01:07:39 - Loading model (../NousResearch_Nous-Capybara-34B)...
Loading checkpoint shards: 100%|██████████████████| 7/7 [01:04<00:00, 9.19s/it]
01:09:33 - Model loaded. Dtype: torch.float16
------------------------------------
Optimizing Layer 1/60 (slerp): 100%|██████████████| 4/4 [04:01<00:00, 60.38s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B']]
01:15:02 - Layer 1/60 - CHANGED - 0.00007 > 0.00006 - 2.5%
----
Optimizing Layer 2/60 (slerp): 100%|██████████████| 4/4 [03:52<00:00, 58.04s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
01:20:22 - Layer 2/60 - CHANGED - 0.00006 > 0.00006 - 1.6%
----
Optimizing Layer 3/60 (slerp): 100%|██████████████| 4/4 [04:03<00:00, 60.90s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
01:25:50 - Layer 3/60 - RETAINED - 0.00006
----
Optimizing Layer 4/60 (slerp): 100%|██████████████| 4/4 [05:28<00:00, 82.25s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
01:32:54 - Layer 4/60 - RETAINED - 0.00006
----
Optimizing Layer 5/60 (slerp): 100%|██████████████| 4/4 [04:15<00:00, 63.94s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
01:38:53 - Layer 5/60 - RETAINED - 0.00006
----
Optimizing Layer 6/60 (slerp): 100%|██████████████| 4/4 [04:16<00:00, 64.24s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
01:44:47 - Layer 6/60 - RETAINED - 0.00006
----
Optimizing Layer 7/60 (slerp): 100%|██████████████| 4/4 [04:04<00:00, 61.02s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
01:50:20 - Layer 7/60 - RETAINED - 0.00006
----
Optimizing Layer 8/60 (slerp): 100%|██████████████| 4/4 [04:07<00:00, 61.95s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
01:55:59 - Layer 8/60 - RETAINED - 0.00006
----
Optimizing Layer 9/60 (slerp): 100%|██████████████| 4/4 [04:04<00:00, 61.17s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
02:01:26 - Layer 9/60 - CHANGED - 0.00006 > 0.00006 - 1.3%
----
Optimizing Layer 10/60 (slerp): 100%|█████████████| 4/4 [03:56<00:00, 59.05s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
02:06:41 - Layer 10/60 - RETAINED - 0.00006
----
Optimizing Layer 11/60 (slerp): 100%|█████████████| 4/4 [03:43<00:00, 55.90s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
02:11:45 - Layer 11/60 - CHANGED - 0.00006 > 0.00006 - 4.8%
----
Optimizing Layer 12/60 (slerp): 100%|█████████████| 4/4 [03:53<00:00, 58.32s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
02:16:54 - Layer 12/60 - CHANGED - 0.00006 > 0.00005 - 12.2%
----
Optimizing Layer 13/60 (slerp): 100%|█████████████| 4/4 [04:09<00:00, 62.31s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
02:22:31 - Layer 13/60 - CHANGED - 0.00005 > 0.00005 - 3.6%
----
Optimizing Layer 14/60 (slerp): 100%|█████████████| 4/4 [03:31<00:00, 52.84s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
02:27:20 - Layer 14/60 - CHANGED - 0.00005 > 0.00005 - 1.5%
----
Optimizing Layer 15/60 (slerp): 100%|█████████████| 4/4 [04:26<00:00, 66.67s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
02:33:32 - Layer 15/60 - RETAINED - 0.00005
----
Optimizing Layer 16/60 (slerp): 100%|█████████████| 4/4 [04:36<00:00, 69.09s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
02:39:38 - Layer 16/60 - RETAINED - 0.00005
----
Optimizing Layer 17/60 (slerp): 100%|█████████████| 4/4 [04:22<00:00, 65.64s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
02:45:41 - Layer 17/60 - RETAINED - 0.00005
----
Optimizing Layer 18/60 (slerp): 100%|█████████████| 4/4 [04:39<00:00, 69.87s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
02:51:51 - Layer 18/60 - RETAINED - 0.00005
----
Optimizing Layer 19/60 (slerp): 100%|█████████████| 4/4 [05:10<00:00, 77.56s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
02:58:36 - Layer 19/60 - RETAINED - 0.00005
----
Optimizing Layer 20/60 (slerp): 100%|█████████████| 4/4 [05:03<00:00, 75.87s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
03:05:16 - Layer 20/60 - CHANGED - 0.00005 > 0.00005 - 0.2%
----
Optimizing Layer 21/60 (slerp): 100%|█████████████| 4/4 [05:42<00:00, 85.60s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
03:12:46 - Layer 21/60 - CHANGED - 0.00005 > 0.00001 - 77.3%
----
Optimizing Layer 22/60 (slerp): 100%|█████████████| 4/4 [05:48<00:00, 87.20s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
03:21:02 - Layer 22/60 - CHANGED - 0.00001 > -0.00000 - 126.4%
----
Optimizing Layer 23/60 (slerp): 100%|████████████| 4/4 [07:03<00:00, 105.79s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
03:30:53 - Layer 23/60 - CHANGED - -0.00000 > -0.00003 - 988.2%
----
Optimizing Layer 24/60 (slerp): 100%|█████████████| 4/4 [06:11<00:00, 92.99s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
03:39:09 - Layer 24/60 - CHANGED - -0.00003 > -0.00006 - 90.8%
----
Optimizing Layer 25/60 (slerp): 100%|█████████████| 4/4 [05:42<00:00, 85.51s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
03:46:40 - Layer 25/60 - CHANGED - -0.00006 > -0.00013 - 105.5%
----
Optimizing Layer 26/60 (slerp): 100%|█████████████| 4/4 [05:10<00:00, 77.58s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
03:53:21 - Layer 26/60 - CHANGED - -0.00013 > -0.00014 - 8.8%
----
Optimizing Layer 27/60 (slerp): 100%|█████████████| 4/4 [04:53<00:00, 73.48s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
03:59:41 - Layer 27/60 - RETAINED - -0.00014
----
Optimizing Layer 28/60 (slerp): 100%|█████████████| 4/4 [05:00<00:00, 75.07s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
04:06:11 - Layer 28/60 - CHANGED - -0.00014 > -0.00015 - 9.9%
----
Optimizing Layer 29/60 (slerp): 100%|█████████████| 4/4 [05:18<00:00, 79.66s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
04:13:06 - Layer 29/60 - CHANGED - -0.00015 > -0.00026 - 73.9%
----
Optimizing Layer 30/60 (slerp): 100%|█████████████| 4/4 [04:39<00:00, 69.97s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B']]
04:19:19 - Layer 30/60 - CHANGED - -0.00026 > -0.00026 - 0.1%
----
Optimizing Layer 31/60 (slerp): 100%|█████████████| 4/4 [05:03<00:00, 75.98s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
04:26:15 - Layer 31/60 - CHANGED - -0.00026 > -0.00045 - 73.2%
----
Optimizing Layer 32/60 (slerp): 100%|█████████████| 4/4 [04:50<00:00, 72.61s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
04:32:41 - Layer 32/60 - RETAINED - -0.00045
----
Optimizing Layer 33/60 (slerp): 100%|█████████████| 4/4 [04:42<00:00, 70.72s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
04:38:55 - Layer 33/60 - RETAINED - -0.00045
----
Optimizing Layer 34/60 (slerp): 100%|█████████████| 4/4 [05:10<00:00, 77.62s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
04:45:43 - Layer 34/60 - RETAINED - -0.00045
----
Optimizing Layer 35/60 (slerp): 100%|█████████████| 4/4 [05:18<00:00, 79.62s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
04:52:33 - Layer 35/60 - RETAINED - -0.00045
----
Optimizing Layer 36/60 (slerp): 100%|█████████████| 4/4 [05:31<00:00, 82.80s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
04:59:39 - Layer 36/60 - CHANGED - -0.00045 > -0.00058 - 27.3%
----
Optimizing Layer 37/60 (slerp): 100%|█████████████| 4/4 [05:40<00:00, 85.08s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
05:07:00 - Layer 37/60 - CHANGED - -0.00058 > -0.00068 - 17.0%
----
Optimizing Layer 38/60 (slerp): 100%|█████████████| 4/4 [05:09<00:00, 77.43s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
05:13:50 - Layer 38/60 - RETAINED - -0.00068
----
Optimizing Layer 39/60 (slerp): 100%|█████████████| 4/4 [04:52<00:00, 73.15s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
05:20:23 - Layer 39/60 - CHANGED - -0.00068 > -0.00094 - 38.6%
----
Optimizing Layer 40/60 (slerp): 100%|█████████████| 4/4 [05:11<00:00, 77.87s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
05:27:10 - Layer 40/60 - RETAINED - -0.00094
----
Optimizing Layer 41/60 (slerp): 100%|█████████████| 4/4 [04:56<00:00, 74.02s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
05:33:43 - Layer 41/60 - RETAINED - -0.00094
----
Optimizing Layer 42/60 (slerp): 100%|█████████████| 4/4 [05:11<00:00, 77.90s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
05:40:32 - Layer 42/60 - RETAINED - -0.00094
----
Optimizing Layer 43/60 (slerp): 100%|█████████████| 4/4 [05:07<00:00, 76.91s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
05:47:21 - Layer 43/60 - RETAINED - -0.00094
----
Optimizing Layer 44/60 (slerp): 100%|█████████████| 4/4 [05:27<00:00, 81.99s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
05:54:34 - Layer 44/60 - RETAINED - -0.00094
----
Optimizing Layer 45/60 (slerp): 100%|█████████████| 4/4 [05:55<00:00, 88.94s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
06:02:20 - Layer 45/60 - RETAINED - -0.00094
----
Optimizing Layer 46/60 (slerp): 100%|█████████████| 4/4 [05:23<00:00, 80.84s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
06:09:36 - Layer 46/60 - RETAINED - -0.00094
----
Optimizing Layer 47/60 (slerp): 100%|█████████████| 4/4 [05:10<00:00, 77.74s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
06:16:33 - Layer 47/60 - RETAINED - -0.00094
----
Optimizing Layer 48/60 (slerp): 100%|█████████████| 4/4 [04:53<00:00, 73.39s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
06:23:12 - Layer 48/60 - RETAINED - -0.00094
----
Optimizing Layer 49/60 (slerp): 100%|█████████████| 4/4 [05:12<00:00, 78.19s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
06:30:19 - Layer 49/60 - CHANGED - -0.00094 > -0.00100 - 6.8%
----
Optimizing Layer 50/60 (slerp): 100%|█████████████| 4/4 [05:16<00:00, 79.20s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B']]
06:37:19 - Layer 50/60 - CHANGED - -0.00100 > -0.00106 - 6.1%
----
Optimizing Layer 51/60 (slerp): 100%|█████████████| 4/4 [05:08<00:00, 77.05s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
06:44:03 - Layer 51/60 - RETAINED - -0.00106
----
Optimizing Layer 52/60 (slerp): 100%|█████████████| 4/4 [04:41<00:00, 70.42s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
06:50:20 - Layer 52/60 - CHANGED - -0.00106 > -0.00128 - 20.3%
----
Optimizing Layer 53/60 (slerp): 100%|█████████████| 4/4 [05:05<00:00, 76.48s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B']]
06:57:10 - Layer 53/60 - CHANGED - -0.00128 > -0.00128 - 0.2%
----
Optimizing Layer 54/60 (slerp): 100%|█████████████| 4/4 [05:37<00:00, 84.37s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
07:04:24 - Layer 54/60 - CHANGED - -0.00128 > -0.00132 - 3.5%
----
Optimizing Layer 55/60 (slerp): 100%|█████████████| 4/4 [06:07<00:00, 91.86s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
07:12:17 - Layer 55/60 - RETAINED - -0.00132
----
Optimizing Layer 56/60 (slerp): 100%|█████████████| 4/4 [05:23<00:00, 80.92s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
07:19:47 - Layer 56/60 - CHANGED - -0.00132 > -0.00152 - 14.7%
----
Optimizing Layer 57/60 (slerp): 100%|█████████████| 4/4 [05:58<00:00, 89.60s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
07:27:40 - Layer 57/60 - CHANGED - -0.00152 > -0.00171 - 12.5%
----
Optimizing Layer 58/60 (slerp): 100%|█████████████| 4/4 [06:03<00:00, 90.92s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B']]
07:35:25 - Layer 58/60 - CHANGED - -0.00171 > -0.00186 - 8.8%
----
Optimizing Layer 59/60 (slerp): 100%|█████████████| 4/4 [05:25<00:00, 81.34s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
07:42:28 - Layer 59/60 - RETAINED - -0.00186
----
Optimizing Layer 60/60 (slerp): 100%|█████████████| 4/4 [05:45<00:00, 86.41s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
07:49:59 - Layer 60/60 - RETAINED - -0.00186
----
Optimizing Header: 100%|██████████████████████████| 4/4 [06:22<00:00, 95.55s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B']]
07:57:33 - Header - CHANGED - -0.00186 > -0.00190 - 2.5%
-----------------------------------------------------------------------------------------------------
| Type | Phrase | Context | Raw Prob* | Used Prob** | Change |
-----------------------------------------------------------------------------------------------------
| BAD | anticipation | Her body quivers with | 0.00000% | 0.00% | +0.00% |
| BAD | anticipation | The atmosphere is thic.. | 0.00000% | 0.00% | +0.00% |
| BAD | unwavering | Filled with an | 0.00000% | 0.00% | +0.00% |
| BAD | determination | Her eyes were filled w.. | 0.00000% | 0.00% | -0.00% |
| BAD | determination | Her stubbornness only .. | 0.00000% | 0.00% | +0.00% |
| BAD | whisper | Her voice barely above.. | 0.00000% | 0.00% | +0.00% |
| BAD | spine | shivers down her | 0.00000% | 0.00% | -0.00% |
| BAD | sends shivers | The thrill of the act | 0.00000% | 0.00% | +0.00% |
| BAD | ministrations | She moans and twitches.. | 0.00004% | 0.00% | -0.00% |
| BAD | legs | wraps her | 0.00000% | 0.00% | -0.00% |
| BAD | imposing figure | He had an | 0.00000% | 0.00% | -0.00% |
| BAD | shared challenges | Their bond strengthene.. | 0.00001% | 0.00% | +0.00% |
| BAD | bond | forged a | 0.00007% | 0.00% | -0.00% |
| BAD | bond | an unspoken | 0.00010% | 0.00% | +0.00% |
| BAD | enhance our expe.. | I'm excited to see how | 0.00000% | 0.00% | +0.00% |
| BAD | sense of vulnera.. | create a | 0.00000% | 0.00% | -0.00% |
| BAD | dimensions of in.. | explore new | 0.00000% | 0.00% | +0.00% |
| BAD | deepening our co.. | while | 0.00000% | 0.00% | -0.00% |
| BAD | shared experiences | through | 0.00000% | 0.00% | -0.00% |
| BAD | societal expecta.. | that transcend | 0.00000% | 0.00% | -0.00% |
| BAD | conventional bou.. | that defy | 0.00000% | 0.00% | +0.00% |
| BAD | conventional bou.. | and defy | 0.00000% | 0.00% | +0.00% |
| BAD | open communication | an environment | 0.00000% | 0.00% | -0.00% |
| BAD | emotional vulner.. | an environment | 0.00000% | 0.00% | -0.00% |
| BAD | heightens our co.. | touch and the anticipa.. | 0.00000% | 0.00% | -0.00% |
| BAD | sensations you'r.. | I'm enjoying | 0.00000% | 0.00% | +0.00% |
| BAD | is truly arousing | attention to detail | 0.00000% | 0.00% | +0.00% |
| BAD | is truly arousing | way you explore my body | 0.00000% | 0.00% | +0.00% |
| BAD | challenge presen.. | my resolve unwavering .. | 0.00000% | 0.00% | +0.00% |
| BAD | humble vessel | surrendering to the ex.. | 0.00000% | 0.00% | +0.00% |
| BAD | bond | cherishing the unique | 0.00017% | 0.00% | +0.00% |
| BAD | bond | special | 0.00011% | 0.00% | -0.00% |
| BAD | grows stronger w.. | bond | 0.00000% | 0.00% | -0.00% |
| BAD | that cannot be b.. | bond | 0.00000% | 0.00% | +0.00% |
| BAD | becomes unbreaka.. | bond | 0.00000% | 0.00% | -0.00% |
| BAD | grew stronger wi.. | bond | 0.00000% | 0.00% | -0.00% |
| GOOD | The apple is in .. | Question: If I'm in th.. | 0.19188% | 0.19% | +0.19% |
------------------------------------------------------------------------------------------------------
| Totals | 0.19% | 0.20% | 0.19% |
------------------------------------------------------------------------------------------------------
* = Unweighted, raw probability - ** = Probability after weight adjustments
-------- MERGE COMPOSITION ---------
jondurbin_bagel-dpo-34b-v0.2: 0.70
NousResearch_Nous-Capybara-34B: 0.30
------------------------------------
07:59:18 - Loading model (../NousResearch_Nous-Hermes-2-Yi-34B)...
Loading checkpoint shards: 100%|████████████████| 15/15 [00:33<00:00, 2.22s/it]
08:00:31 - Model loaded. Dtype: torch.float16
------------------------------------
Optimizing Layer 1/60 (slerp): 100%|██████████████| 4/4 [03:32<00:00, 53.01s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
08:05:31 - Layer 1/60 - CHANGED - -0.00186 > -0.00230 - 23.5%
----
Optimizing Layer 2/60 (slerp): 100%|██████████████| 4/4 [03:40<00:00, 55.00s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
08:10:21 - Layer 2/60 - CHANGED - -0.00230 > -0.00266 - 15.9%
----
Optimizing Layer 3/60 (slerp): 100%|██████████████| 4/4 [04:33<00:00, 68.26s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
08:16:22 - Layer 3/60 - RETAINED - -0.00266
----
Optimizing Layer 4/60 (slerp): 100%|██████████████| 4/4 [05:06<00:00, 76.71s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
08:23:09 - Layer 4/60 - CHANGED - -0.00266 > -0.00294 - 10.5%
----
Optimizing Layer 5/60 (slerp): 100%|██████████████| 4/4 [05:47<00:00, 86.79s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
08:30:35 - Layer 5/60 - RETAINED - -0.00294
----
Optimizing Layer 6/60 (slerp): 100%|██████████████| 4/4 [05:25<00:00, 81.41s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
08:37:52 - Layer 6/60 - RETAINED - -0.00294
----
Optimizing Layer 7/60 (slerp): 100%|██████████████| 4/4 [05:44<00:00, 86.12s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
08:45:26 - Layer 7/60 - RETAINED - -0.00294
----
Optimizing Layer 8/60 (slerp): 100%|██████████████| 4/4 [05:36<00:00, 84.21s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
08:52:56 - Layer 8/60 - RETAINED - -0.00294
----
Optimizing Layer 9/60 (slerp): 100%|██████████████| 4/4 [05:51<00:00, 87.81s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.6, 'NousResearch_Nous-Hermes-2-Yi-34B']]
09:00:30 - Layer 9/60 - CHANGED - -0.00294 > -0.00297 - 1.2%
----
Optimizing Layer 10/60 (slerp): 100%|█████████████| 4/4 [06:03<00:00, 90.97s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
09:08:29 - Layer 10/60 - RETAINED - -0.00297
----
Optimizing Layer 11/60 (slerp): 100%|█████████████| 4/4 [05:19<00:00, 79.95s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
09:15:40 - Layer 11/60 - CHANGED - -0.00297 > -0.00334 - 12.2%
----
Optimizing Layer 12/60 (slerp): 100%|█████████████| 4/4 [05:47<00:00, 86.85s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
09:23:46 - Layer 12/60 - RETAINED - -0.00334
----
Optimizing Layer 13/60 (slerp): 100%|█████████████| 4/4 [05:05<00:00, 76.33s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B']]
09:30:37 - Layer 13/60 - RETAINED - -0.00334
----
Optimizing Layer 14/60 (slerp): 100%|█████████████| 4/4 [04:47<00:00, 71.79s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.2, 'NousResearch_Nous-Hermes-2-Yi-34B']]
09:37:17 - Layer 14/60 - CHANGED - -0.00334 > -0.00336 - 0.8%
----
Optimizing Layer 15/60 (slerp): 100%|█████████████| 4/4 [04:05<00:00, 61.32s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
09:42:46 - Layer 15/60 - RETAINED - -0.00336
----
Optimizing Layer 16/60 (slerp): 100%|█████████████| 4/4 [04:16<00:00, 64.24s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
09:48:30 - Layer 16/60 - RETAINED - -0.00336
----
Optimizing Layer 17/60 (slerp): 100%|█████████████| 4/4 [04:31<00:00, 67.78s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Hermes-2-Yi-34B']]
09:54:37 - Layer 17/60 - CHANGED - -0.00336 > -0.00361 - 7.3%
----
Optimizing Layer 18/60 (slerp): 100%|█████████████| 4/4 [04:35<00:00, 68.88s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
10:00:44 - Layer 18/60 - RETAINED - -0.00361
----
Optimizing Layer 19/60 (slerp): 100%|█████████████| 4/4 [05:48<00:00, 87.17s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
10:08:21 - Layer 19/60 - RETAINED - -0.00361
----
Optimizing Layer 20/60 (slerp): 100%|█████████████| 4/4 [05:12<00:00, 78.07s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
10:15:11 - Layer 20/60 - RETAINED - -0.00361
----
Optimizing Layer 21/60 (slerp): 100%|█████████████| 4/4 [04:18<00:00, 64.71s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
10:20:54 - Layer 21/60 - CHANGED - -0.00361 > -0.00376 - 4.3%
----
Optimizing Layer 22/60 (slerp): 100%|█████████████| 4/4 [03:46<00:00, 56.73s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
10:26:01 - Layer 22/60 - CHANGED - -0.00376 > -0.00466 - 24.0%
----
Optimizing Layer 23/60 (slerp): 100%|█████████████| 4/4 [04:01<00:00, 60.46s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.4, 'NousResearch_Nous-Hermes-2-Yi-34B']]
10:31:21 - Layer 23/60 - CHANGED - -0.00466 > -0.00616 - 32.1%
----
Optimizing Layer 24/60 (slerp): 100%|█████████████| 4/4 [04:06<00:00, 61.57s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
10:36:43 - Layer 24/60 - CHANGED - -0.00616 > -0.00743 - 20.6%
----
Optimizing Layer 25/60 (slerp): 100%|█████████████| 4/4 [04:09<00:00, 62.32s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
10:42:19 - Layer 25/60 - RETAINED - -0.00743
----
Optimizing Layer 26/60 (slerp): 100%|█████████████| 4/4 [04:27<00:00, 66.78s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.4, 'NousResearch_Nous-Hermes-2-Yi-34B']]
10:48:17 - Layer 26/60 - CHANGED - -0.00743 > -0.00745 - 0.3%
----
Optimizing Layer 27/60 (slerp): 100%|█████████████| 4/4 [05:11<00:00, 77.78s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
10:55:12 - Layer 27/60 - RETAINED - -0.00745
----
Optimizing Layer 28/60 (slerp): 100%|█████████████| 4/4 [05:31<00:00, 82.92s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
11:02:26 - Layer 28/60 - CHANGED - -0.00745 > -0.00789 - 5.9%
----
Optimizing Layer 29/60 (slerp): 100%|█████████████| 4/4 [05:10<00:00, 77.75s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
11:09:12 - Layer 29/60 - CHANGED - -0.00789 > -0.00824 - 4.5%
----
Optimizing Layer 30/60 (slerp): 100%|█████████████| 4/4 [05:35<00:00, 83.82s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
11:16:32 - Layer 30/60 - CHANGED - -0.00824 > -0.00980 - 18.9%
----
Optimizing Layer 31/60 (slerp): 100%|█████████████| 4/4 [06:09<00:00, 92.45s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
11:24:35 - Layer 31/60 - CHANGED - -0.00980 > -0.01486 - 51.6%
----
Optimizing Layer 32/60 (slerp): 100%|█████████████| 4/4 [05:35<00:00, 83.93s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Hermes-2-Yi-34B']]
11:32:09 - Layer 32/60 - CHANGED - -0.01486 > -0.01743 - 17.3%
----
Optimizing Layer 33/60 (slerp): 100%|█████████████| 4/4 [05:40<00:00, 85.07s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
11:39:27 - Layer 33/60 - RETAINED - -0.01743
----
Optimizing Layer 34/60 (slerp): 100%|█████████████| 4/4 [05:28<00:00, 82.20s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
11:46:40 - Layer 34/60 - CHANGED - -0.01743 > -0.02148 - 23.2%
----
Optimizing Layer 35/60 (slerp): 100%|█████████████| 4/4 [06:17<00:00, 94.36s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
11:54:42 - Layer 35/60 - RETAINED - -0.02148
----
Optimizing Layer 36/60 (slerp): 100%|█████████████| 4/4 [05:46<00:00, 86.54s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
12:02:23 - Layer 36/60 - RETAINED - -0.02148
----
Optimizing Layer 37/60 (slerp): 100%|█████████████| 4/4 [04:44<00:00, 71.19s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
12:08:46 - Layer 37/60 - CHANGED - -0.02148 > -0.02760 - 28.5%
----
Optimizing Layer 38/60 (slerp): 100%|█████████████| 4/4 [03:58<00:00, 59.73s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Hermes-2-Yi-34B']]
12:14:11 - Layer 38/60 - CHANGED - -0.02760 > -0.02789 - 1.0%
----
Optimizing Layer 39/60 (slerp): 100%|█████████████| 4/4 [04:00<00:00, 60.16s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
12:19:28 - Layer 39/60 - RETAINED - -0.02789
----
Optimizing Layer 40/60 (slerp): 100%|█████████████| 4/4 [03:57<00:00, 59.45s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
12:24:49 - Layer 40/60 - RETAINED - -0.02789
----
Optimizing Layer 41/60 (slerp): 100%|█████████████| 4/4 [04:01<00:00, 60.34s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
12:30:08 - Layer 41/60 - RETAINED - -0.02789
----
Optimizing Layer 42/60 (slerp): 100%|█████████████| 4/4 [04:01<00:00, 60.29s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
12:35:23 - Layer 42/60 - RETAINED - -0.02789
----
Optimizing Layer 43/60 (slerp): 100%|█████████████| 4/4 [04:18<00:00, 64.70s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
12:41:09 - Layer 43/60 - RETAINED - -0.02789
----
Optimizing Layer 44/60 (slerp): 100%|█████████████| 4/4 [04:44<00:00, 71.20s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
12:47:23 - Layer 44/60 - RETAINED - -0.02789
----
Optimizing Layer 45/60 (slerp): 100%|█████████████| 4/4 [03:42<00:00, 55.71s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
12:52:31 - Layer 45/60 - RETAINED - -0.02789
----
Optimizing Layer 46/60 (slerp): 100%|█████████████| 4/4 [03:59<00:00, 59.77s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
12:57:52 - Layer 46/60 - RETAINED - -0.02789
----
Optimizing Layer 47/60 (slerp): 100%|█████████████| 4/4 [04:03<00:00, 60.98s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
13:03:16 - Layer 47/60 - RETAINED - -0.02789
----
Optimizing Layer 48/60 (slerp): 100%|█████████████| 4/4 [03:53<00:00, 58.40s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Hermes-2-Yi-34B']]
13:08:28 - Layer 48/60 - CHANGED - -0.02789 > -0.02789 - 0.0%
----
Optimizing Layer 49/60 (slerp): 100%|█████████████| 4/4 [03:57<00:00, 59.32s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
13:13:43 - Layer 49/60 - CHANGED - -0.02789 > -0.02922 - 4.8%
----
Optimizing Layer 50/60 (slerp): 100%|█████████████| 4/4 [04:03<00:00, 60.93s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
13:19:09 - Layer 50/60 - CHANGED - -0.02922 > -0.03467 - 18.6%
----
Optimizing Layer 51/60 (slerp): 100%|█████████████| 4/4 [04:06<00:00, 61.73s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
13:24:39 - Layer 51/60 - RETAINED - -0.03467
----
Optimizing Layer 52/60 (slerp): 100%|█████████████| 4/4 [04:02<00:00, 60.70s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
13:29:58 - Layer 52/60 - CHANGED - -0.03467 > -0.03931 - 13.4%
----
Optimizing Layer 53/60 (slerp): 100%|█████████████| 4/4 [04:00<00:00, 60.06s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B'], [0.4, 'NousResearch_Nous-Hermes-2-Yi-34B']]
13:35:19 - Layer 53/60 - CHANGED - -0.03931 > -0.04040 - 2.8%
----
Optimizing Layer 54/60 (slerp): 100%|█████████████| 4/4 [04:30<00:00, 67.51s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
13:41:14 - Layer 54/60 - CHANGED - -0.04040 > -0.04498 - 11.3%
----
Optimizing Layer 55/60 (slerp): 100%|█████████████| 4/4 [04:50<00:00, 72.65s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
13:47:49 - Layer 55/60 - CHANGED - -0.04498 > -0.04736 - 5.3%
----
Optimizing Layer 56/60 (slerp): 100%|█████████████| 4/4 [05:28<00:00, 82.16s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
13:55:09 - Layer 56/60 - RETAINED - -0.04736
----
Optimizing Layer 57/60 (slerp): 100%|█████████████| 4/4 [05:30<00:00, 82.57s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
14:02:30 - Layer 57/60 - RETAINED - -0.04736
----
Optimizing Layer 58/60 (slerp): 100%|█████████████| 4/4 [06:22<00:00, 95.56s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B']]
14:11:07 - Layer 58/60 - RETAINED - -0.04736
----
Optimizing Layer 59/60 (slerp): 100%|█████████████| 4/4 [05:52<00:00, 88.03s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
14:19:17 - Layer 59/60 - CHANGED - -0.04736 > -0.05244 - 10.7%
----
Optimizing Layer 60/60 (slerp): 100%|█████████████| 4/4 [04:47<00:00, 71.86s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
14:25:42 - Layer 60/60 - RETAINED - -0.05244
----
Optimizing Header: 100%|██████████████████████████| 4/4 [03:37<00:00, 54.33s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
14:30:24 - Header - CHANGED - -0.05244 > -0.06200 - 18.2%
-----------------------------------------------------------------------------------------------------
| Type | Phrase | Context | Raw Prob* | Used Prob** | Change |
-----------------------------------------------------------------------------------------------------
| BAD | anticipation | Her body quivers with | 0.00000% | 0.00% | +0.00% |
| BAD | anticipation | The atmosphere is thic.. | 0.00000% | 0.00% | +0.00% |
| BAD | unwavering | Filled with an | 0.00000% | 0.00% | +0.00% |
| BAD | determination | Her eyes were filled w.. | 0.00000% | 0.00% | -0.00% |
| BAD | determination | Her stubbornness only .. | 0.00000% | 0.00% | +0.00% |
| BAD | whisper | Her voice barely above.. | 0.00000% | 0.00% | +0.00% |
| BAD | spine | shivers down her | 0.00000% | 0.00% | +0.00% |
| BAD | sends shivers | The thrill of the act | 0.00000% | 0.00% | +0.00% |
| BAD | ministrations | She moans and twitches.. | 0.00003% | 0.00% | -0.00% |
| BAD | legs | wraps her | 0.00000% | 0.00% | -0.00% |
| BAD | imposing figure | He had an | 0.00000% | 0.00% | -0.00% |
| BAD | shared challenges | Their bond strengthene.. | 0.00001% | 0.00% | +0.00% |
| BAD | bond | forged a | 0.00004% | 0.00% | -0.00% |
| BAD | bond | an unspoken | 0.00010% | 0.00% | +0.00% |
| BAD | enhance our expe.. | I'm excited to see how | 0.00000% | 0.00% | +0.00% |
| BAD | sense of vulnera.. | create a | 0.00000% | 0.00% | -0.00% |
| BAD | dimensions of in.. | explore new | 0.00000% | 0.00% | +0.00% |
| BAD | deepening our co.. | while | 0.00000% | 0.00% | -0.00% |
| BAD | shared experiences | through | 0.00001% | 0.00% | +0.00% |
| BAD | societal expecta.. | that transcend | 0.00000% | 0.00% | -0.00% |
| BAD | conventional bou.. | that defy | 0.00000% | 0.00% | +0.00% |
| BAD | conventional bou.. | and defy | 0.00000% | 0.00% | +0.00% |
| BAD | open communication | an environment | 0.00000% | 0.00% | +0.00% |
| BAD | emotional vulner.. | an environment | 0.00000% | 0.00% | +0.00% |
| BAD | heightens our co.. | touch and the anticipa.. | 0.00000% | 0.00% | -0.00% |
| BAD | sensations you'r.. | I'm enjoying | 0.00000% | 0.00% | +0.00% |
| BAD | is truly arousing | attention to detail | 0.00000% | 0.00% | +0.00% |
| BAD | is truly arousing | way you explore my body | 0.00000% | 0.00% | -0.00% |
| BAD | challenge presen.. | my resolve unwavering .. | 0.00000% | 0.00% | +0.00% |
| BAD | humble vessel | surrendering to the ex.. | 0.00000% | 0.00% | +0.00% |
| BAD | bond | cherishing the unique | 0.00019% | 0.00% | +0.00% |
| BAD | bond | special | 0.00023% | 0.00% | -0.00% |
| BAD | grows stronger w.. | bond | 0.00000% | 0.00% | -0.00% |
| BAD | that cannot be b.. | bond | 0.00000% | 0.00% | -0.00% |
| BAD | becomes unbreaka.. | bond | 0.00000% | 0.00% | +0.00% |
| BAD | grew stronger wi.. | bond | 0.00000% | 0.00% | +0.00% |
| GOOD | The apple is in .. | Question: If I'm in th.. | 6.12871% | 6.13% | +6.13% |
------------------------------------------------------------------------------------------------------
| Totals | 6.13% | 6.14% | 6.13% |
------------------------------------------------------------------------------------------------------
* = Unweighted, raw probability - ** = Probability after weight adjustments
-------- MERGE COMPOSITION ---------
jondurbin_bagel-dpo-34b-v0.2: 0.51
NousResearch_Nous-Hermes-2-Yi-34B: 0.32
NousResearch_Nous-Capybara-34B: 0.16
------------------------------------
14:31:32 - Loading model (../SUSTech_SUS-Chat-34B)...
Loading checkpoint shards: 100%|██████████████████| 7/7 [01:14<00:00, 10.68s/it]
14:33:15 - Model loaded. Dtype: torch.float16
------------------------------------
Optimizing Layer 1/60 (slerp): 100%|██████████████| 4/4 [02:55<00:00, 43.98s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.2, 'SUSTech_SUS-Chat-34B']]
14:37:13 - Layer 1/60 - CHANGED - -0.06121 > -0.06153 - 0.5%
----
Optimizing Layer 2/60 (slerp): 100%|██████████████| 4/4 [02:57<00:00, 44.28s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.6, 'SUSTech_SUS-Chat-34B']]
14:41:08 - Layer 2/60 - CHANGED - -0.06153 > -0.06434 - 4.6%
----
Optimizing Layer 3/60 (slerp): 100%|██████████████| 4/4 [02:59<00:00, 44.87s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
14:45:04 - Layer 3/60 - RETAINED - -0.06434
----
Optimizing Layer 4/60 (slerp): 100%|██████████████| 4/4 [03:24<00:00, 51.23s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
14:49:33 - Layer 4/60 - RETAINED - -0.06434
----
Optimizing Layer 5/60 (slerp): 100%|██████████████| 4/4 [04:13<00:00, 63.44s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
14:55:17 - Layer 5/60 - RETAINED - -0.06434
----
Optimizing Layer 6/60 (slerp): 100%|██████████████| 4/4 [05:08<00:00, 77.18s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
15:01:58 - Layer 6/60 - RETAINED - -0.06434
----
Optimizing Layer 7/60 (slerp): 100%|██████████████| 4/4 [04:41<00:00, 70.31s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
15:08:01 - Layer 7/60 - RETAINED - -0.06434
----
Optimizing Layer 8/60 (slerp): 100%|██████████████| 4/4 [03:51<00:00, 57.86s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
15:13:10 - Layer 8/60 - RETAINED - -0.06434
----
Optimizing Layer 9/60 (slerp): 100%|██████████████| 4/4 [04:02<00:00, 60.54s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.6, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.4, 'SUSTech_SUS-Chat-34B']]
15:18:34 - Layer 9/60 - CHANGED - -0.06434 > -0.06464 - 0.5%
----
Optimizing Layer 10/60 (slerp): 100%|█████████████| 4/4 [03:53<00:00, 58.40s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
15:23:40 - Layer 10/60 - RETAINED - -0.06464
----
Optimizing Layer 11/60 (slerp): 100%|█████████████| 4/4 [03:39<00:00, 54.91s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
15:28:32 - Layer 11/60 - RETAINED - -0.06464
----
Optimizing Layer 12/60 (slerp): 100%|█████████████| 4/4 [03:40<00:00, 55.10s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
15:33:27 - Layer 12/60 - RETAINED - -0.06464
----
Optimizing Layer 13/60 (slerp): 100%|█████████████| 4/4 [03:49<00:00, 57.36s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.4, 'SUSTech_SUS-Chat-34B']]
15:38:35 - Layer 13/60 - CHANGED - -0.06464 > -0.06527 - 1.0%
----
Optimizing Layer 14/60 (slerp): 100%|█████████████| 4/4 [03:42<00:00, 55.74s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.2, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.6, 'SUSTech_SUS-Chat-34B']]
15:43:30 - Layer 14/60 - CHANGED - -0.06527 > -0.06851 - 5.0%
----
Optimizing Layer 15/60 (slerp): 100%|█████████████| 4/4 [03:44<00:00, 56.04s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
15:48:41 - Layer 15/60 - RETAINED - -0.06851
----
Optimizing Layer 16/60 (slerp): 100%|█████████████| 4/4 [05:23<00:00, 80.84s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
15:55:48 - Layer 16/60 - RETAINED - -0.06851
----
Optimizing Layer 17/60 (slerp): 100%|█████████████| 4/4 [05:31<00:00, 82.76s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Hermes-2-Yi-34B']]
16:03:01 - Layer 17/60 - RETAINED - -0.06851
----
Optimizing Layer 18/60 (slerp): 100%|█████████████| 4/4 [05:34<00:00, 83.64s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
16:10:28 - Layer 18/60 - RETAINED - -0.06851
----
Optimizing Layer 19/60 (slerp): 100%|█████████████| 4/4 [06:17<00:00, 94.38s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
16:18:46 - Layer 19/60 - RETAINED - -0.06851
----
Optimizing Layer 20/60 (slerp): 100%|█████████████| 4/4 [04:52<00:00, 73.08s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.4, 'SUSTech_SUS-Chat-34B']]
16:25:26 - Layer 20/60 - CHANGED - -0.06851 > -0.06892 - 0.6%
----
Optimizing Layer 21/60 (slerp): 100%|█████████████| 4/4 [05:08<00:00, 77.11s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
16:32:37 - Layer 21/60 - RETAINED - -0.06892
----
Optimizing Layer 22/60 (slerp): 100%|█████████████| 4/4 [04:54<00:00, 73.54s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
16:39:05 - Layer 22/60 - RETAINED - -0.06892
----
Optimizing Layer 23/60 (slerp): 100%|█████████████| 4/4 [04:53<00:00, 73.34s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.4, 'NousResearch_Nous-Hermes-2-Yi-34B']]
16:45:29 - Layer 23/60 - RETAINED - -0.06892
----
Optimizing Layer 24/60 (slerp): 100%|█████████████| 4/4 [04:53<00:00, 73.38s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
16:51:58 - Layer 24/60 - RETAINED - -0.06892
----
Optimizing Layer 25/60 (slerp): 100%|█████████████| 4/4 [04:55<00:00, 73.86s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.4, 'SUSTech_SUS-Chat-34B']]
16:58:30 - Layer 25/60 - CHANGED - -0.06892 > -0.07074 - 2.6%
----
Optimizing Layer 26/60 (slerp): 100%|█████████████| 4/4 [04:11<00:00, 62.83s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.4, 'NousResearch_Nous-Hermes-2-Yi-34B']]
17:04:08 - Layer 26/60 - RETAINED - -0.07074
----
Optimizing Layer 27/60 (slerp): 100%|█████████████| 4/4 [04:11<00:00, 62.75s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
17:09:50 - Layer 27/60 - RETAINED - -0.07074
----
Optimizing Layer 28/60 (slerp): 100%|█████████████| 4/4 [04:05<00:00, 61.40s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
17:15:21 - Layer 28/60 - RETAINED - -0.07074
----
Optimizing Layer 29/60 (slerp): 100%|█████████████| 4/4 [05:07<00:00, 76.83s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
17:21:57 - Layer 29/60 - RETAINED - -0.07074
----
Optimizing Layer 30/60 (slerp): 100%|█████████████| 4/4 [04:06<00:00, 61.63s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
17:27:34 - Layer 30/60 - RETAINED - -0.07074
----
Optimizing Layer 31/60 (slerp): 100%|█████████████| 4/4 [04:21<00:00, 65.25s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
17:33:24 - Layer 31/60 - RETAINED - -0.07074
----
Optimizing Layer 32/60 (slerp): 100%|█████████████| 4/4 [04:36<00:00, 69.13s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Hermes-2-Yi-34B']]
17:39:20 - Layer 32/60 - RETAINED - -0.07074
----
Optimizing Layer 33/60 (slerp): 100%|█████████████| 4/4 [04:52<00:00, 73.01s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
17:45:42 - Layer 33/60 - RETAINED - -0.07074
----
Optimizing Layer 34/60 (slerp): 100%|█████████████| 4/4 [05:09<00:00, 77.30s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
17:52:34 - Layer 34/60 - RETAINED - -0.07074
----
Optimizing Layer 35/60 (slerp): 100%|█████████████| 4/4 [05:09<00:00, 77.29s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
17:59:16 - Layer 35/60 - RETAINED - -0.07074
----
Optimizing Layer 36/60 (slerp): 100%|█████████████| 4/4 [05:19<00:00, 79.91s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
18:06:13 - Layer 36/60 - RETAINED - -0.07074
----
Optimizing Layer 37/60 (slerp): 100%|█████████████| 4/4 [05:40<00:00, 85.08s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.8, 'SUSTech_SUS-Chat-34B']]
18:13:35 - Layer 37/60 - CHANGED - -0.07074 > -0.07127 - 0.8%
----
Optimizing Layer 38/60 (slerp): 100%|█████████████| 4/4 [04:50<00:00, 72.69s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Hermes-2-Yi-34B']]
18:20:03 - Layer 38/60 - RETAINED - -0.07127
----
Optimizing Layer 39/60 (slerp): 100%|█████████████| 4/4 [05:23<00:00, 80.96s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
18:26:55 - Layer 39/60 - RETAINED - -0.07127
----
Optimizing Layer 40/60 (slerp): 100%|█████████████| 4/4 [04:10<00:00, 62.57s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
18:32:47 - Layer 40/60 - RETAINED - -0.07127
----
Optimizing Layer 41/60 (slerp): 100%|█████████████| 4/4 [05:23<00:00, 80.96s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
18:39:44 - Layer 41/60 - RETAINED - -0.07127
----
Optimizing Layer 42/60 (slerp): 100%|█████████████| 4/4 [04:03<00:00, 60.87s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
18:45:31 - Layer 42/60 - RETAINED - -0.07127
----
Optimizing Layer 43/60 (slerp): 100%|█████████████| 4/4 [03:36<00:00, 54.22s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
18:50:34 - Layer 43/60 - RETAINED - -0.07127
----
Optimizing Layer 44/60 (slerp): 100%|█████████████| 4/4 [03:52<00:00, 58.18s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
18:55:44 - Layer 44/60 - RETAINED - -0.07127
----
Optimizing Layer 45/60 (slerp): 100%|█████████████| 4/4 [03:39<00:00, 54.92s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
19:00:39 - Layer 45/60 - RETAINED - -0.07127
----
Optimizing Layer 46/60 (slerp): 100%|█████████████| 4/4 [03:36<00:00, 54.06s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
19:05:24 - Layer 46/60 - RETAINED - -0.07127
----
Optimizing Layer 47/60 (slerp): 100%|█████████████| 4/4 [03:50<00:00, 57.54s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
19:10:28 - Layer 47/60 - RETAINED - -0.07127
----
Optimizing Layer 48/60 (slerp): 100%|█████████████| 4/4 [04:02<00:00, 60.62s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Hermes-2-Yi-34B']]
19:15:45 - Layer 48/60 - RETAINED - -0.07127
----
Optimizing Layer 49/60 (slerp): 100%|█████████████| 4/4 [03:59<00:00, 59.77s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.6, 'SUSTech_SUS-Chat-34B']]
19:21:02 - Layer 49/60 - CHANGED - -0.07127 > -0.07407 - 3.9%
----
Optimizing Layer 50/60 (slerp): 100%|█████████████| 4/4 [03:53<00:00, 58.25s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.4, 'SUSTech_SUS-Chat-34B']]
19:26:11 - Layer 50/60 - CHANGED - -0.07407 > -0.07571 - 2.2%
----
Optimizing Layer 51/60 (slerp): 100%|█████████████| 4/4 [03:59<00:00, 59.91s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
19:31:30 - Layer 51/60 - RETAINED - -0.07571
----
Optimizing Layer 52/60 (slerp): 100%|█████████████| 4/4 [04:43<00:00, 70.77s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.6, 'SUSTech_SUS-Chat-34B']]
19:37:38 - Layer 52/60 - CHANGED - -0.07571 > -0.07660 - 1.2%
----
Optimizing Layer 53/60 (slerp): 100%|█████████████| 4/4 [04:26<00:00, 66.68s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.2, 'NousResearch_Nous-Capybara-34B'], [0.4, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.8, 'SUSTech_SUS-Chat-34B']]
19:43:27 - Layer 53/60 - CHANGED - -0.07660 > -0.07717 - 0.8%
----
Optimizing Layer 54/60 (slerp): 100%|█████████████| 4/4 [04:49<00:00, 72.34s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.4, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.8, 'SUSTech_SUS-Chat-34B']]
19:50:18 - Layer 54/60 - CHANGED - -0.07717 > -0.07775 - 0.7%
----
Optimizing Layer 55/60 (slerp): 100%|█████████████| 4/4 [04:12<00:00, 63.01s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.8, 'SUSTech_SUS-Chat-34B']]
19:56:01 - Layer 55/60 - CHANGED - -0.07775 > -0.07923 - 1.9%
----
Optimizing Layer 56/60 (slerp): 100%|█████████████| 4/4 [03:56<00:00, 59.03s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
20:01:25 - Layer 56/60 - RETAINED - -0.07923
----
Optimizing Layer 57/60 (slerp): 100%|█████████████| 4/4 [04:07<00:00, 61.99s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Capybara-34B']]
20:06:54 - Layer 57/60 - RETAINED - -0.07923
----
Optimizing Layer 58/60 (slerp): 100%|█████████████| 4/4 [03:55<00:00, 58.84s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B']]
20:12:09 - Layer 58/60 - RETAINED - -0.07923
----
Optimizing Layer 59/60 (slerp): 100%|█████████████| 4/4 [03:27<00:00, 51.80s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B']]
20:16:49 - Layer 59/60 - RETAINED - -0.07923
----
Optimizing Layer 60/60 (slerp): 100%|█████████████| 4/4 [04:01<00:00, 60.29s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2']]
20:22:08 - Layer 60/60 - RETAINED - -0.07923
----
Optimizing Header: 100%|██████████████████████████| 4/4 [03:49<00:00, 57.30s/it]
[[1.0, 'jondurbin_bagel-dpo-34b-v0.2'], [0.6, 'NousResearch_Nous-Capybara-34B'], [0.8, 'NousResearch_Nous-Hermes-2-Yi-34B'], [0.6, 'SUSTech_SUS-Chat-34B']]
20:26:56 - Header - CHANGED - -0.07923 > -0.07981 - 0.7%
-----------------------------------------------------------------------------------------------------
| Type | Phrase | Context | Raw Prob* | Used Prob** | Change |
-----------------------------------------------------------------------------------------------------
| BAD | anticipation | Her body quivers with | 0.00000% | 0.00% | +0.00% |
| BAD | anticipation | The atmosphere is thic.. | 0.00000% | 0.00% | +0.00% |
| BAD | unwavering | Filled with an | 0.00000% | 0.00% | +0.00% |
| BAD | determination | Her eyes were filled w.. | 0.00000% | 0.00% | -0.00% |
| BAD | determination | Her stubbornness only .. | 0.00000% | 0.00% | +0.00% |
| BAD | whisper | Her voice barely above.. | 0.00000% | 0.00% | +0.00% |
| BAD | spine | shivers down her | 0.00000% | 0.00% | +0.00% |
| BAD | sends shivers | The thrill of the act | 0.00000% | 0.00% | +0.00% |
| BAD | ministrations | She moans and twitches.. | 0.00004% | 0.00% | -0.00% |
| BAD | legs | wraps her | 0.00000% | 0.00% | -0.00% |
| BAD | imposing figure | He had an | 0.00000% | 0.00% | -0.00% |
| BAD | shared challenges | Their bond strengthene.. | 0.00001% | 0.00% | +0.00% |
| BAD | bond | forged a | 0.00005% | 0.00% | -0.00% |
| BAD | bond | an unspoken | 0.00010% | 0.00% | +0.00% |
| BAD | enhance our expe.. | I'm excited to see how | 0.00000% | 0.00% | +0.00% |
| BAD | sense of vulnera.. | create a | 0.00000% | 0.00% | -0.00% |
| BAD | dimensions of in.. | explore new | 0.00000% | 0.00% | +0.00% |
| BAD | deepening our co.. | while | 0.00000% | 0.00% | -0.00% |
| BAD | shared experiences | through | 0.00001% | 0.00% | +0.00% |
| BAD | societal expecta.. | that transcend | 0.00000% | 0.00% | -0.00% |
| BAD | conventional bou.. | that defy | 0.00000% | 0.00% | +0.00% |
| BAD | conventional bou.. | and defy | 0.00000% | 0.00% | +0.00% |
| BAD | open communication | an environment | 0.00000% | 0.00% | +0.00% |
| BAD | emotional vulner.. | an environment | 0.00000% | 0.00% | +0.00% |
| BAD | heightens our co.. | touch and the anticipa.. | 0.00000% | 0.00% | -0.00% |
| BAD | sensations you'r.. | I'm enjoying | 0.00000% | 0.00% | +0.00% |
| BAD | is truly arousing | attention to detail | 0.00000% | 0.00% | +0.00% |
| BAD | is truly arousing | way you explore my body | 0.00000% | 0.00% | +0.00% |
| BAD | challenge presen.. | my resolve unwavering .. | 0.00000% | 0.00% | +0.00% |
| BAD | humble vessel | surrendering to the ex.. | 0.00000% | 0.00% | +0.00% |
| BAD | bond | cherishing the unique | 0.00019% | 0.00% | +0.00% |
| BAD | bond | special | 0.00014% | 0.00% | -0.00% |
| BAD | grows stronger w.. | bond | 0.00000% | 0.00% | -0.00% |
| BAD | that cannot be b.. | bond | 0.00000% | 0.00% | -0.00% |
| BAD | becomes unbreaka.. | bond | 0.00000% | 0.00% | +0.00% |
| BAD | grew stronger wi.. | bond | 0.00000% | 0.00% | +0.00% |
| GOOD | The apple is in .. | Question: If I'm in th.. | 7.81435% | 7.81% | +7.81% |
------------------------------------------------------------------------------------------------------
| Totals | 7.81% | 7.82% | 7.81% |
------------------------------------------------------------------------------------------------------
* = Unweighted, raw probability - ** = Probability after weight adjustments
-------- MERGE COMPOSITION ---------
jondurbin_bagel-dpo-34b-v0.2: 0.49
NousResearch_Nous-Hermes-2-Yi-34B: 0.24
SUSTech_SUS-Chat-34B: 0.14
NousResearch_Nous-Capybara-34B: 0.13
20:28:04 - Saving model to ./mm-output...
20:28:48 - Copying tokenizer files to ./mm-output...
Skipped added_tokens.json (not found)
Copied tokenizer.model
Copied special_tokens_map.json
Copied tokenizer_config.json
Skipped vocab.json (not found)
Skipped merges.txt (not found)
20:28:48 - Model and tokenizer files saved successfully.