Respair commited on
Commit
49b119c
·
verified ·
1 Parent(s): 3007fbc

Update app_tsukasa_remote.py

Browse files
Files changed (1) hide show
  1. app_tsukasa_remote.py +7 -2
app_tsukasa_remote.py CHANGED
@@ -317,7 +317,7 @@ yielding reasonably good results.
317
  </p>
318
 
319
  <p>
320
- Overall, I'm confident that with a bit of experimentation, you can achieve reasonbaly good results. <br>
321
  The model should work well out of the box 90% of the time without the need for extensive tweaking.<br>
322
  However, here are some tips in case you encounter issues:
323
  </p>
@@ -329,6 +329,10 @@ However, here are some tips in case you encounter issues:
329
  Ensure that your input closely matches your reference (audio or text prompt) in terms of tone,<br>
330
  non-verbal cues, duration, etc.
331
  </li>
 
 
 
 
332
 
333
  <li>
334
  If your audio is too long but the input is too short, the speech rate will be slow, and vice versa.
@@ -336,7 +340,8 @@ However, here are some tips in case you encounter issues:
336
 
337
  <li>
338
  Experiment with the <b>alpha</b>, <b>beta</b>, and <b>Intensity</b> parameters. The Diffusion<br>
339
- sampler is non-deterministic, so regenerate a few times if you're not satisfied with the output.
 
340
  </li>
341
 
342
  <li>
 
317
  </p>
318
 
319
  <p>
320
+ Overall, I'm confident that with a bit of experimentation, you can achieve impressive results. <br>
321
  The model should work well out of the box 90% of the time without the need for extensive tweaking.<br>
322
  However, here are some tips in case you encounter issues:
323
  </p>
 
329
  Ensure that your input closely matches your reference (audio or text prompt) in terms of tone,<br>
330
  non-verbal cues, duration, etc.
331
  </li>
332
+
333
+ <li>
334
+ **Do not use very short inputs**
335
+ </li>
336
 
337
  <li>
338
  If your audio is too long but the input is too short, the speech rate will be slow, and vice versa.
 
340
 
341
  <li>
342
  Experiment with the <b>alpha</b>, <b>beta</b>, and <b>Intensity</b> parameters. The Diffusion<br>
343
+ sampler is non-deterministic, so regenerate a few times if you're not satisfied with the output. <br>
344
+ don't use crazy values though! turn it off if it was bad by setting both alpha and beta to zero.
345
  </li>
346
 
347
  <li>