- [ ] incorporate ability to use hubert intermediate features as semantic tokens, recommended by <ahref="https://github.com/lucidrains/audiolm-pytorch/discussions/13">eonglints</a>
- [ ] complete full training code for soundstream, taking care of discriminator training
@@ -71,7 +72,7 @@ loss.backward()
- [ ] DRY a little at the end
- [ ] figure out how to suppress logging in fairseq
- [ ] test with speech synthesis for starters
- [ ] add classifier free guidance
- [ ] abstract out conditioning + classifier free guidance into external module or potentially a package