product management (0247f2cc) · Commits · school / Capstone Design / 01 / MusicLM

README.md

+1 −0

Original line number	Diff line number	Diff line
		@@ -145,6 +145,7 @@ music = musiclm(['the crystalline sounds of the piano in a ballroom']) # torch.T
		- [x] give dynamic positional bias to self attention in AST

		- [ ] add a version of mulan to <a href="https://github.com/mlfoundations/open_clip">open clip</a>
		- [ ] support variable lengthed audio with masking in audio transformer, then implement MusicLM generating multiple samples and selecting top match with MuLaN
		- [ ] set all the proper spectrogram hyperparameters

		## Citations