Commit 0247f2cc authored by Phil Wang's avatar Phil Wang
Browse files

product management

parent 80cfebad
Loading
Loading
Loading
Loading
+1 −0
Original line number Diff line number Diff line
@@ -145,6 +145,7 @@ music = musiclm(['the crystalline sounds of the piano in a ballroom']) # torch.T
- [x] give dynamic positional bias to self attention in AST

- [ ] add a version of mulan to <a href="https://github.com/mlfoundations/open_clip">open clip</a>
- [ ] support variable lengthed audio with masking in audio transformer, then implement MusicLM generating multiple samples and selecting top match with MuLaN
- [ ] set all the proper spectrogram hyperparameters

## Citations