Commits · 15151252340a5b0d76ed82aa0a56d6d54e29abab · school / Capstone Design / 01 / AudioLM - Pytorch with flask

This is an archived project. Repository and other project resources are read-only.

Feb 03, 2023
- each hierarchical transformer will have its own set of conditioning embeddings... · 15151252
  Phil Wang authored Feb 02, 2023
  Tag: 0.9.1
- add ability to condition on joint text-audio embeddings from mulan, over at musiclm-pytorch · 2e654ba8
  Phil Wang authored Feb 02, 2023
  Tag: 0.9.0
- use a base class to enforce what the trainer can accept, for musiclm · 04bed934
  Phil Wang authored Feb 02, 2023
  Tag: 0.8.3
- loosen restrictions around what is passed in for transformers into the... · 88bf4dfb
  Phil Wang authored Feb 02, 2023
- prepare for musiclm inference and also address... · 8db0afee
  Phil Wang authored Feb 02, 2023
  Tag: 0.8.1
Feb 02, 2023
- 0.7.9 · 6919830b
  Phil Wang authored Feb 01, 2023
  2 tags
- Merge pull request #68 from djqualia/patch-10 · 441aa72b
  Phil Wang authored Feb 01, 2023
- Convert stereo/multi-channel audio to mono · ab4fc99a
  djqualia authored Feb 01, 2023
- appreciation · ba7dcd68
  Phil Wang authored Feb 01, 2023
- default back to distributed friendly soundstream, lower default learning rate... · 55da605a
  Phil Wang authored Feb 01, 2023
  Tag: 0.7.8
Feb 01, 2023
- potentially fix https://github.com/lucidrains/audiolm-pytorch/issues/64 · 47ee1c39
  Phil Wang authored Jan 31, 2023
  Tag: 0.7.7
- get rid of a warning message · edb8f779
  Phil Wang authored Jan 31, 2023
- default back to stft discriminator using torch.complex64, due to... · fe97568c
  Phil Wang authored Jan 31, 2023
  Tag: 0.7.6
Jan 31, 2023
- Merge pull request #63 from LWprogramming/demo_ipynb · c6c8299a
  Phil Wang authored Jan 30, 2023
- Add demo Jupyter notebook to run e2e · e65a4333
  Leon Wu authored Jan 30, 2023
Jan 29, 2023
- appreciation · a5b4b990
  Phil Wang authored Jan 28, 2023
- just set as default and wait for a researcher to raise an issue · d7623ab3
  Phil Wang authored Jan 28, 2023
  Tag: 0.7.5
Jan 28, 2023
- attempt to fix distributed training for soundstream in... · dfe9a1e8
  Phil Wang authored Jan 27, 2023
  Tag: 0.7.4
- update · 49499085
  Phil Wang authored Jan 27, 2023
Jan 26, 2023
- bring back the complex stft discriminator, to try to figure out where the... · 2ebfd1c1
  Phil Wang authored Jan 25, 2023
  Tag: 0.7.3
Jan 25, 2023
- revert changes to elu activation placement, as researcher @yigityu is hitting convergence issues · e6eaabdb
  Phil Wang authored Jan 24, 2023
  Tag: 0.7.2
Jan 21, 2023
- readme · d06fcf7e
  Phil Wang authored Jan 20, 2023
- add learned multi-headed exponential moving average as an option for soundstream · ed10de42
  Phil Wang authored Jan 20, 2023
  Tag: 0.7.1
- fix when turning off local attention for sound stream · 455def25
  Phil Wang authored Jan 20, 2023
  Tag: 0.6.3
- switch to continuous positional bias, for the length extrapolation at inference time · d5890a24
  Phil Wang authored Jan 20, 2023
  Tag: 0.6.2
- make dilation order in encoder and decoder configurable in soundstream · 8e3d1979
  Phil Wang authored Jan 20, 2023
  Tag: 0.6.1
Jan 20, 2023
- use the same activation placements as encodec, suggested by @inspirit · 36c39540
  Phil Wang authored Jan 20, 2023
  Tag: 0.6.0
Jan 19, 2023
- make sure to address the discriminator gradient issues uncovered by... · daeedb27
  Phil Wang authored Jan 18, 2023
  Tag: 0.5.1
- make sure local attention blocks are followed by feedforward · ec642cd9
  Phil Wang authored Jan 18, 2023
  Tag: 0.5.0
- readme · 5a7fee75
  Phil Wang authored Jan 18, 2023
- add an extra assert to protect against empty audiofiles · 15391183
  Phil Wang authored Jan 18, 2023
  Tag: 0.4.8
- readme · 9d992724
  Phil Wang authored Jan 18, 2023
- be able to set dataloader num workers on soundstream trainer · 58f13b44
  Phil Wang authored Jan 18, 2023
  Tag: 0.4.7
- make sure one can also finely specify the max length of each target sample freq · 2f4407b4
  Phil Wang authored Jan 18, 2023
  Tag: 0.4.6
- fix order in which audio is resampled vs pad / curtailed · f7c26f3b
  Phil Wang authored Jan 18, 2023
  Tag: 0.4.5
Jan 17, 2023
- also allow for single resample target hz for sounddataset · 7f788a62
  Phil Wang authored Jan 16, 2023
  Tag: 0.4.4
- reduce confusion for target_sample_hz · 5d6cd874
  Phil Wang authored Jan 16, 2023
  Tag: 0.4.3
- fix an issue with resampling in sounddataset, thanks to @LWprogramming · 0f10c2e1
  Phil Wang authored Jan 16, 2023
  Tag: 0.4.2
- just automatically handle t5_encode_text receiving a single string · 096f72a8
  Phil Wang authored Jan 16, 2023
  Tag: 0.4.1
Jan 15, 2023
- allow for prefix-based self attention conditioning, as was done in VALL-E, and... · 654d4969
  Phil Wang authored Jan 14, 2023
  Tag: 0.4.0