Skip to content
GitLab
Explore
Sign in
This is an archived project. Repository and other project resources are read-only.
Commits · 15151252340a5b0d76ed82aa0a56d6d54e29abab
audiolm-pytorch-flask
Browse files
Feb 03, 2023
each hierarchical transformer will have its own set of conditioning embeddings...
· 15151252
Phil Wang
authored
Feb 02, 2023
View commits for tag 0.9.1
0.9.1
15151252
add ability to condition on joint text-audio embeddings from mulan, over at musiclm-pytorch
· 2e654ba8
Phil Wang
authored
Feb 02, 2023
View commits for tag 0.9.0
0.9.0
2e654ba8
use a base class to enforce what the trainer can accept, for musiclm
· 04bed934
Phil Wang
authored
Feb 02, 2023
View commits for tag 0.8.3
0.8.3
04bed934
loosen restrictions around what is passed in for transformers into the...
· 88bf4dfb
Phil Wang
authored
Feb 02, 2023
88bf4dfb
prepare for musiclm inference and also address...
· 8db0afee
Phil Wang
authored
Feb 02, 2023
View commits for tag 0.8.1
0.8.1
8db0afee
Feb 02, 2023
0.7.9
· 6919830b
Phil Wang
authored
Feb 01, 2023
View commit 6919830b
2 tags
6919830b
Merge pull request #68 from djqualia/patch-10
· 441aa72b
Phil Wang
authored
Feb 01, 2023
441aa72b
Convert stereo/multi-channel audio to mono
· ab4fc99a
djqualia
authored
Feb 01, 2023
ab4fc99a
appreciation
· ba7dcd68
Phil Wang
authored
Feb 01, 2023
ba7dcd68
default back to distributed friendly soundstream, lower default learning rate...
· 55da605a
Phil Wang
authored
Feb 01, 2023
View commits for tag 0.7.8
0.7.8
55da605a
Feb 01, 2023
potentially fix
https://github.com/lucidrains/audiolm-pytorch/issues/64
· 47ee1c39
Phil Wang
authored
Jan 31, 2023
View commits for tag 0.7.7
0.7.7
47ee1c39
get rid of a warning message
· edb8f779
Phil Wang
authored
Jan 31, 2023
edb8f779
default back to stft discriminator using torch.complex64, due to...
· fe97568c
Phil Wang
authored
Jan 31, 2023
View commits for tag 0.7.6
0.7.6
fe97568c
Jan 31, 2023
Merge pull request #63 from LWprogramming/demo_ipynb
· c6c8299a
Phil Wang
authored
Jan 30, 2023
c6c8299a
Add demo Jupyter notebook to run e2e
· e65a4333
Leon Wu
authored
Jan 30, 2023
e65a4333
Jan 29, 2023
appreciation
· a5b4b990
Phil Wang
authored
Jan 28, 2023
a5b4b990
just set as default and wait for a researcher to raise an issue
· d7623ab3
Phil Wang
authored
Jan 28, 2023
View commits for tag 0.7.5
0.7.5
d7623ab3
Jan 28, 2023
attempt to fix distributed training for soundstream in...
· dfe9a1e8
Phil Wang
authored
Jan 27, 2023
View commits for tag 0.7.4
0.7.4
dfe9a1e8
update
· 49499085
Phil Wang
authored
Jan 27, 2023
49499085
Jan 26, 2023
bring back the complex stft discriminator, to try to figure out where the...
· 2ebfd1c1
Phil Wang
authored
Jan 25, 2023
View commits for tag 0.7.3
0.7.3
2ebfd1c1
Jan 25, 2023
revert changes to elu activation placement, as researcher @yigityu is hitting convergence issues
· e6eaabdb
Phil Wang
authored
Jan 24, 2023
View commits for tag 0.7.2
0.7.2
e6eaabdb
Jan 21, 2023
readme
· d06fcf7e
Phil Wang
authored
Jan 20, 2023
d06fcf7e
add learned multi-headed exponential moving average as an option for soundstream
· ed10de42
Phil Wang
authored
Jan 20, 2023
View commits for tag 0.7.1
0.7.1
ed10de42
fix when turning off local attention for sound stream
· 455def25
Phil Wang
authored
Jan 20, 2023
View commits for tag 0.6.3
0.6.3
455def25
switch to continuous positional bias, for the length extrapolation at inference time
· d5890a24
Phil Wang
authored
Jan 20, 2023
View commits for tag 0.6.2
0.6.2
d5890a24
make dilation order in encoder and decoder configurable in soundstream
· 8e3d1979
Phil Wang
authored
Jan 20, 2023
View commits for tag 0.6.1
0.6.1
8e3d1979
Jan 20, 2023
use the same activation placements as encodec, suggested by @inspirit
· 36c39540
Phil Wang
authored
Jan 20, 2023
View commits for tag 0.6.0
0.6.0
36c39540
Jan 19, 2023
make sure to address the discriminator gradient issues uncovered by...
· daeedb27
Phil Wang
authored
Jan 18, 2023
View commits for tag 0.5.1
0.5.1
daeedb27
make sure local attention blocks are followed by feedforward
· ec642cd9
Phil Wang
authored
Jan 18, 2023
View commits for tag 0.5.0
0.5.0
ec642cd9
readme
· 5a7fee75
Phil Wang
authored
Jan 18, 2023
5a7fee75
add an extra assert to protect against empty audiofiles
· 15391183
Phil Wang
authored
Jan 18, 2023
View commits for tag 0.4.8
0.4.8
15391183
readme
· 9d992724
Phil Wang
authored
Jan 18, 2023
9d992724
be able to set dataloader num workers on soundstream trainer
· 58f13b44
Phil Wang
authored
Jan 18, 2023
View commits for tag 0.4.7
0.4.7
58f13b44
make sure one can also finely specify the max length of each target sample freq
· 2f4407b4
Phil Wang
authored
Jan 18, 2023
View commits for tag 0.4.6
0.4.6
2f4407b4
fix order in which audio is resampled vs pad / curtailed
· f7c26f3b
Phil Wang
authored
Jan 18, 2023
View commits for tag 0.4.5
0.4.5
f7c26f3b
Jan 17, 2023
also allow for single resample target hz for sounddataset
· 7f788a62
Phil Wang
authored
Jan 16, 2023
View commits for tag 0.4.4
0.4.4
7f788a62
reduce confusion for target_sample_hz
· 5d6cd874
Phil Wang
authored
Jan 16, 2023
View commits for tag 0.4.3
0.4.3
5d6cd874
fix an issue with resampling in sounddataset, thanks to @LWprogramming
· 0f10c2e1
Phil Wang
authored
Jan 16, 2023
View commits for tag 0.4.2
0.4.2
0f10c2e1
just automatically handle t5_encode_text receiving a single string
· 096f72a8
Phil Wang
authored
Jan 16, 2023
View commits for tag 0.4.1
0.4.1
096f72a8
Jan 15, 2023
allow for prefix-based self attention conditioning, as was done in VALL-E, and...
· 654d4969
Phil Wang
authored
Jan 14, 2023
View commits for tag 0.4.0
0.4.0
654d4969
Loading