Skip to content
GitLab
Explore
Sign in
This is an archived project. Repository and other project resources are read-only.
Commits · 532d8e95cf5a3ff15d8ce967d4748e6041bc03ce
audiolm-pytorch-flask
Browse files
Oct 30, 2022
Merge pull request #9 from aabzaliev/main
· 532d8e95
Phil Wang
authored
Oct 29, 2022
532d8e95
include stft discriminator features into feature loss
· 42b334cf
abzaliev
authored
Oct 29, 2022
42b334cf
Oct 29, 2022
add the "feature" loss as mentioned in the paper with customizable weight
· d0e1f681
Phil Wang
authored
Oct 28, 2022
d0e1f681
Oct 28, 2022
remove padding in causal convtranspose1d thanks to @NPN for catching that
· 6810d515
Phil Wang
authored
Oct 27, 2022
6810d515
first pass at stft discriminator
· 6bd8b8fa
Phil Wang
authored
Oct 27, 2022
6bd8b8fa
trims output on right for causal convtranspose1d, thanks to @NPN in...
· d9c6ba40
Phil Wang
authored
Oct 27, 2022
d9c6ba40
Oct 27, 2022
complete the multi-scale discriminators for soundstream
· 4acee04e
Phil Wang
authored
Oct 26, 2022
4acee04e
prep for adversarial learning in soundstream
· ee51e21c
Phil Wang
authored
Oct 26, 2022
ee51e21c
cite soundstream
· c5f983b8
Phil Wang
authored
Oct 26, 2022
c5f983b8
add kmeans init and expiry of stale codes as detailed in soundstream paper
· 56bcd108
Phil Wang
authored
Oct 26, 2022
56bcd108
will need ema for soundstream
· 8b78d101
Phil Wang
authored
Oct 26, 2022
8b78d101
Oct 26, 2022
go with normal convtranspose1d, causal may be not possible?
· 9d832142
Phil Wang
authored
Oct 25, 2022
9d832142
give causal convtranspose another shot
· c3ed8db5
Phil Wang
authored
Oct 25, 2022
c3ed8db5
get basic encoder, residual quantization, and decoder code for soundstream...
· 3bdca366
Phil Wang
authored
Oct 25, 2022
3bdca366
Oct 25, 2022
causal mask needed in attention
· 9cd86d25
Phil Wang
authored
Oct 24, 2022
9cd86d25
they use three causal attention networks for semantic, coarse, fine. prepare...
· 277dabf5
Phil Wang
authored
Oct 24, 2022
277dabf5
Oct 20, 2022
complete basic attention encoder
· 9b1fef52
Phil Wang
authored
Oct 19, 2022
9b1fef52
Oct 18, 2022
attention, but what else?
· d3bb0612
Phil Wang
authored
Oct 17, 2022
d3bb0612
Sep 11, 2022
diagram
· 0e3962db
Phil Wang
authored
Sep 10, 2022
0e3962db
use vq library for residual quantization in soundstream
· 17e4b200
Phil Wang
authored
Sep 10, 2022
17e4b200
scaffold
· 2cba4a09
Phil Wang
authored
Sep 10, 2022
2cba4a09
Sep 10, 2022
wip
· 022ba972
Phil Wang
authored
Sep 09, 2022
022ba972
wip
· 70326046
Phil Wang
authored
Sep 09, 2022
70326046
Initial commit
· aff5977d
Phil Wang
authored
Sep 09, 2022
aff5977d
Loading