add local attention at the innermost layer of soundstream, flanking the...
add local attention at the innermost layer of soundstream, flanking the residual quantization layer on both encoder and decoder
Loading
Please register or sign in to comment