add local attention at the innermost layer of soundstream, flanking the residual quantization layer on both encoder and decoder
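
As a rough illustration of the placement described above, here is a minimal pure-Python sketch, not the code from this commit. It assumes the encoder's innermost latents arrive as a list of feature vectors, and it runs one local (windowed) self-attention step before and one after a toy residual quantizer. The names `local_attention`, `residual_quantize`, and `bottleneck` are hypothetical, and the attention uses no learned projections, which a real implementation would have.

```python
import math

def local_attention(x, window):
    """Windowed self-attention: position i attends only to positions within
    `window` steps of i. Queries, keys and values are all `x` itself
    (no learned projections; an assumption made to keep the sketch short)."""
    T, d = len(x), len(x[0])
    out = []
    for i in range(T):
        lo, hi = max(0, i - window), min(T, i + window + 1)
        # Scaled dot-product scores against the local neighbourhood only.
        scores = [sum(a * b for a, b in zip(x[i], x[j])) / math.sqrt(d)
                  for j in range(lo, hi)]
        m = max(scores)  # subtract the max for a numerically stable softmax
        weights = [math.exp(s - m) for s in scores]
        z = sum(weights)
        attended = [sum(w * x[j][k] for w, j in zip(weights, range(lo, hi))) / z
                    for k in range(d)]
        # Residual connection around the attention step.
        out.append([xi + ai for xi, ai in zip(x[i], attended)])
    return out

def residual_quantize(x, codebooks):
    """Toy residual quantizer: each stage picks the code nearest to the
    residual left over by the previous stages, and the codes are summed."""
    out = []
    for vec in x:
        residual = list(vec)
        total = [0.0] * len(vec)
        for codebook in codebooks:
            code = min(codebook,
                       key=lambda c: sum((r - ci) ** 2 for r, ci in zip(residual, c)))
            total = [t + ci for t, ci in zip(total, code)]
            residual = [r - ci for r, ci in zip(residual, code)]
        out.append(total)
    return out

def bottleneck(latents, codebooks, window=1):
    """Local attention on both sides of the residual quantization layer."""
    pre = local_attention(latents, window)          # encoder side
    quantized = residual_quantize(pre, codebooks)   # innermost layer
    return local_attention(quantized, window)       # decoder side

latents = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
codebooks = [[[0.0, 0.0], [1.0, 1.0]], [[0.5, 0.0], [0.0, 0.5]]]
decoder_input = bottleneck(latents, codebooks, window=1)
```

The window keeps the cost of attention linear in sequence length, which is one plausible reason to prefer local over full attention at this point in an audio codec.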