introduce attn_dynamic_pos_bias for soundstream, which should have the best length extrapolation properties for attention
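
For context, a minimal sketch of what a dynamic positional bias typically does, assuming the common formulation in which a small MLP maps each relative distance `i - j` to one bias value per attention head, and that bias is added to the attention logits. The helper names (`make_mlp`, `bias_for_distance`, `dynamic_pos_bias`) and the pure-Python layout are hypothetical and chosen for illustration; this is not the code from this commit.

```python
import random


def make_mlp(hidden, heads, seed=0):
    """Randomly initialise a tiny 1 -> hidden -> heads MLP (illustrative only)."""
    rng = random.Random(seed)
    w1 = [rng.uniform(-1, 1) for _ in range(hidden)]   # input -> hidden weights
    b1 = [rng.uniform(-1, 1) for _ in range(hidden)]   # hidden biases
    w2 = [[rng.uniform(-1, 1) for _ in range(hidden)]  # hidden -> heads weights
          for _ in range(heads)]
    return w1, b1, w2


def bias_for_distance(mlp, distance):
    """Map one scalar relative distance to a list with one bias per head."""
    w1, b1, w2 = mlp
    hidden = [max(0.0, w * distance + b) for w, b in zip(w1, b1)]  # ReLU layer
    return [sum(w * h for w, h in zip(row, hidden)) for row in w2]


def dynamic_pos_bias(mlp, seq_len):
    """Return bias[h][i][j], which depends only on the offset i - j."""
    # Evaluate the MLP once per distinct relative distance.
    table = {d: bias_for_distance(mlp, float(d))
             for d in range(-(seq_len - 1), seq_len)}
    heads = len(mlp[2])
    return [[[table[i - j][h] for j in range(seq_len)]
             for i in range(seq_len)]
            for h in range(heads)]


if __name__ == "__main__":
    mlp = make_mlp(hidden=8, heads=2)
    short = dynamic_pos_bias(mlp, 4)
    long = dynamic_pos_bias(mlp, 16)  # same parameters, longer sequence
    print(len(short[0]), len(long[0]))
```

Because the bias is computed from the relative distance rather than looked up in a table of fixed size, the same parameters can be evaluated at sequence lengths not seen before, which is the property the commit message refers to as length extrapolation. How well the learned function behaves at unseen distances is an empirical question that this sketch does not address.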