Skip to content
Commit b29141f8 authored by Phil Wang's avatar Phil Wang
Browse files

use 2d dynamic positional bias for fine transformer, to try to improve...

use 2d dynamic positional bias for fine transformer, to try to improve training at greater number of fine quantizers
parent 9ba1b040
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment