Commit d5890a24 authored by Phil Wang

switch to continuous positional bias, for the length extrapolation at inference time

parent 8e3d1979