Skip to content
Commit f7756f56 authored by Phil Wang's avatar Phil Wang
Browse files

a simple measure for greater transformer training stability

parent 5b24b4f5
Loading
Loading
Loading
Loading
0% Loading or .
You are about to add 0 people to the discussion. Proceed with caution.
Please register or to comment