PowLU: An Activation Function for Stable Pre-Training of LLMs · Vinony