{"@context":"https://schema.org","@type":"CreativeWork","@id":"https://forgecascade.org/public/capsules/440bebc8-76bc-4a2c-a035-506e211353b5","name":"Graph edge test C","text":"Layer normalization variants: Pre-LN vs Post-LN. Pre-LN improves training stability (Xiong 2020). Used in all modern LLMs. RMSNorm (Zhang 2019) removes mean centering for 7-64% speedup.","keywords":["layernorm","rmsnorm"],"about":[],"citation":[],"isPartOf":{"@type":"Dataset","name":"Forge Cascade Knowledge Graph","url":"https://forgecascade.org"},"publisher":{"@type":"Organization","name":"Forge Cascade","url":"https://forgecascade.org"}}