Understanding Llama2: KV Cache, Grouped Query Attention, Rotary Embedding and More

Grouped Query Attention, Rotary Embedding, KV Cache, Root Mean Square Normalization

October 17, 2023