Daily-Trend-Review

2024/01/05: Decoding Strategies in Large Language Models

hellcat 2024. 1. 5. 16:07

https://towardsdatascience.com/decoding-strategies-in-large-language-models-9733a8f70539 

 

Decoding Strategies in Large Language Models

A Guide to Text Generation From Beam Search to Nucleus Sampling

towardsdatascience.com

Transformers Optimization: Part1 - KV Cache

 

https://ai.plainenglish.io/understanding-llama2-kv-cache-grouped-query-attention-rotary-embedding-and-more-c17e5f49a6d7?gi=d83469097275

 

Understanding Llama2: KV Cache, Grouped Query Attention, Rotary Embedding and More

What’s in LLama2: Grouped Query Attention, Rotary Embedding, KV Cache, Root Mean Square Normalization

ai.plainenglish.io