2024/01/05: Decoding Strategies in Large Language Models

hellcat 2024. 1. 5. 16:07

https://towardsdatascience.com/decoding-strategies-in-large-language-models-9733a8f70539

Decoding Strategies in Large Language Models

A Guide to Text Generation From Beam Search to Nucleus Sampling

Transformers Optimization: Part 1 - KV Cache

https://ai.plainenglish.io/understanding-llama2-kv-cache-grouped-query-attention-rotary-embedding-and-more-c17e5f49a6d7?gi=d83469097275

Understanding Llama2: KV Cache, Grouped Query Attention, Rotary Embedding and More

What’s in LLama2: Grouped Query Attention, Rotary Embedding, KV Cache, Root Mean Square Normalization
