https://towardsdatascience.com/decoding-strategies-in-large-language-models-9733a8f70539
Decoding Strategies in Large Language Models
A Guide to Text Generation From Beam Search to Nucleus Sampling
towardsdatascience.com
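The article above walks from beam search to nucleus (top-p) sampling. As a reminder of the core idea, here is a minimal pure-Python sketch (function name and toy logits are illustrative, not from the article): keep the smallest set of highest-probability tokens whose cumulative mass reaches p, renormalize, and sample within that set.

```python
import math
import random

def nucleus_sample(logits, p=0.9, seed=0):
    """Top-p (nucleus) sampling over raw logits.

    Keeps the smallest set of tokens whose cumulative probability
    reaches p, then samples a token id from that renormalized set.
    """
    rng = random.Random(seed)
    # Softmax with max-subtraction for numerical stability.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Sort token ids by probability, descending.
    order = sorted(range(len(probs)), key=lambda i: -probs[i])
    cum, nucleus = 0.0, []
    for i in order:
        nucleus.append(i)
        cum += probs[i]
        if cum >= p:
            break
    # Renormalize within the nucleus and sample.
    z = sum(probs[i] for i in nucleus)
    r = rng.random() * z
    for i in nucleus:
        r -= probs[i]
        if r <= 0:
            return i
    return nucleus[-1]
```

With a sharply peaked distribution and a small p, the nucleus collapses to the single most likely token, which is why top-p degrades gracefully toward greedy decoding on confident steps.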
Transformers Optimization: Part 1 - KV Cache
Understanding Llama2: KV Cache, Grouped Query Attention, Rotary Embedding and More
What’s in Llama2: Grouped Query Attention, Rotary Embedding, KV Cache, Root Mean Square Normalization
ai.plainenglish.io
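The two articles above describe the KV cache: during autoregressive decoding, the key/value projections of past tokens are stored and reused, so each new step computes only one new K/V row instead of reprojecting the whole prefix. A minimal NumPy sketch of that append-and-attend pattern (class and function names are illustrative, not from the articles):

```python
import numpy as np

def attention(q, K, V):
    """Scaled dot-product attention for a single query vector q
    against key matrix K (n, d) and value matrix V (n, d)."""
    scores = K @ q / np.sqrt(q.shape[-1])
    w = np.exp(scores - scores.max())  # stable softmax
    w /= w.sum()
    return w @ V

class KVCache:
    """Append-only key/value cache for one attention head.

    Each decoding step appends one K/V row; attending with the new
    query then reuses all previously stored rows unchanged.
    """
    def __init__(self, d):
        self.K = np.empty((0, d))
        self.V = np.empty((0, d))

    def append(self, k, v):
        self.K = np.vstack([self.K, k])
        self.V = np.vstack([self.V, v])

    def attend(self, q):
        return attention(q, self.K, self.V)
```

Attending through the cache after appending rows one at a time gives the same result as recomputing attention over the full K/V matrices; the saving is that the per-token projection work for the prefix is never repeated.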