How to make LLMs go fast: https://vgel.me/posts/faster-inference/
Table of Contents
- Why is simple inference so slow?
- Hardware
- Batching
- Shrinking model weights
- KV caching
- Speculative Decoding
- Training time optimizations
- Conclusion
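The linked post explains each technique in depth. As a flavor of one item on that list, here is a minimal, illustrative sketch of KV caching, not code from the post: a toy single-head attention layer where random weights stand in for trained parameters, and each decode step projects only the newest token while reusing cached keys and values. All names and shapes here are assumptions for illustration.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

class ToyAttentionWithKVCache:
    """Toy single-head self-attention that caches past keys/values,
    so each decode step only projects the single newest token."""

    def __init__(self, d_model, rng):
        # Random projections stand in for trained weight matrices.
        self.wq = rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)
        self.wk = rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)
        self.wv = rng.standard_normal((d_model, d_model)) / np.sqrt(d_model)
        self.k_cache = []  # one (d_model,) key vector per past token
        self.v_cache = []  # one (d_model,) value vector per past token

    def step(self, x):
        # x: embedding of the newest token only, shape (d_model,).
        q, k, v = x @ self.wq, x @ self.wk, x @ self.wv
        self.k_cache.append(k)  # keys/values for past tokens are never recomputed
        self.v_cache.append(v)
        K = np.stack(self.k_cache)        # (t, d_model)
        V = np.stack(self.v_cache)        # (t, d_model)
        scores = K @ q / np.sqrt(len(q))  # attend over all cached positions
        return softmax(scores) @ V        # (d_model,) attention output

rng = np.random.default_rng(0)
attn = ToyAttentionWithKVCache(d_model=16, rng=rng)
for t in range(5):
    out = attn.step(rng.standard_normal(16))
print("output for 5th token:", out[:4], "...")
```

The attention itself still reads all t cached positions, but the per-token key/value projections (and, in a real transformer, every layer's work on past tokens) happen only once, instead of being redone from scratch at each generation step.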