https://vgel.me/posts/faster-inference/
Table of Contents
- Why is simple inference so slow?
- Hardware
- Batching
- Shrinking model weights
- KV caching
- Speculative Decoding
- Training time optimizations
- Conclusion
How to make LLMs go fast (vgel.me, December 18, 2023)
Excerpt: "In my last post, we made a transformer by hand. There, we used the classic autoregressive sampler, along the lines of: ... This approach to inference is elegant and cuts to the heart of how LLMs work: they're autoregressive ..."
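The excerpt breaks off right where the original post shows its sampling loop ("along the lines of:"). As a stand-in, here is a minimal sketch of such a classic autoregressive sampling loop, not the post's actual code; `model` (a callable returning per-position next-token logits) and `tokenizer` are hypothetical names assumed for illustration.

```python
import numpy as np

def generate(model, tokenizer, prompt, max_new_tokens=50, temperature=1.0):
    # Hypothetical interfaces: tokenizer.encode/decode and a model that maps
    # a token sequence to logits of shape (seq_len, vocab_size).
    tokens = tokenizer.encode(prompt)
    for _ in range(max_new_tokens):
        logits = model(tokens)              # full forward pass every step
        logits = logits[-1] / temperature   # keep only the last position
        probs = np.exp(logits - logits.max())
        probs /= probs.sum()                # softmax -> next-token distribution
        next_token = np.random.choice(len(probs), p=probs)
        tokens.append(int(next_token))      # feed the sampled token back in
    return tokenizer.decode(tokens)
```

The point the post builds on is visible in the sketch: each new token requires another forward pass over the growing sequence, which is what the later sections (batching, KV caching, speculative decoding) aim to speed up.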