2023/11/03: Efficient Deep Learning Computing 강의

카테고리 없음

2023/11/03: Efficient Deep Learning Computing 강의

hellcat 2023. 11. 3. 13:22

TinyML and Efficient Deep Learning Computing

MIT 6.5940 Fall 2023 TinyML and Efficient Deep Learning Computing

Grading The class requirements include five labs, and one final project. This is a PhD level course, and by the end of this class you should have a good understanding of efficient deep learning techniques, and be able to deploy large language models (LLMs)

hanlab.mit.edu

lec12.zip

19.95MB

안정적인 LLM 서비스를 위한 서빙 최적화 기법

새로운 루다를 지탱하는 모델 서빙 아키텍처 — 3편: 안정적인 LLM 서비스를 위한 서빙 최적화 기

LLM 서빙을 위한 다양한 최적화 기법과 그 효과를 검증하기 위한 부하 테스트 방법론

tech.scatterlab.co.kr

최대 24배 빠른 vLLM의 비밀 파헤치기

최대 24배의 성능을 보인 vLLM, 코드 레벨까지 분석해보자!

tech.scatterlab.co.kr

The Secret Sauce of LLaMA🦙 : A Deep Dive!

Understanding the ins and outs of Meta's LLaMa(Open and Efficient Foundation Language Models) from scratch.

R4j4n.github.io

Decoding Strategies in Language Models

Exploring and implementing text decoding strategies in PyTorch

R4j4n.github.io

Transformers Optimization: Part 1 - KV Cache

Understanding KV Cache, its working mechanism and comparison with vanilla architecture.

R4j4n.github.io

현재글2023/11/03: Efficient Deep Learning Computing 강의

AI, Quant 투자 공부

글쓰기 좋아하는 AI 엔지니어의 AI와 Quant 투자 스터디를 위한 공간

LLaMA-Adapter, transformer, GPT, llm, 퀀트투자, mdd, vscode, 거인의포트폴리오, training, ChatGPT, gpt-4, 삼프로tv, etf, 정채진프로, llma, 강환국, Generative-AI, jupyter notebook, QLORA, State of GPT,

Today :
Yesterday :

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

AI, Quant 투자 공부