'llm' 태그의 글 목록

llm 3

2023/05/07: Single GPU로 LLM 추론하기, 효율적인 Transformers 등

1. High-throughput Generative Inference of Large Language Models with a Single GPU source: https://arxiv.org/pdf/2303.06865.pdf 2. Deploying Large NLP Models: Infrastructure Cost Optimization source: https://neptune.ai/blog/nlp-models-infrastructure-cost-optimization 3. What Are Transformer Models and How Do They Work? source: https://txt.cohere.com/what-are-transformer-models/ 4. Efficient Tran..

Daily-Trend-Review 2023.05.07

2023/05/01: Landscpae of AI/ML in 2023

1. Landscape of AI/ML in 2023 source: https://www.slideshare.net/HyunJoonJung/landscape-of-aiml-in-2023?fbclid=IwAR1UUpsfWOhbKap4R7FPtKf090hmQXrYsyXZ7jCSufqclNxjStsNlttjaTg 2. A brief history of LLMA models - AGI Sphere source: https://agi-sphere.com/llama-models/?fbclid=IwAR3TY21xWdFl9CiMU-ayTS2kXYE67xLw75b-zq1MJInFfhExpQUG96uBp5I 3. A fascinating tree of GPTs & LLMs reveals what’s been going o..

카테고리 없음 2023.05.01

2023/02/27: AI 반도체 기술, 효과적인 Large AI 모델 빌딩하기 등

1.대한민국 초거대 AI 시대를 열어가기 위한 AI 반도체 기술(2/10) 이동수 이사 (네이버 클라우드 AI) 발표 전체적으로 AI 추론은 메모리 문제가 커질 것으로 전망함 삼성(메모리) + NAVER AI 반도체 공동개발 중 → HyperCLOVA에 최적화된 AI 반도체 솔루션 개발 ChatGPT 서비스의 문제 Sam Altman Said "Average is probably single-digits cents per chat" Transformer는 MM이 전부임 경량화 알고리즘이 중요함 GPT 계열의 특성 입력 처리 (Computation-intensive) 출력 처리 (Memory-intensive) Why? GPT는 Autoregressive LM 이기 때문임 The End of SRAM Sc..

Daily-Trend-Review 2023.02.27

AI, Quant 투자 공부

글쓰기 좋아하는 AI 엔지니어의 AI와 Quant 투자 스터디를 위한 공간

transformer, 강환국, QLORA, ChatGPT, 삼프로tv, etf, Generative-AI, State of GPT, llm, GPT, vscode, 거인의포트폴리오, training, 퀀트투자, mdd, llma, jupyter notebook, LLaMA-Adapter, gpt-4, 정채진프로,

Today :
Yesterday :

일	월	화	수	목	금	토
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

AI, Quant 투자 공부

llm 3

티스토리툴바

단축키

내 블로그

블로그 게시글

모든 영역