Fine-Tune Your Own Llama 2 Model in a Colab Notebook (ML Blog, mlabonne.github.io)
Decoding Strategies in Large Language Models (ML Blog, mlabonne.github.io)
Introduction to Weight Quantization (ML Blog, mlabonne.github.io)
LLM Inference Series: 4. KV caching, a deeper look (medium.com)
In this post, we look at how big the KV cache, a common optimization for LLM inference, can grow, and at common mitigation strategies.
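The KV-cache post above is about how large the cache grows with context length. As a back-of-envelope companion, here is a minimal sketch of the standard size estimate (2 tensors, K and V, per layer per token); the Llama-2-7B-like dimensions below are illustrative assumptions, not taken from the post.

```python
def kv_cache_bytes(n_layers, n_heads, head_dim, seq_len, batch=1, bytes_per_elem=2):
    """Total KV cache size in bytes.

    The factor 2 accounts for one key and one value tensor per layer;
    each token stores n_heads * head_dim elements per tensor per layer.
    """
    return 2 * n_layers * n_heads * head_dim * bytes_per_elem * batch * seq_len

# Illustrative Llama-2-7B-like config: 32 layers, 32 heads, head_dim 128, fp16
size = kv_cache_bytes(n_layers=32, n_heads=32, head_dim=128, seq_len=4096)
print(f"{size / 2**30:.1f} GiB")  # 2.0 GiB for a single 4096-token sequence
```

At batch size 8 the same cache would already need 16 GiB, which is why mitigations like grouped-query attention and quantized caches matter.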
LLM Inference Series: 5. Dissecting model performance (medium.com)
In this post, we look deeper into the different types of bottlenecks that affect model latency and explain what arithmetic intensity is.
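Arithmetic intensity, the subject of the post above, is FLOPs performed per byte of memory traffic; it tells you whether a kernel is compute-bound or memory-bound. A minimal sketch for a matmul, assuming fp16 and the idealized lower bound where A and B are each read once and C is written once:

```python
def arithmetic_intensity(m, k, n, bytes_per_elem=2):
    """FLOPs per byte for C[m,n] = A[m,k] @ B[k,n] (idealized traffic)."""
    flops = 2 * m * k * n                          # one multiply + one add per MAC
    traffic = bytes_per_elem * (m * k + k * n + m * n)  # read A and B, write C
    return flops / traffic

# Large square matmul (prefill-like): high intensity, compute-bound
print(f"{arithmetic_intensity(4096, 4096, 4096):.0f} FLOPs/byte")  # ~1365
# Matrix-vector product (decoding one token): low intensity, memory-bound
print(f"{arithmetic_intensity(1, 4096, 4096):.2f} FLOPs/byte")     # ~1.00
```

The contrast is the key intuition: prefill can saturate the GPU's compute units, while token-by-token decoding is limited by memory bandwidth, which is also why the KV cache is worth optimizing.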
How GPT models work: for data scientists and ML engineers (Bea Stollnitz, bea.stollnitz.com)
The Transformer architecture of GPT models (Bea Stollnitz, bea.stollnitz.com)
Some intuitions about large language models (Jason Wei, www.jasonwei.net)
An open question these days is why large language models work so well. In this blog post I will discuss six basic intuitions about large language models, many of them inspired by manually examining data.