https://www.anyscale.com/blog/reproducible-performance-metrics-for-llm-inference
Quantitative performance metrics for LLMs (a measurement sketch follows the list)
- Completed requests per minute
- TTFT (Time To First Token)
- ITL (Inter-Token Latency)
- End-to-End Latency
- Cost/request
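
The per-request metrics above can all be read off the timestamps of a streaming response. Below is a minimal sketch, assuming a hypothetical `stream_completion` client that yields tokens as the server streams them and a placeholder per-output-token price; it only illustrates the definitions, not any particular benchmarking tool.

```python
import time

# Minimal sketch: derive TTFT, ITL, end-to-end latency, and cost per request
# from the arrival times of streamed tokens. `stream_completion` is a
# hypothetical client callable; swap in your actual streaming API call.
def measure_request(stream_completion, prompt, price_per_output_token=0.0):
    t_start = time.perf_counter()
    token_times = []
    for _token in stream_completion(prompt):
        token_times.append(time.perf_counter())

    if not token_times:
        raise RuntimeError("no tokens were streamed back")

    ttft = token_times[0] - t_start               # Time To First Token
    e2e_latency = token_times[-1] - t_start       # End-to-End Latency
    # Inter-Token Latency: mean gap between consecutive generated tokens
    gaps = [b - a for a, b in zip(token_times, token_times[1:])]
    itl = sum(gaps) / len(gaps) if gaps else 0.0
    # Cost/request, counting output tokens only (placeholder price)
    cost = len(token_times) * price_per_output_token
    return {"ttft_s": ttft, "itl_s": itl, "e2e_s": e2e_latency, "cost": cost}
```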
Additional metrics for dedicated instances (a throughput sketch follows the list)
- Configuration
  - 8 replicas with 1 GPU each --> lowest TTFT (data parallelism only)
  - 1 replica with 8 GPUs --> highest throughput (8x the memory BW)
- Output token throughput
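
Output token throughput is an aggregate measure: the total number of tokens generated across all requests divided by the wall-clock duration of the run. A minimal sketch under that definition (the function name and inputs are assumptions, not from the article):

```python
# Sketch of the aggregate "output token throughput" metric: sum the output
# token counts from every request in the run and divide by the run's
# wall-clock duration. With 8 single-GPU replicas the per-replica counts are
# summed here; a single 8-GPU replica contributes one larger count.
def output_token_throughput(per_request_token_counts, wall_clock_seconds):
    total_tokens = sum(per_request_token_counts)
    return total_tokens / wall_clock_seconds  # tokens/s for the whole deployment
```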
Benchmark results (a concurrency-sweep sketch follows the list)
- Completed queries/minute vs concurrent requests
- Time to First Token(TTFT) vs concurrent requests
- Inter-Token Latency(ITL) vs concurrent requests
- End-to-End Latency vs concurrent requests
- Cost per thousand requests
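
Each of the plots above varies the number of concurrent requests. A hedged sketch of such a sweep, assuming a hypothetical async `send_request` coroutine and a placeholder GPU-hour price; it reports completed queries/minute and cost per thousand requests for each concurrency level.

```python
import asyncio
import time

GPU_HOUR_PRICE = 2.0  # USD per GPU-hour; placeholder, not a quoted price

async def sweep(send_request, prompts, concurrencies, num_gpus=1):
    """Run the prompt set at each concurrency level and aggregate metrics."""
    results = {}
    for c in concurrencies:
        start = time.perf_counter()
        completed = 0
        # Fire batches of `c` concurrent requests until the prompts run out.
        for i in range(0, len(prompts), c):
            batch = prompts[i:i + c]
            await asyncio.gather(*(send_request(p) for p in batch))
            completed += len(batch)
        elapsed = time.perf_counter() - start

        queries_per_min = completed / (elapsed / 60)
        # Cost per thousand requests = total GPU cost of the run,
        # divided by completed requests, scaled to 1,000 requests.
        cost_per_1k = (GPU_HOUR_PRICE * num_gpus) * (elapsed / 3600) / completed * 1000
        results[c] = {
            "queries_per_min": queries_per_min,
            "cost_per_1k_requests": cost_per_1k,
        }
    return results
```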