'Daily-Trend-Review' 카테고리의 다른 글
2023/12/01: Accelerating Generative AI with PyTorch II: GPT, Fast (0) | 2023.12.01 |
---|---|
PagedAttention + vLLM (0) | 2023.11.30 |
2023/11/13: S-Lora 등 (0) | 2023.11.13 |
MBU(Model Bandwidth Utilization) (0) | 2023.11.11 |
2023/11/11: Sliding Window Attention(SWA) 메커니즘 (0) | 2023.11.11 |