Efficient and Economic Large Language Model Inference with Attention Offloading
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond
How Good Are the Latest Open LLMs? And Is DPO Better Than PPO?