Daily-Trend-Review
2023/10/27: transformer-math
hellcat
2023. 10. 27. 18:01
Transformer Math 101
We present basic math related to computation and memory usage for transformers.
blog.eleuther.ai
MemGPT: Towards LLMs As Operating Systems
Understanding the Performance of Transformer Inference
Abstract: The state of the art results in natural language processing tasks have been obtained by scaling up transformer-based machine learning models, which can have more than a hundred billion parameters. Training and deploying these models can be difficu
dspace.mit.edu
Efficient Memory Management for Large Language Model Serving with PagedAttention