Daily-Trend-Review
2023/05/29: State of GPT, Voyager, LLaMA-Adapter 등
hellcat
2023. 5. 29. 06:28
1. AI Canon
source: https://a16z.com/2023/05/25/ai-canon/
2. State of GPT
source: https://build.microsoft.com/en-US/sessions/db3f4859-cd30-4445-a0cd-553c3304f8e2
3. VOYAGER: An Open-Ended Embodied Agent with Large Language Models
source: https://arxiv.org/pdf/2305.16291.pdf
4. LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
source: https://arxiv.org/pdf/2303.16199.pdf
5. QLORA: Efficient Finetuning of Quantized LLMs
source: https://arxiv.org/pdf/2305.14314.pdf
6. Why we should train smaller LLMs on more tokens
source: https://www.harmdevries.com/post/model-size-vs-compute-overhead/
7. Scaling Data-Constrained Language Models
source: https://arxiv.org/pdf/2305.16264.pdf
8. Deploying Large NLP Models: Infrastructure Cost Optimization
source: https://neptune.ai/blog/nlp-models-infrastructure-cost-optimization