Daily-Trend-Review

24/02/25: OLMo

hellcat 2024. 2. 25. 09:56

OLMo: Accelerating the Science of Language Models

 

OLMo: Accelerating the Science of Language Models

Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have become closed off, gated behind proprietary interfaces, with important details o

arxiv.org

 

Understanding, Using, and Finetuning Gemma

 

Understanding, Using, and Finetuning Gemma - a Lightning Studio by sebastian

Gemma is Google’s latest open-weight LLM. This Studio explains shows you how to use Gemma through Lit-GPT and explains some of the unique design choices of Gemma compared to other LLMs.

lightning.ai

Sebastian Raschka's 트윗

 

X의 Sebastian Raschka님(@rasbt)

Google's Gemma has been the topic of the week for both LLM researchers and users. My colleagues and I just ported the code to LitGPT, and we discovered some interesting surprises and model architecture details along the way: 1) Gemma uses a really large vo

twitter.com