24/02/25: OLMo
OLMo: Accelerating the Science of Language Models
OLMo: Accelerating the Science of Language Models
Language models (LMs) have become ubiquitous in both NLP research and in commercial product offerings. As their commercial importance has surged, the most powerful models have become closed off, gated behind proprietary interfaces, with important details o
arxiv.org
Understanding, Using, and Finetuning Gemma
Understanding, Using, and Finetuning Gemma - a Lightning Studio by sebastian
Gemma is Google’s latest open-weight LLM. This Studio explains shows you how to use Gemma through Lit-GPT and explains some of the unique design choices of Gemma compared to other LLMs.
lightning.ai
X의 Sebastian Raschka님(@rasbt)
Google's Gemma has been the topic of the week for both LLM researchers and users. My colleagues and I just ported the code to LitGPT, and we discovered some interesting surprises and model architecture details along the way: 1) Gemma uses a really large vo
twitter.com