Research Papers in February 2024

Once again, this has been an exciting month in AI research. This month, I'm covering two new openly available LLMs, insights into small finetuned LLMs, and a new parameter-efficient LLM finetuning technique. The two LLMs stand out for several reasons. One (OLMo) is completely open source, meaning that everything from the training code to the dataset to the log files is openly shared. The other (Gemma) also comes with openly available weights and achieves state-of-the-art performance on several benchmarks, outperforming popular LLMs of similar size, such as Llama 2 7B and Mistral 7B, by a large margin.

Published on March 02, 2024 22:00

Sebastian Raschka's Blog

