Llemma: An Open Language Model For Mathematics

CommunityNews · 18 October 2023 23:23

Llemma: An Open Language Model For Mathematics.
ArXiv | Models | Data | Code | Blog | Sample Explorer
Today we release Llemma: 7 billion and 34 billion parameter language models for mathematics. The Llemma models were initialized with Code Llama weights, then trained on the Proof-Pile II, a 55 billion token dataset of mathematical and scientific documents. The resulting models show improved mathematical capabilities, and can be adapted to various tasks through prompting or additional fine-tuning.

Read in full here:

This thread was posted by one of our members via one of our news source trackers.