Large language model MPT-30B – Fast Developer World

MPT-30B is part of the Mosaic Pretrained Transformer (MPT) model family, which uses a transformer architecture optimized for efficient training and inference, and is trained from scratch on 1T tokens of English text and code. This model uses the MosaicML LLM codebase, and was pre-trained, fine-tuned, and inferenced on the MosaicML platform by MosaicML’s NLP team. MPT-30B Features: Commercially available Train on large amounts of data Ready to handle extremely long inputs due to ALiBi. Enables fast training and inference via llm-…

#Large #language #model #MPT30B

Leave a Comment Cancel Reply