Releasing Cohere North Mini Code
Cohere officially released North Mini Code with Hugging Face weights, OpenCode access, and vLLM deployment notes.
Cohere’s Jay Alammar announced the official release of North Mini Code after early community feedback from r/LocalLLaMA. Weights are available on Hugging Face, including an fp8 version, and the model can be tried for free through OpenCode. For vLLM deployment, Cohere recommends using vLLM main for now and installing cohere_melody for accurate response parsing, while noting community requests for quantization and llama.cpp support.
Cohere team member Jay Alammar posted on r/LocalLLaMA to announce the official release of North Mini Code. This release builds on early feedback the community had provided for a previously unreleased version. Cohere said it received a substantial amount of user testing feedback over the weekend, so the post not only announces the official launch but also answers earlier questions from the community and provides more details about the model itself and deployment options.
Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.
See Pro plans →Want the original English / full article?
Read on r/LocalLLaMA top day →Related
Summaries are AI-generated; the original article is authoritative.