Simon Willison on mistral

66 posts tagged “mistral”

Mistral AI release both openly licensed and API-hosted Language Models.

2023

Release llm-mistral 0.2 — LLM plugin providing access to Mistral models using the Mistral API

15th Dec 2023, 5:05 am · llm, mistral

Release llm-mistral 0.1 — LLM plugin providing access to Mistral models using the Mistral API

15th Dec 2023, 4:04 am · llm, mistral

Mixtral of experts (via) Mistral have firmly established themselves as the most exciting AI lab outside of OpenAI, arguably more exciting because much of their work is released under open licenses.

On December 8th they tweeted a link to a torrent, with no additional context (a neat marketing trick they’ve used in the past). The 87GB torrent contained a new model, Mixtral-8x7b-32kseqlen—a Mixture of Experts.

Three days later they published a full write-up, describing “Mixtral 8x7B, a high-quality sparse mixture of experts model (SMoE) with open weights”—licensed Apache 2.0.

They claim “Mixtral outperforms Llama 2 70B on most benchmarks with 6x faster inference”—and that it outperforms GPT-3.5 on most benchmarks too.
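The "sparse" part of that description is what makes the inference speed claim plausible: a router sends each token to only a few of the experts, so most of the model's expert parameters sit idle for any given token. Here is a minimal pure-Python sketch of that routing idea. It assumes top-2 routing over 8 toy experts with a softmax over the selected router scores; the function names, the dimensions, and the "experts" themselves (simple scalings) are hypothetical stand-ins for illustration, not Mistral's actual implementation.

```python
import math
import random


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k=2):
    """Pick the k highest-scoring experts and normalise their weights.

    Returns a list of (expert_index, weight) pairs whose weights sum to 1.
    """
    ranked = sorted(range(len(router_logits)),
                    key=lambda i: router_logits[i], reverse=True)
    top = ranked[:k]
    weights = softmax([router_logits[i] for i in top])
    return list(zip(top, weights))


def make_expert(scale):
    """Toy expert (assumption): multiplies every element by `scale`.

    A real expert would be a feed-forward network.
    """
    def expert(x):
        return [scale * v for v in x]
    return expert


def sparse_moe_layer(token, router_weights, experts, k=2):
    """Run one token through only the k experts chosen by the router."""
    # One router score per expert: dot product of the token with that expert's row.
    logits = [sum(w * v for w, v in zip(row, token)) for row in router_weights]
    routes = top_k_route(logits, k)
    out = [0.0] * len(token)
    for idx, weight in routes:
        y = experts[idx](token)  # only the selected experts are evaluated
        out = [o + weight * v for o, v in zip(out, y)]
    return out, [idx for idx, _ in routes]


if __name__ == "__main__":
    random.seed(0)
    num_experts, dim = 8, 4  # illustrative sizes only
    experts = [make_expert(i + 1) for i in range(num_experts)]
    router_weights = [[random.uniform(-1, 1) for _ in range(dim)]
                      for _ in range(num_experts)]
    token = [0.5, -1.0, 0.25, 2.0]
    output, used = sparse_moe_layer(token, router_weights, experts)
    print("experts used:", used)
    print("output:", output)
```

In this sketch only two of the eight expert functions are ever called per token, which is where the compute saving comes from. The router and all eight experts still have to be held in memory.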

This isn’t even their current best model. The new Mistral API platform (currently on a waitlist) refers to Mixtral as “Mistral-small” (and their previous 7B model as “Mistral-tiny”) and also provides access to a currently closed model, “Mistral-medium”, which they claim to be competitive with GPT-4.

# 11th December 2023, 5:20 pm / ai, generative-ai, gpt-4, local-llms, llms, mistral, llm-release, gpt

llamafile is the new best way to run an LLM on your own computer

Mozilla’s innovation group and Justine Tunney just released llamafile, and I think it’s now the single best way to get started running Large Language Models (think your own local copy of ChatGPT) on your own computer.

[... 650 words]

# 29th November 2023, 8:54 pm / mozilla, ai, generative-ai, cosmopolitan, llama, local-llms, llms, mistral, llamafile, justine-tunney, llama-cpp

MonadGPT (via) “What would have happened if ChatGPT was invented in the 17th century? MonadGPT is a possible answer.

MonadGPT is a finetune of Mistral-Hermes 2 on 11,000 early modern texts in English, French and Latin, mostly coming from EEBO and Gallica.

Like the original Mistral-Hermes, MonadGPT can be used in conversation mode. It will not only answer in an historical language and style but will use historical and dated references.”

# 27th November 2023, 4:01 am / ai, generative-ai, llms, mistral

The EU AI Act now proposes to regulate “foundational models”, i.e. the engine behind some AI applications. We cannot regulate an engine devoid of usage. We don’t regulate the C language because one can use it to develop malware. Instead, we ban malware and strengthen network systems (we regulate usage). Foundational language models provide a higher level of abstraction than the C language for programming computer systems; nothing in their behaviour justifies a change in the regulatory framework.

— Arthur Mensch, Mistral AI

# 16th November 2023, 11:29 am / politics, ai, llms, mistral


Simon Willison’s Weblog
