On Thursday, Anthropic released Claude Opus 4 and Claude Sonnet 4, marking the company’s return to larger model releases after primarily focusing on mid-range Sonnet variants since June of last year.
Users running a quantized 7B model on a laptop expect 40+ tokens per second. A 30B MoE model on a high-end mobile device ...
Anthropic PBC today debuted its newest large language model, Claude Sonnet 4.5, and a toolkit for building artificial intelligence agents. The company describes the LLM as the world’s best coding ...
Goose acts as the agent that plans, iterates, and applies changes. Ollama is the local runtime that hosts the model. Qwen3-coder is the coding-focused LLM that generates results. If you've been ...
Obsidian is already great, but my local LLM makes it better ...
Use the vitals package with ellmer to evaluate and compare the accuracy of LLMs, including writing evals to test local models.