Sebastian Raschka: Using Local Coding Agents — Open-Weight Models as an Alternative to Claude Code and Codex Subscriptions
Main thesis
Local coding harnesses with open-weight models are now good enough to replace most day-to-day Claude Code / Codex workflow. Raschka shows the concrete stack and benchmarks.
Context
The essay arrives just as Anthropic retired Sonnet 4 and Opus 4 on the API and GitHub Copilot announced Opus 4.6 fast deprecation — several readers had asked for a recipe to mitigate vendor lock-in. In parallel with Doubleword's analysis ('the gap between open and closed is shrinking') and the Marble blog ('minimal downside to switching to open models'), Raschka argues the investment is worth it now.
Why it matters
For ML engineers and indie developers this is a concrete howto — not marketing. For tech leads it's a usable argument in the 'fund self-hosted alongside enterprise subscription' discussion. Raschka has a reputation for not fudging benchmarks.
Details / arguments
- Open-weight models in a local harness reach ~80–90% of Claude Code quality on routine tasks
- Cost: a one-time GPU investment pays back within months of regular use
- Main gap remains on long-horizon agentic tasks and multi-file refactor
- Stack: local LLM serving + harness + MCP-compatible tools