Benchmarks
Each model was tested on 8 cases — 4 English, 4 German — spanning short sentences, medium paragraphs, and email-length texts with intentional typos.
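The tables below report an "Exact Match" and an "Errors Fixed" percentage for each model. The source does not spell out how these are computed, so the following is only a minimal sketch of one plausible scoring scheme. The function names, the `(wrong, right)` pair format for planted typos, and the word-level matching rule are all assumptions made for illustration, not the benchmark's actual code.

```python
import re


def exact_match_rate(outputs, references):
    """Percentage of cases whose corrected output equals the reference text."""
    matches = sum(out.strip() == ref.strip() for out, ref in zip(outputs, references))
    return 100.0 * matches / len(references)


def errors_fixed_rate(outputs, planted_errors):
    """Percentage of planted typos that were repaired.

    planted_errors[i] is a list of (wrong, right) word pairs for case i
    (hypothetical format). A typo counts as fixed when the wrong form is
    gone from the output and the right form is present.
    """
    fixed = 0
    total = 0
    for out, errors in zip(outputs, planted_errors):
        words = re.findall(r"\w+", out)  # \w is Unicode-aware, so umlauts work
        for wrong, right in errors:
            total += 1
            if wrong not in words and right in words:
                fixed += 1
    return 100.0 * fixed / total


outputs = ["I have a cat.", "She go home."]
references = ["I have a cat.", "She goes home."]
planted = [[("hav", "have")], [("gos", "goes")]]

print(exact_match_rate(outputs, references))
print(errors_fixed_rate(outputs, planted))
```

Under a scheme like this, a model can fix most typos and still miss an exact match, for example by rephrasing a sentence, which would explain why the two columns diverge.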
English
| Model | Exact Match (%) | Errors Fixed (%) | Time (ms) | Memory (MB) |
|---|---|---|---|---|
| OpenRouter gpt-4o-mini | 75% | 100% | 1,526 | 225 |
| GRMR V3 3B | 50% | 98% | 2,746 | 3,496 |
| GRMR V3 4B | 75% | 98% | 3,910 | 4,118 |
| Gemma 3 4B Instruct | 25% | 92% | 4,411 | 4,119 |
| Mistral 7B Instruct v0.3 | 50% | 92% | 6,517 | 7,666 |
| Harper | 50% | 90% | 100 | 187 |
German
| Model | Exact Match (%) | Errors Fixed (%) | Time (ms) | Memory (MB) |
|---|---|---|---|---|
| OpenRouter gpt-4o-mini | 75% | 100% | 1,501 | 226 |
| Mistral 7B Instruct v0.3 | 50% | 95% | 9,395 | 7,666 |
| GRMR V3 4B | 0% | 68% | 4,532 | 4,118 |
| GRMR V3 3B | 0% | 32% | 4,157 | 3,496 |
| Gemma 3 4B Instruct | 0% | 32% | 4,728 | 4,119 |
| Harper | 0% | 0% | 278 | 202 |
What this means
The local models fall into three categories. Harper is a rule-based grammar checker: it matches words against a dictionary and applies fixes almost instantly, but it only knows English and struggles with context-dependent errors. In effect, it is a linter. The GRMR models are small LLMs fine-tuned specifically for grammar correction. They understand context and fix more errors than Harper, but they were trained primarily on English data, so their German accuracy is limited. The general-purpose models (Gemma, Mistral) are larger instruction-following LLMs that use a system prompt to correct text. They aren't specialized for grammar, and their broader training data can give them better multilingual support, although in these tests only Mistral carried that over to German (95% of errors fixed, versus 32% for Gemma).
OpenRouter's GPT-4o-mini fixes every single error across both languages because it is a much larger model running on powerful remote hardware. The tradeoff is that your text leaves your device and you need an API key.
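For local English-only use, GRMR V3 4B (2.5 GB) is the recommended choice: it fixes 98% of errors, ties for the highest exact-match rate (75%), and has a measured time of 3,910 ms. For multilingual use, Mistral 7B (4.7 GB) is the only local model that handles German well, fixing 95% of errors. If hardware is not a constraint and sending text off the device is acceptable, OpenRouter's gpt-4o-mini gave the best results in both languages.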
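The recommendations above can be reproduced mechanically from the benchmark tables. The sketch below copies the table values into a list and picks the best candidate under a few constraints. The `Result` type, the `recommend` function, the `local` flag (inferred from the prose), and the ranking order (errors fixed, then exact match, then speed) are assumptions chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Result:
    model: str
    language: str      # "en" or "de"
    exact_match: int   # percent
    errors_fixed: int  # percent
    time_ms: int
    memory_mb: int
    local: bool        # assumption: everything except OpenRouter runs locally


RESULTS = [
    Result("OpenRouter gpt-4o-mini", "en", 75, 100, 1526, 225, False),
    Result("GRMR V3 3B", "en", 50, 98, 2746, 3496, True),
    Result("GRMR V3 4B", "en", 75, 98, 3910, 4118, True),
    Result("Gemma 3 4B Instruct", "en", 25, 92, 4411, 4119, True),
    Result("Mistral 7B Instruct v0.3", "en", 50, 92, 6517, 7666, True),
    Result("Harper", "en", 50, 90, 100, 187, True),
    Result("OpenRouter gpt-4o-mini", "de", 75, 100, 1501, 226, False),
    Result("Mistral 7B Instruct v0.3", "de", 50, 95, 9395, 7666, True),
    Result("GRMR V3 4B", "de", 0, 68, 4532, 4118, True),
    Result("GRMR V3 3B", "de", 0, 32, 4157, 3496, True),
    Result("Gemma 3 4B Instruct", "de", 0, 32, 4728, 4119, True),
    Result("Harper", "de", 0, 0, 278, 202, True),
]


def recommend(language: str, local_only: bool = True,
              max_memory_mb: Optional[int] = None) -> Optional[Result]:
    """Return the best-scoring result that satisfies the constraints, or None."""
    candidates = [
        r for r in RESULTS
        if r.language == language
        and (r.local or not local_only)
        and (max_memory_mb is None or r.memory_mb <= max_memory_mb)
    ]
    if not candidates:
        return None
    # Rank by errors fixed, then exact match, then lower time.
    return max(candidates, key=lambda r: (r.errors_fixed, r.exact_match, -r.time_ms))


print(recommend("en").model)
print(recommend("de").model)
print(recommend("en", local_only=False).model)
```

Adding a memory cap, for instance `recommend("en", max_memory_mb=500)`, narrows the local English candidates to Harper, the only local option in the table that stays under that limit.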