See what your prompts cost across GPT, Claude, and Gemini, then cut that cost. Use the free calculator below, or install the MCP server that does it automatically inside Claude and Cursor.
mcp-token-optimizer · works with any MCP client · MIT
Add this to your MCP config (claude_desktop_config.json or .cursor/mcp.json):
{
  "mcpServers": {
    "token-optimizer": {
      "command": "npx",
      "args": ["-y", "mcp-token-optimizer"]
    }
  }
}
Then ask: "slim this system prompt and show what I'd save at 50k calls a month" or "which model is cheapest for this prompt?"
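For a sense of the arithmetic behind a question like "what would I save at 50k calls a month", here is a minimal sketch. It is not the server's actual implementation; the function name `monthly_savings`, the token counts, and the $3.00 per million input tokens price are all hypothetical placeholders, so substitute your own measurements and your provider's current pricing.

```python
def monthly_savings(tokens_before, tokens_after, calls_per_month,
                    usd_per_million_input_tokens):
    """Dollars saved per month by trimming a prompt's input tokens."""
    saved_tokens = (tokens_before - tokens_after) * calls_per_month
    return saved_tokens * usd_per_million_input_tokens / 1_000_000


# Hypothetical numbers: a 1,200-token system prompt trimmed to 800 tokens,
# sent 50,000 times a month, at a placeholder price of $3.00 per million
# input tokens (not a real price for any model).
saved = monthly_savings(1200, 800, 50_000, 3.00)
print(f"${saved:.2f} saved per month, ${saved * 12:.2f} per year")
# prints "$60.00 saved per month, $720.00 per year"
```

Only input tokens are counted here, since slimming a system prompt leaves output length unchanged.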
Exact token count + cost across models.
Per-call + monthly/yearly spend.
Compress prompts, measure $ saved.
Find the cheapest capable model.
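The features above boil down to a few multiplications once token counts are known. The sketch below shows one way to compute per-call, monthly, and yearly spend and to pick the cheapest option. Everything in it is an assumption for illustration: the model names `model-a` and `model-b`, their prices, and the helper names are placeholders, not the tool's API or any vendor's real pricing. It also ranks by price only; deciding whether a model is capable enough for the task is a separate judgment the code does not attempt.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Price:
    """USD per million tokens (placeholder values, not real pricing)."""
    input_usd_per_mtok: float
    output_usd_per_mtok: float


# Hypothetical price table; replace with current numbers from each provider.
PRICES = {
    "model-a": Price(input_usd_per_mtok=3.00, output_usd_per_mtok=15.00),
    "model-b": Price(input_usd_per_mtok=0.50, output_usd_per_mtok=1.50),
}


def cost_per_call(price, input_tokens, output_tokens):
    """Cost in USD of one call with the given token counts."""
    total = (input_tokens * price.input_usd_per_mtok
             + output_tokens * price.output_usd_per_mtok)
    return total / 1_000_000


def spend(price, input_tokens, output_tokens, calls_per_month):
    """Per-call, monthly, and yearly spend in USD."""
    per_call = cost_per_call(price, input_tokens, output_tokens)
    monthly = per_call * calls_per_month
    return {"per_call": per_call, "monthly": monthly, "yearly": monthly * 12}


def cheapest(prices, input_tokens, output_tokens):
    """Name of the lowest-cost model for this prompt shape (price only)."""
    return min(
        prices,
        key=lambda name: cost_per_call(prices[name], input_tokens, output_tokens),
    )


if __name__ == "__main__":
    # Example prompt shape: 1,000 input tokens, 200 output tokens, 50k calls/month.
    for name, price in PRICES.items():
        s = spend(price, 1000, 200, 50_000)
        print(f"{name}: ${s['per_call']:.4f}/call, "
              f"${s['monthly']:.2f}/month, ${s['yearly']:.2f}/year")
    print("cheapest:", cheapest(PRICES, 1000, 200))
```

Token counts are taken as inputs here; getting exact counts requires each provider's own tokenizer, which is outside this sketch.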