The cost intelligence + agent credibility layer for AI agents.
@computegauge/mcp
The cost intelligence + agent credibility layer for AI agents. Install once, every session is cost-aware and credibility-building.
npm install -g @computegauge/mcp
Why This Exists
AI agents are powerful but expensive. A single Claude Code session can cost $5-50+. A multi-agent workflow can burn through $100 in minutes. And nobody tells the agent — or the human — until the bill arrives.
ComputeGauge MCP makes agents cost-aware AND gives them a credibility score. Any MCP-compatible agent (Claude, Cursor, Windsurf, custom agents) gets:
- Cost Intelligence — Know which model is optimal for every task, track spend in real-time
- Agent Credibility — Build a reputation score (0-1000) by making smart decisions. Compete on a leaderboard.
- Local→Cloud Routing — Detect when local inference isn't good enough, route to cloud, earn credibility for smart routing
The result: agents that spend 40-70% less, build visible credibility, and know when to route to cloud.
Setup — 30 Seconds
Claude Desktop / Claude Code
Add to ~/.config/claude/claude_desktop_config.json:
{
"mcpServers": {
"computegauge": {
"command": "npx",
"args": ["-y", "@computegauge/mcp"]
}
}
}
That's it. Restart Claude. Every conversation now has cost intelligence + credibility tracking.
With Provider API Keys (Enhanced)
{
"mcpServers": {
"computegauge": {
"command": "npx",
"args": ["-y", "@computegauge/mcp"],
"env": {
"ANTHROPIC_API_KEY": "sk-ant-...",
"OPENAI_API_KEY": "sk-...",
"COMPUTEGAUGE_BUDGET_TOTAL": "50"
}
}
}
}
With Local Inference (Ollama, vLLM, etc.)
{
"mcpServers": {
"computegauge": {
"command": "npx",
"args": ["-y", "@computegauge/mcp"],
"env": {
"OLLAMA_HOST": "http://localhost:11434",
"OLLAMA_MODELS": "llama3.3:70b,qwen2.5:7b,deepseek-r1:14b",
"ANTHROPIC_API_KEY": "sk-ant-...",
"COMPUTEGAUGE_BUDGET_TOTAL": "50"
}
}
}
}
Cursor
Add to Cursor MCP settings:
{
"computegauge": {
"command": "npx",
"args": ["-y", "@computegauge/mcp"]
}
}
Tools Reference
Agent-Native Tools (use automatically every session)
| Tool | When to Call | What It Does | Credibility |
|---|---|---|---|
pick_model |
Before any API request | Returns the optimal model for a task | +8 Routing Intelligence |
log_request |
After any API request | Logs the request cost | +3 Honest Reporting |
session_cost |
Every 5-10 requests | Shows cumulative cost and budget | — |
rate_recommendation |
After completing a task | Rate how well the model performed | +5 Quality Contribution |
model_ratings |
When curious about quality | View model quality leaderboard | — |
improvement_cycle |
At session end | Run continuous improvement engine | +15 Quality Contribution |
integrity_report |
For transparency | View rating acceptance/rejection stats | — |
Credibility Tools (the reputation protocol)
| Tool | When to Call | What It Does | Credibility |
|---|---|---|---|
credibility_profile |
Anytime | View your 0-1000 credibility score, tier, badges | — |
credibility_leaderboard |
To compete | See how you rank vs other agents | — |
route_to_cloud |
After local→cloud routing | Report smart routing decision | +70 Cloud Routing |
assess_routing |
Before choosing local vs cloud | Should this task stay local? | — |
cluster_status |
To check local capabilities | View local endpoints, models, hardware | — |
Intelligence Tools (for user questions)
| Tool | Description |
|---|---|
get_spend_summary |
User's total AI spend across all providers |
get_budget_status |
Budget utilization and alerts |
get_model_pricing |
Current pricing for any model |
get_cost_comparison |
Compare costs for specific workloads |
suggest_savings |
Actionable cost optimization recommendations |
get_usage_trend |
Spend trends and anomaly detection |
Resources
| Resource | URI | Description |
|---|---|---|
| Config | computegauge://config |
Current server configuration |
| Session | computegauge://session |
Real-time session cost data |
| Ratings | computegauge://ratings |
Model quality leaderboard |
| Credibility | computegauge://credibility |
Agent credibility profile + leaderboard |
| Cluster | computegauge://cluster |
Local inference cluster status |
| Quickstart | computegauge://quickstart |
Agent onboarding guide |
Prompts
| Prompt | Description |
|---|---|
cost_aware_system |
System prompt that makes any agent cost-aware + credibility-building |
daily_cost_report |
Generate a quick daily cost report |
optimize_workflow |
Analyze and optimize a described AI workflow |
Agen
Tools (18)
pick_modelReturns the optimal model for a tasklog_requestLogs the request costsession_costShows cumulative cost and budgetrate_recommendationRate how well the model performedmodel_ratingsView model quality leaderboardimprovement_cycleRun continuous improvement engineintegrity_reportView rating acceptance/rejection statscredibility_profileView your 0-1000 credibility score, tier, and badgescredibility_leaderboardSee how you rank vs other agentsroute_to_cloudReport smart routing decisionassess_routingDetermine if a task should stay localcluster_statusView local endpoints, models, and hardwareget_spend_summaryUser's total AI spend across all providersget_budget_statusBudget utilization and alertsget_model_pricingCurrent pricing for any modelget_cost_comparisonCompare costs for specific workloadssuggest_savingsActionable cost optimization recommendationsget_usage_trendSpend trends and anomaly detectionEnvironment Variables
ANTHROPIC_API_KEYAPI key for Anthropic servicesOPENAI_API_KEYAPI key for OpenAI servicesCOMPUTEGAUGE_BUDGET_TOTALTotal budget limit for cost trackingOLLAMA_HOSTURL for local Ollama instanceOLLAMA_MODELSList of available local modelsConfiguration
{"mcpServers": {"computegauge": {"command": "npx", "args": ["-y", "@computegauge/mcp"]}}}