Remote MCP endpoint

Let agents request cache estimates and rollout policy.

The endpoint is designed for paid token access. It exposes structured tools that help an agent reason about KV cache economics, rollout safety, and long-context reuse.

Open server-card

Hosted checkout activates when a payment URL is configured.

estimate_kv_cache_savings

Estimate reused prompt tokens, avoided prefill time, and rough GPU spend saved.

plan_lmcache_rollout

Create a rollout checklist covering vLLM, SGLang, storage tiers, cache invalidation, and observability.

generate_mcp_cache_policy

Produce cache rules that an agent workflow can read before routing long-context requests.