promptzip automatically injects prompt caching into every request. One line of code. No changes to your app. You only pay when you save.
# Before
import openai
client = openai.OpenAI(api_key="sk-...")
# After — one change, that's it
from promptzip import optimize
client = optimize(openai.OpenAI(api_key="pz-..."), proxy_url="https://api.promptzip.io")
# Everything else stays the same
response = client.chat.completions.create(
model="gpt-4o",
messages=[
{"role": "system", "content": system_prompt}, # ← auto-cached
{"role": "user", "content": user_message},
]
)
print(client.savings_report())
# {"cached_tokens": 23520, "saved_percent": 99.4, "target_reached": true}pip install promptzip, wrap your client. Takes 60 seconds.
client = optimize(openai.OpenAI(api_key="pz-..."))
Our proxy automatically adds cache_control to every system prompt. No Anthropic expertise needed.
// proxy injects cache_control // on every request ✓
Real-time dashboard shows tokens saved, money saved, cache hit rate.
99.4% cached $63/day saved
Claude Sonnet 4.6 · 10 requests · 2,365-token system prompt
Without promptzip
0%
cached tokens
$70.9/day
at 10k req/day
With promptzip
99.4%
cached tokens
$7.1/day
at 10k req/day
That's $1,890/month saved for a typical customer support bot.
You only pay when we save you money. No savings, no charge.
20%
of what we save you
No credit card required to start