Inference Gateway · Six Modalities · 50+ Models
PHAROS
The inference layer for AI agents that don't sleep.
One parameter · three lanes
Tune cost against quality, per request.
No other gateway lets you pivot model class on a single key. Set quality per request. Get the exact tradeoff for that exact call.
Cheapest
Open-weights backends. Draft speed, bulk jobs, high-volume pipelines where cost dominates quality.
Stable Diffusion XL · Flux Schnell
~1.4s p50
quality: "cheapest"Balanced
The sensible default. Quality you can ship, cost you can justify. Covers most production traffic.
Flux Pro · Ideogram v2
~3.2s p50
quality: "balanced"Best
Frontier models for customer-facing output. Hero shots, covers, anything that represents your brand.
GPT-Image-1 · Flux Ultra
~5.8s p50
quality: "best"Six modalities · one base URL
Everything you call OpenAI for. Plus everything you don't.
All routes are OpenAI-compatible. Drop-in replacement, same SDK, same response shapes. Video is async (custom 202 + poll) because nobody returns a 30-second clip in under 30 seconds.
Drop-in replacement
Two lines change. Everything else stays.
Point your OpenAI client at api.mypharos.ai/v1. Swap the key. Done.
api.mypharos.ai/v1# Install the SDK (placeholder — SDK publish pending) npm install pharos-sdk # Or use the OpenAI SDK directly — it's fully compatible npm install openai # Set your key export PHAROS_API_KEY="pharos_sk_live_…" # Verify the endpoint is reachable curl https://api.mypharos.ai/v1/models \ -H "Authorization: Bearer $PHAROS_API_KEY"
The quality parameter is the only Pharos-specific extension. Everything else is verbatim OpenAI.
Routes through
Pricing · usage-based
No seat fees. No tier traps. Flat markup.
You pay 30% over what we pay upstream. Routing, retries, one consolidated invoice.
Get on the list
$1 of free credit.
Six modalities.
One key.
Drop your email. We'll send your key the moment Pharos opens. No card required. Pay only when you scale past a dollar.
No spam · unsubscribe in one click