SSE & chunked, passed through.
Server-sent events or chunked transfer, forwarded to your caller as your target produces them — no waiting for the whole response.
Streaming
Point a streaming target at EchoRelay and we forward its SSE or chunked response to your caller, chunk by chunk — built for LLM token streaming.
Built for AI
Server-sent events or chunked transfer, forwarded to your caller as your target produces them — no waiting for the whole response.
Stream an LLM’s tokens or a long-running inference job straight to the caller — incremental, not buffered to the end.
Set up the streaming endpoint with a sentence to your agent — it speaks EchoRelay’s MCP surface and ships the config for you.
A streaming endpoint carries a single stream target — clear, predictable behaviour.
Streaming runs through the same durable relay as everything else — your caller gets the stream while we keep auth, validation, and rate limits at the edge. A stream is billed by the size of the response it streams back — heartbeats are free, and a cut-short stream only bills for what was delivered.
No credit card required.