Aywa Runtime Documentation

Providers

Bring your own model, speech, telephony, and data plane.

Aywa Runtime is designed to route between providers while keeping credentials in the deployed runtime environment and preserving a single call/session contract.

Default runtime profile

Assistant creation can apply Aywa's production-oriented defaults while preserving explicit values imported or provided by the operator.

LLM: OpenAI gpt-4.1 for main reasoning; gpt-4.1-mini for warm-transfer assistant routes.
STT: Deepgram nova-3 for streaming partial and final transcript events.
TTS: ElevenLabs eleven_turbo_v2_5, Cartesia sonic-3, and OpenAI gpt-4o-mini-tts paths.
Knowledge tools: Google Gemini 2.5 Flash Lite profile for query-style knowledge workflows.
Telephony: BYO SIP trunk credentials and BYO phone number resources.

LLM providers

The runtime accepts native and OpenAI-compatible model routes. Configure only what the deployment actually uses.

OpenAI

OPENAI_API_KEY, OPENAI_BASE_URL

Azure OpenAI

AZURE_OPENAI_API_KEY, AZURE_OPENAI_BASE_URL

Anthropic

ANTHROPIC_API_KEY, ANTHROPIC_BASE_URL

Google Gemini

GOOGLE_API_KEY, GOOGLE_BASE_URL

OpenAI-compatible

Groq, Mistral, Together, OpenRouter, DeepInfra, Perplexity, Cerebras, xAI, Anyscale, Inflection, or custom routes.
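
As a rough illustration of "configure only what the deployment actually uses", the sketch below reports which of the native providers listed above have credentials set. The environment variable names come from this page; the `configured_providers` helper and its return shape are hypothetical and not part of the runtime, and treating the base URL as optional is an assumption.

```python
import os
from typing import Dict, Mapping, Optional

# Variable names are taken from the provider list above.
PROVIDER_ENV = {
    "openai": ("OPENAI_API_KEY", "OPENAI_BASE_URL"),
    "azure_openai": ("AZURE_OPENAI_API_KEY", "AZURE_OPENAI_BASE_URL"),
    "anthropic": ("ANTHROPIC_API_KEY", "ANTHROPIC_BASE_URL"),
    "google_gemini": ("GOOGLE_API_KEY", "GOOGLE_BASE_URL"),
}


def configured_providers(
    env: Mapping[str, str] = os.environ,
) -> Dict[str, Dict[str, Optional[str]]]:
    """Return providers whose API key variable is set and non-blank.

    Hypothetical helper: it reports the key variable name, never the secret.
    """
    found: Dict[str, Dict[str, Optional[str]]] = {}
    for name, (key_var, url_var) in PROVIDER_ENV.items():
        if not env.get(key_var, "").strip():
            continue  # provider not configured for this deployment
        found[name] = {
            "api_key_var": key_var,
            # Assumption: an unset base URL means "use the provider default".
            "base_url": env.get(url_var) or None,
        }
    return found
```

Running this as a preflight step before deployment can catch a missing key early, without printing any credential values.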

Streaming

LLM streaming is enabled by default so first-token and first-audio latency can be optimized.
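
To make first-token latency concrete, here is a minimal sketch that times how long a streamed response takes to yield its first non-empty chunk. It works on any iterable of text chunks; the `measure_first_token` name and the injectable `clock` parameter are illustrative assumptions, not runtime APIs.

```python
import time
from typing import Callable, Iterable, List, Optional, Tuple


def measure_first_token(
    chunks: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[Optional[float], str]:
    """Consume a stream and return (seconds until first non-empty chunk, full text)."""
    start = clock()
    first_token_s: Optional[float] = None
    parts: List[str] = []
    for chunk in chunks:
        # Empty keep-alive chunks do not count as the first token.
        if first_token_s is None and chunk:
            first_token_s = clock() - start
        parts.append(chunk)
    return first_token_s, "".join(parts)
```

With a non-streaming response the same measurement would only complete once the whole completion has arrived, which is why streaming matters for first-audio latency in a voice pipeline.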

Speech providers

Speech-to-text

The runtime config includes hooks for Deepgram, AssemblyAI, Gladia, Speechmatics, and Soniox.

Text-to-speech

The runtime config includes hooks for ElevenLabs, Cartesia, OpenAI, Deepgram Aura, PlayHT, Rime, LMNT, Azure Speech, Inworld, Smallest, Neuphonic, Hume, and MiniMax.

Speech env
VOICE_PIPELINE_ENABLED=true
VOICE_PIPELINE_STT_ENABLED=true
VOICE_PIPELINE_TTS_ENABLED=true
DEEPGRAM_MODEL=nova-3
DEEPGRAM_ENDPOINTING_MS=250
ELEVENLABS_API_KEY=...
CARTESIA_API_KEY=...
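
The speech variables above can be checked before startup with a small preflight reader like the one sketched here. The variable names are from the example; the `SpeechEnv` type, the `read_speech_env` helper, the "only `true` enables a flag" rule, and the fallback values are assumptions for illustration, not documented runtime behavior.

```python
from dataclasses import dataclass
from typing import Mapping


@dataclass(frozen=True)
class SpeechEnv:
    pipeline_enabled: bool
    stt_enabled: bool
    tts_enabled: bool
    deepgram_model: str
    deepgram_endpointing_ms: int


def _flag(env: Mapping[str, str], name: str) -> bool:
    # Assumption: only the literal "true" (any case) enables a flag.
    return env.get(name, "").strip().lower() == "true"


def read_speech_env(env: Mapping[str, str]) -> SpeechEnv:
    """Parse the speech variables, failing fast on a malformed endpointing value."""
    # Fallbacks mirror the example values above; they are assumed, not documented.
    raw_ms = env.get("DEEPGRAM_ENDPOINTING_MS", "250").strip()
    if not raw_ms.isdigit():
        raise ValueError(
            f"DEEPGRAM_ENDPOINTING_MS must be a non-negative integer, got {raw_ms!r}"
        )
    return SpeechEnv(
        pipeline_enabled=_flag(env, "VOICE_PIPELINE_ENABLED"),
        stt_enabled=_flag(env, "VOICE_PIPELINE_STT_ENABLED"),
        tts_enabled=_flag(env, "VOICE_PIPELINE_TTS_ENABLED"),
        deepgram_model=env.get("DEEPGRAM_MODEL", "nova-3"),
        deepgram_endpointing_ms=int(raw_ms),
    )
```

Calling `read_speech_env(os.environ)` at boot (after `import os`) surfaces a typo such as `DEEPGRAM_ENDPOINTING_MS=fast` as an immediate error rather than a mid-call failure.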

Telephony and web media

Telephony is intentionally BYO. The runtime owns the voice session above the edge, while FreeSWITCH, SIP trunks, or WebRTC adapters handle transport.

BYO SIP: SIP REGISTER/INVITE, inbound SIP edge, in-dialog controls, DTMF, RTP ingress, and RTP playout.
FreeSWITCH: Inbound hooks, lifecycle event hooks, transfer bridge, and lab originate adapter.
Twilio / Telnyx / Vonage: Provider-facing webhook routes exist for telephony adapter coverage and compatibility work.
WebSocket: Bidirectional web call transport for low-level browser or service audio clients.
WebRTC: LiveKit and Daily adapter paths with a bridge boundary for assistant audio and microphone ingress.

Data plane providers

Postgres

Documents, resources, logs, credentials, webhook jobs, and backup metadata.

Redis / Valkey

Active-call leases and job coordination for multi-node deployments.

S3 / R2 / MinIO

Recordings, stereo recordings, PCAP files, and durable artifact access.

ClickHouse

Append-only event mirror for p95/p99 latency, cost dashboards, and long-retention analytics.
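
For readers unfamiliar with p95/p99, the sketch below computes a nearest-rank percentile over a list of latency samples. It only illustrates what the dashboard figures mean; it is not how ClickHouse computes quantiles, and the `percentile` helper is hypothetical.

```python
import math
from typing import Sequence


def percentile(samples: Sequence[float], p: float) -> float:
    """Nearest-rank percentile: the smallest sample with at least p% of samples at or below it."""
    if not samples:
        raise ValueError("samples must not be empty")
    if not 0 < p <= 100:
        raise ValueError("p must be in the range (0, 100]")
    ordered = sorted(samples)
    rank = math.ceil(p * len(ordered) / 100)  # 1-based rank
    return ordered[rank - 1]


# Example: 100 latency samples of 1..100 ms.
latencies_ms = list(range(1, 101))
p95 = percentile(latencies_ms, 95)
p99 = percentile(latencies_ms, 99)
```

Tail percentiles are more useful than averages for voice latency because a small share of slow turns dominates how a call feels.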

Cost tracking

Calls can expose cost, costBreakdown, and costs[]. Configure the rate card to audit provider spend without adding an Aywa platform markup line to call minutes.

Rate card
RUNTIME_COST_PLATFORM_PER_MINUTE_USD=0
RUNTIME_COST_RATE_CARD_JSON='{
  "stt": {"deepgram:nova-3": {"perMinute": 0}},
  "llm": {"openai:gpt-4.1": {"promptPer1M": 0, "completionPer1M": 0}},
  "tts": {"11labs:eleven_turbo_v2_5": {"per1KCharacters": 0}}
}'
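
The following sketch shows how a rate card shaped like the one above could be applied to one call's usage. The rate-card keys (`perMinute`, `promptPer1M`, `completionPer1M`, `per1KCharacters`) are from the example; the `estimate_call_cost` helper, the entries inside `costBreakdown`, and the non-zero rates are placeholders invented for illustration, not real provider prices or the runtime's actual calculation.

```python
import json
from typing import Any, Dict, Mapping


def estimate_call_cost(
    rate_card: Mapping[str, Any],
    *,
    stt_key: str,
    llm_key: str,
    tts_key: str,
    minutes: float,
    prompt_tokens: int,
    completion_tokens: int,
    tts_characters: int,
    platform_per_minute: float = 0.0,
) -> Dict[str, Any]:
    """Hypothetical helper: apply per-unit rates to a single call's usage."""
    stt = rate_card["stt"][stt_key]["perMinute"] * minutes
    llm_rates = rate_card["llm"][llm_key]
    llm = (
        prompt_tokens / 1_000_000 * llm_rates["promptPer1M"]
        + completion_tokens / 1_000_000 * llm_rates["completionPer1M"]
    )
    tts = tts_characters / 1_000 * rate_card["tts"][tts_key]["per1KCharacters"]
    platform = platform_per_minute * minutes
    # Breakdown entry names are assumed; only "cost" and "costBreakdown" appear above.
    breakdown = {
        "stt": round(stt, 6),
        "llm": round(llm, 6),
        "tts": round(tts, 6),
        "platform": round(platform, 6),
    }
    return {"cost": round(sum(breakdown.values()), 6), "costBreakdown": breakdown}


# Placeholder rates for illustration only.
example_card = json.loads("""{
  "stt": {"deepgram:nova-3": {"perMinute": 0.01}},
  "llm": {"openai:gpt-4.1": {"promptPer1M": 2.0, "completionPer1M": 8.0}},
  "tts": {"11labs:eleven_turbo_v2_5": {"per1KCharacters": 0.05}}
}""")

example_cost = estimate_call_cost(
    example_card,
    stt_key="deepgram:nova-3",
    llm_key="openai:gpt-4.1",
    tts_key="11labs:eleven_turbo_v2_5",
    minutes=3,
    prompt_tokens=10_000,
    completion_tokens=2_000,
    tts_characters=1_500,
)
```

With `platform_per_minute` left at 0, as in `RUNTIME_COST_PLATFORM_PER_MINUTE_USD=0`, the total reflects provider spend only.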