Default runtime profile
Assistant creation can apply Aywa's production-oriented defaults while preserving explicit values imported or provided by the operator.
LLMOpenAI gpt-4.1 for main reasoning; gpt-4.1-mini for warm-transfer assistant routes.STTDeepgram nova-3 for streaming partial and final transcript events.TTSElevenLabs eleven_turbo_v2_5, Cartesia sonic-3, and OpenAI gpt-4o-mini-tts paths.Knowledge toolsGoogle Gemini 2.5 Flash Lite profile for query-style knowledge workflows.TelephonyBYO SIP trunk credentials and BYO phone number resources.LLM providers
The runtime accepts native and OpenAI-compatible model routes. Configure only what the deployment actually uses.
OpenAI
OPENAI_API_KEY, OPENAI_BASE_URL
Azure OpenAI
AZURE_OPENAI_API_KEY, AZURE_OPENAI_BASE_URL
Anthropic
ANTHROPIC_API_KEY, ANTHROPIC_BASE_URL
Google Gemini
GOOGLE_API_KEY, GOOGLE_BASE_URL
OpenAI-compatible
Groq, Mistral, Together, OpenRouter, DeepInfra, Perplexity, Cerebras, xAI, Anyscale, Inflection, or custom routes.
Streaming
LLM streaming is enabled by default so first-token and first-audio latency can be optimized.
Speech providers
Speech-to-text
Deepgram, AssemblyAI, Gladia, Speechmatics, and Soniox configuration hooks exist in the runtime config.
Text-to-speech
ElevenLabs, Cartesia, OpenAI, Deepgram Aura, PlayHT, Rime, LMNT, Azure Speech, Inworld, Smallest, Neuphonic, Hume, and MiniMax hooks exist in config.
VOICE_PIPELINE_ENABLED=true
VOICE_PIPELINE_STT_ENABLED=true
VOICE_PIPELINE_TTS_ENABLED=true
DEEPGRAM_MODEL=nova-3
DEEPGRAM_ENDPOINTING_MS=250
ELEVENLABS_API_KEY=...
CARTESIA_API_KEY=...
Telephony and web media
Telephony is intentionally BYO. The runtime owns the voice session above the edge, while FreeSWITCH, SIP trunks, or WebRTC adapters handle transport.
BYO SIPSIP REGISTER/INVITE, inbound SIP edge, in-dialog controls, DTMF, RTP ingress, and RTP playout.FreeSWITCHInbound hooks, lifecycle event hooks, transfer bridge, and lab originate adapter.Twilio / Telnyx / VonageProvider-facing webhook routes exist for telephony adapter coverage and compatibility work.WebSocketBidirectional web call transport for low-level browser or service audio clients.WebRTCLiveKit and Daily adapter paths with bridge boundary for assistant audio and microphone ingress.Data plane providers
Postgres
Documents, resources, logs, credentials, webhook jobs, and backup metadata.
Redis / Valkey
Active-call leases and job coordination for multi-node deployments.
S3 / R2 / MinIO
Recordings, stereo recordings, PCAP files, and durable artifact access.
ClickHouse
Append-only event mirror for p95/p99 latency, cost dashboards, and long-retention analytics.
Cost tracking
Calls can expose cost, costBreakdown, and costs[]. Configure
the rate card to audit provider spend without adding an Aywa platform markup line to call minutes.
RUNTIME_COST_PLATFORM_PER_MINUTE_USD=0
RUNTIME_COST_RATE_CARD_JSON='{
"stt": {"deepgram:nova-3": {"perMinute": 0}},
"llm": {"openai:gpt-4.1": {"promptPer1M": 0, "completionPer1M": 0}},
"tts": {"11labs:eleven_turbo_v2_5": {"per1KCharacters": 0}}
}'