Troubleshooting
Last updated
Was this helpful?
✓ Verify API key and config ID
✓ Check that config exists in dashboard
✓ Ensure network allows WebSocket connections
✓ Verify text is sent as binary InteractionInput with payload type 0 (text), not 1 (audio)
✓ Confirm you sent EndInteraction JSON after the text — the model waits for this signal
✓ Check output format handling: PCM int16, 24 kHz mono (not 16 kHz — Oris Voice uses 24 kHz; persona video realtime APIs may differ)
✓ Ensure audio is decoded as 24,000 Hz sample rate (not 16,000 Hz)
✓ Verify little-endian byte order for PCM samples
✓ Check that you're reading the correct number of bytes per payload (from the payload size header)
✓ Use 5–15 seconds of clean reference audio (longer is not better)
✓ WAV format preferred for clone reference
✓ Avoid reference audio with background noise or music
Last updated
Was this helpful?
Was this helpful?