Troubleshooting

Connection Issues

  • ✓ Verify API key and config ID

  • ✓ Check that config exists in dashboard

  • ✓ Ensure network allows WebSocket connections

No Audio Output

  • ✓ Verify text is sent as binary InteractionInput with payload type 0 (text), not 1 (audio)

  • ✓ Confirm you sent EndInteraction JSON after the text — the model waits for this signal

  • ✓ Check output format handling: PCM int16, 24 kHz mono (not 16 kHz — Oris Voice uses 24 kHz; persona video realtime APIs may differ)

Garbled or Distorted Audio

  • ✓ Ensure audio is decoded as 24,000 Hz sample rate (not 16,000 Hz)

  • ✓ Verify little-endian byte order for PCM samples

  • ✓ Check that you're reading the correct number of bytes per payload (from the payload size header)

Voice Clone Doesn't Sound Right

  • ✓ Use 5–15 seconds of clean reference audio (longer is not better)

  • ✓ WAV format preferred for clone reference

  • ✓ Avoid reference audio with background noise or music

Last updated

Was this helpful?