Human Agent
Give your AI agent a face — a lifelike visual avatar that speaks, listens, and expresses emotion with ultra-low-latency synchronized animation.
Overview
The Human Agent is an end-to-end conversational AI application that combines speech-to-speech (STS) with a lifelike animated face powered by Ojin's avatar model ojin/oris-portrait. Instead of wiring up STT, LLM, TTS, and avatar services yourself, you create an agent in the Ojin dashboard, embed a single widget on your site, and your users get a live, ultra-low-latency conversation with a fully animated avatar.
Key Features
Lifelike visual avatar — your agent has a face with synchronized lip movements and natural expressions
End-to-end conversational AI — speech in, speech out, no pipeline assembly required
Ultra-low latency — real-time WebRTC transport for audio, video, and signaling
No pipeline assembly required — Ojin manages the full speech-to-speech stack (or bring your own provider)
Drop-in widget — one HTML tag to embed the agent in any web page
Dashboard configuration — create, configure, and monitor agents without writing code
Agent Modes
Ojin Agent (managed)
You configure the personality and appearance — system prompt, face, voice, and behaviour. Ojin handles everything else: the conversational pipeline, avatar rendering, and infrastructure. You never see or manage the underlying providers.
Third-Party Agent
You bring your own speech-to-speech provider — Hume, ElevenLabs Agents, or Ultravox. Ojin adds the visual avatar layer and runs the agent through Ojin infrastructure. You supply your provider API key and config ID in the dashboard; Ojin handles the rest.
How It Works
Create an agent in the Ojin dashboard — pick a mode, configure it, go online
Get your agent ID from the agent settings page
Embed the widget on your site or call the Session API from your backend
Your users interact via WebRTC — audio, video, and real-time avatar in a single connection
Use Cases
Customer Support — let customers talk to a lifelike agent instead of a chatbot
Sales — greet and qualify leads with a conversational avatar
Education — build interactive tutors with natural speech and expressions
Healthcare — create empathetic virtual health assistants
Reception — deploy a digital receptionist on your website or kiosk
Pricing
Agent sessions are metered based on usage. Visit ojin.ai/pricing for details on plans and per-session costs.
Quick Start
Create an API key — set up authentication for the Ojin platform
Create & configure your agent — set up appearance, voice, and behaviour
Widget Integration — drop the agent into your web page
Session API Reference — for custom integrations beyond the widget
Last updated
Was this helpful?