Human Agent

Give your AI agent a face — a lifelike visual avatar that speaks, listens, and expresses emotion with ultra-low-latency synchronized animation.

Overview

The Human Agent is an end-to-end conversational AI application that combines speech-to-speech (STS) with a lifelike animated face powered by Ojin's avatar model, ojin/oris-portrait. Instead of wiring up speech-to-text (STT), large language model (LLM), text-to-speech (TTS), and avatar services yourself, you create an agent in the Ojin dashboard and embed a single widget on your site. Your users then get a live, ultra-low-latency conversation with a fully animated avatar.

Key Features

  • Lifelike visual avatar — your agent has a face with synchronized lip movements and natural expressions

  • End-to-end conversational AI — speech in, speech out, no pipeline assembly required

  • Ultra-low latency — real-time WebRTC transport for audio, video, and signaling

  • Managed or bring-your-own stack — Ojin manages the full speech-to-speech stack, or you can bring your own provider

  • Drop-in widget — one HTML tag to embed the agent in any web page

  • Dashboard configuration — create, configure, and monitor agents without writing code

Agent Modes

Ojin Agent (managed)

You configure the personality and appearance — system prompt, face, voice, and behaviour. Ojin handles everything else: the conversational pipeline, avatar rendering, and infrastructure. You never see or manage the underlying providers.

Third-Party Agent

You bring your own speech-to-speech provider — Hume, ElevenLabs Agents, or Ultravox. Ojin adds the visual avatar layer and runs the agent through Ojin infrastructure. You supply your provider API key and config ID in the dashboard; Ojin handles the rest.
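To make the difference between the two modes concrete, here is a minimal sketch of the settings each one needs, written as a plain Python data model. It is not an Ojin API type: the class name `AgentConfig`, the mode labels, the field names, and the provider spellings are all hypothetical, and it assumes a face is chosen in both modes. Agents are actually configured in the dashboard.

```python
from dataclasses import dataclass
from typing import List, Optional

# Providers named in the text above; these string spellings are assumptions.
SUPPORTED_PROVIDERS = {"hume", "elevenlabs", "ultravox"}


@dataclass
class AgentConfig:
    """Hypothetical model of an agent's dashboard settings (not an Ojin type)."""
    mode: str                                  # "managed" or "third_party" (assumed labels)
    face: str                                  # avatar appearance, assumed to apply in both modes
    system_prompt: Optional[str] = None        # managed mode
    voice: Optional[str] = None                # managed mode
    provider: Optional[str] = None             # third-party mode
    provider_api_key: Optional[str] = None     # third-party mode
    provider_config_id: Optional[str] = None   # third-party mode


def missing_fields(cfg: AgentConfig) -> List[str]:
    """Return the names of the settings the chosen mode still needs."""
    if cfg.mode == "managed":
        required = ["system_prompt", "voice"]
    elif cfg.mode == "third_party":
        required = ["provider", "provider_api_key", "provider_config_id"]
    else:
        raise ValueError(f"unknown mode: {cfg.mode!r}")
    missing = [name for name in required if not getattr(cfg, name)]
    # Flag a provider that is set but not one of the three named above.
    if cfg.mode == "third_party" and cfg.provider and cfg.provider not in SUPPORTED_PROVIDERS:
        missing.append("provider (unsupported)")
    return missing
```

In this sketch, a third-party agent with only a provider selected reports the provider API key and config ID as outstanding, which mirrors the two values the text says you supply in the dashboard.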

How It Works

  1. Create an agent in the Ojin dashboard — pick a mode, configure it, go online

  2. Get your agent ID from the agent settings page

  3. Embed the widget on your site or call the Session API from your backend

  4. Your users interact via WebRTC — audio, video, and real-time avatar in a single connection
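For the backend path in step 3, the call might be prepared along these lines. This is only a sketch under stated assumptions: this overview does not give the Session API's host, path, payload, or authentication scheme, so the placeholder base URL, the `/sessions` path, the `agent_id` field, and the bearer-token header are all hypothetical. Check the Session API Reference for the real values. The function builds the request without sending it.

```python
import json
import urllib.request

# Placeholder only: the real Session API host is not given in this overview.
PLACEHOLDER_BASE_URL = "https://api.example.invalid"


def build_session_request(agent_id: str, api_key: str,
                          base_url: str = PLACEHOLDER_BASE_URL) -> urllib.request.Request:
    """Prepare (but do not send) a session-creation request for one agent."""
    if not agent_id or not api_key:
        raise ValueError("agent_id and api_key are both required")
    # Hypothetical payload: the agent ID copied from the agent settings page.
    body = json.dumps({"agent_id": agent_id}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/sessions",                # hypothetical path
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
    )
```

Building the request on the server keeps the API key out of the browser, which fits the text's suggestion to call the Session API from your backend. Passing the result to `urllib.request.urlopen` would send it once the real endpoint is filled in.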

Use Cases

  • Customer Support — let customers talk to a lifelike agent instead of a chatbot

  • Sales — greet and qualify leads with a conversational avatar

  • Education — build interactive tutors with natural speech and expressions

  • Healthcare — create empathetic virtual health assistants

  • Reception — deploy a digital receptionist on your website or kiosk

Pricing

Agent sessions are metered based on usage. Visit ojin.ai/pricing for details on plans and per-session costs.

Quick Start

  1. Create an API key — set up authentication for the Ojin platform

  2. Create & configure your agent — set up appearance, voice, and behaviour

  3. Widget Integration — drop the agent into your web page

  4. Session API Reference — for custom integrations beyond the widget
