.hvia avatar on a GPU, a LiveKit Agent running a full voice AI pipeline (STT → LLM → TTS), and a connected test client where you can speak to your avatar in real time.
Prerequisites
Confirm you have the following before you begin:
- Docker & Docker Compose installed on your server
- NVIDIA GPU (CUDA 12, min 6 GB VRAM) with the NVIDIA Container Toolkit installed
- An Avaluma license key and at least one
.hviaavatar file - A LiveKit account (Cloud or self-hosted) with an API key and secret
Deploy the Avatar Server
Clone or download the Open Start the server:The Avatar Server is now available at http://localhost:8080. For production, add the optional Caddy reverse proxy included in
avatar-server example directory, then place your .hvia avatar files where the container can find them:avatar-server/docker-compose.yaml and set a secure password for the utility API:avatar-server/reverse_proxy/ to terminate TLS automatically.Configure the LiveKit Agent
Clone or download the Open Next, open The You do not need to modify this block — it reads the values you set in
livekit-agent example directory. Copy the environment template:.env.local and fill in your credentials:If you are using Avaluma’s managed Avatar Server, set
AVATAR_SERVER_URL=https://api.avaluma.ai. If you deployed your own server in Step 1, use its public URL.agents/1-agent-with-livekit-inference/agent-1.py and set avatar_id to the filename of your .hvia file without the .hvia extension:AvatarSession block in that file wires the voice pipeline to the Avatar Server:.env.local and avatar_id automatically.Start the Agent
Run only the conversational agent (To start both example agents at once, omit the service name:
agent-1):Test the Connection
With all three services running — Avatar Server, LiveKit server, and the agent — connect a client to your LiveKit room.Your avatar should appear in the video track and respond to your voice in real time.
- Avaluma Test Client
- LiveKit Agent Playground
Open the Avaluma Test Client in your browser. It works with any LiveKit setup — self-hosted or cloud. Enter your LiveKit server URL and credentials, then click Connect to start speaking to your avatar.
Next Steps
Architecture
Learn how the Avatar Server and LiveKit Agent interact under the hood.
Avatar Server
Configure GPU resources, run multiple avatars simultaneously, and set up HTTPS.
LiveKit Agent
Swap in different STT, LLM, and TTS models and add custom agent logic.
External Audio
Stream audio from any external service directly to the avatar without an AgentSession.
