A single Voice Bot platform that receives and processes calls from all popular communication channels and syncs data in real time
Customers call from any channel: Zalo, WhatsApp, Telephone, Viber, Messenger, or Web Widget
Voice Bot recognizes speech, understands intent, queries data, and responds with natural voice
All call data is synced to CRM, Call Center, Analytics, and backend systems
Complete AI call center solution for both incoming and outgoing calls, automating 80% of customer service processes
Receive & process incoming calls
Voice Bot receives incoming calls from customers, automatically identifies intent, looks up information, and responds with natural voice. When needed, the bot transfers to the appropriate agent with full conversation context.
AI-powered automatic call routing system, understanding customer intent through voice instead of key presses
Bot automatically handles 80% of common questions like order tracking, business hours, return policies
When beyond capability, bot transfers calls to the right agent with a conversation summary
Supports Vietnamese, English, Japanese, and Korean, with native pronunciation for each region
Identity verification via voice biometrics or automatic OTP, ensuring security
Never miss a call: the bot receives and processes calls at any time, including after business hours
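
The inbound flow described above (detect intent, answer common questions, hand off with context when a request is out of scope) can be sketched roughly as follows. This is an illustration only, not AntcoAI's implementation: `classifyIntent`, the keyword rules, the canned answers, and the shape of the returned object are all hypothetical stand-ins for the platform's NLU and routing.

```javascript
// Hypothetical sketch of inbound routing; none of these names come from the AntcoAI SDK.

// Canned answers for intents the bot can handle by itself (assumed examples).
const ANSWERS = {
  order_tracking: 'Your order is on its way.',
  business_hours: 'We are open 8:00 to 18:00, Monday to Saturday.',
  return_policy: 'Items can be returned within 7 days of delivery.',
};

// Toy keyword classifier standing in for a real NLU model.
function classifyIntent(utterance) {
  const text = utterance.toLowerCase();
  if (text.includes('order')) return 'order_tracking';
  if (text.includes('open') || text.includes('hours')) return 'business_hours';
  if (text.includes('return') || text.includes('refund')) return 'return_policy';
  return 'unknown';
}

// Reply when the intent is known; otherwise transfer with the transcript as context.
function handleTurn(utterance, history = []) {
  const intent = classifyIntent(utterance);
  const transcript = [...history, utterance];
  if (intent in ANSWERS) {
    return { action: 'reply', intent, text: ANSWERS[intent] };
  }
  return {
    action: 'transfer',
    intent,
    // A real system would likely summarize with an LLM; here the summary is the joined transcript.
    summary: transcript.join(' | '),
  };
}

console.log(handleTurn('Where is my order?'));
console.log(handleTurn('I want to dispute a charge', ['Hello']));
```

The first call is answered by the bot; the second falls through to a transfer that carries the conversation so far, which is the part an agent would see on pickup.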
Two script creation modes — drag-and-drop visual Workflow or natural language description with Prompt Agent. No coding required
Describe scenarios in natural language — AI automatically creates a complete flow. Or let the Agent autonomously handle calls end-to-end
Detailed analytics dashboard for every call — sentiment, intent, agent performance, and business insights
STT, TTS, and LLM models are developed in-house and optimized for Vietnamese accuracy, with support for self-hosting on your own infrastructure
| Criteria | AntcoAI | VNPT AI | FPT.AI | Google STT |
|---|---|---|---|---|
| STT Accuracy (Vi): standard Vietnamese | 98.2% | 94.5% | 95.1% | 93.8% |
| STT Accuracy (Vi dialects): regional dialects | 96.5% | 88.2% | 89.7% | 85.4% |
| STT Latency (Streaming): first token response | ~80ms | ~250ms | ~200ms | ~150ms |
| TTS Naturalness (MOS): Mean Opinion Score out of 5 | 4.6 | 4.0 | 4.1 | 4.3 |
| TTS Latency: time to first audio byte | ~60ms | ~300ms | ~250ms | ~180ms |
| LLM Response (Vi): Vietnamese response quality | 9.2/10 | 7.5/10 | 7.8/10 | 8.5/10 |
| LLM Latency: time to first token | ~90ms | ~400ms | ~350ms | ~200ms |
| End-to-end Latency: STT → LLM → TTS pipeline | ~280ms | ~1200ms | ~900ms | ~600ms |
| Self-hosted Support: on-premise deployment | Full | Limited | Limited | N/A |
| Vietnamese Voices: number of voices | 20+ | 8 | 10 | 4 |
| Concurrent Calls: concurrent calls per node | 500+ | 100 | 150 | 200 |
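
As a rough capacity-planning illustration based on the concurrent-calls row above: if load spreads evenly and each node sustains the quoted number of calls, the node count is a ceiling division. The function name and the `headroomPercent` reserve are assumptions made for this sketch, not vendor sizing guidance.

```javascript
// Rough node-count estimate; an illustrative sketch, not vendor sizing guidance.
// callsPerNode: concurrent calls one node sustains (the table quotes 500+ for AntcoAI).
// headroomPercent: share of capacity kept free for spikes (assumed value, tune for your load).
function nodesNeeded(peakConcurrentCalls, callsPerNode = 500, headroomPercent = 20) {
  if (peakConcurrentCalls < 0 || callsPerNode <= 0 || headroomPercent < 0 || headroomPercent >= 100) {
    throw new RangeError('invalid capacity parameters');
  }
  // Integer arithmetic avoids floating-point surprises around exact multiples.
  const usableTimes100 = callsPerNode * (100 - headroomPercent);
  return Math.ceil((peakConcurrentCalls * 100) / usableTimes100);
}

console.log(nodesNeeded(1200));      // 500-call nodes with 20% headroom
console.log(nodesNeeded(1200, 100)); // 100-call nodes with 20% headroom
```

With 20% headroom a 500-call node is planned at 400 calls, so a 1,200-call peak needs 3 nodes, while 100-call nodes would need 15.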
Deploy on your own infrastructure
All STT, TTS, and LLM models are developed in-house by the AntcoAI team and support on-premise or private cloud deployment. Data never leaves your infrastructure, which helps ensure compliance with data security regulations.
| Component | P50 | P90 | P99 |
|---|---|---|---|
| STT (Streaming): first partial result | 45ms | 78ms | 120ms |
| STT (Final): complete transcription | 80ms | 135ms | 210ms |
| NLU Intent: intent classification | 12ms | 25ms | 45ms |
| LLM Response: first token generation | 65ms | 95ms | 150ms |
| LLM Complete: full response generated | 180ms | 320ms | 500ms |
| TTS Synthesis: first audio chunk | 35ms | 60ms | 95ms |
| TTS Complete: 10s audio generated | 85ms | 140ms | 220ms |
| End-to-end: user speaks → bot replies | 280ms | 420ms | 650ms |
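
To relate the per-component figures to the end-to-end row, one can sum the stages that plausibly sit on the critical path of a streaming pipeline. Which stages belong on that path is an assumption here (final STT, intent classification, LLM first token, TTS first chunk). The source does not state it, and percentiles do not add exactly, so treat the result as a rough budget check only.

```javascript
// P50 latencies in ms, copied from the table above.
const p50 = {
  sttFinal: 80,
  nluIntent: 12,
  llmFirstToken: 65,
  ttsFirstChunk: 35,
  endToEnd: 280,
};

// Assumed critical path for a streaming pipeline (not stated in the source).
const criticalPath = ['sttFinal', 'nluIntent', 'llmFirstToken', 'ttsFirstChunk'];

const componentSum = criticalPath.reduce((total, stage) => total + p50[stage], 0);
// Whatever the components do not explain: network, buffering, orchestration, and so on.
const unaccounted = p50.endToEnd - componentSum;

console.log(`components: ${componentSum} ms, unaccounted: ${unaccounted} ms`);
```

Under these assumptions the four stages account for 192 ms of the 280 ms median, leaving 88 ms for everything the table does not break out.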

    docker run -d --gpus all \
      -p 8080:8080 \
      -e LICENSE_KEY=YOUR_LICENSE_KEY \
      -v /data/models:/models \
      antcoai/voice-bot:latest \
      --stt-model /models/stt-vi-v3 \
      --tts-model /models/tts-vi-v3 \
      --llm-model /models/llm-vi-7b \
      --max-concurrent 500

Integrate Voice Bot into any platform with just a few lines of code: Web, React Native, iOS, Android, and Desktop (Electron)
Integrate Voice Bot directly into websites with a voice call widget, supporting WebRTC

    import { AntcoVoiceBot } from '@antcoai/voice-web-sdk';

    // Configure the widget; replace the placeholder key and bot ID with your own.
    const bot = new AntcoVoiceBot({
      apiKey: 'YOUR_API_KEY',
      botId: 'your-bot-id',
      language: 'vi-VN',
      position: 'bottom-right'
    });

    // Log each message the bot sends.
    bot.on('message', (msg) => {
      console.log('Bot:', msg.text);
    });

    // Attach the widget to the #voice-widget element.
    bot.mount('#voice-widget');

Voice Bot AI has been successfully deployed at many large enterprises across various industries
Explore and test Voice AI features with an intuitive interface
98% STT accuracy
Context & intent understanding
Natural-sounding TTS
Processing under 100ms
Contact us for consultation and deployment of the Voice Bot solution that fits your business