Software Downloads
Capacity Private Cloud is deployed using Kubernetes and Helm charts, providing a scalable, containerized architecture for speech recognition (ASR), text-to-speech (TTS), and voice biometrics capabilities. This article provides guidance on accessing platform software, developer SDKs, and integration tools.
Platform Deployment
The platform is distributed as Kubernetes Helm charts, enabling streamlined deployment and management of the complete speech technology stack.
Helm Charts Repository: https://github.com/lumenvox/helm-charts
To add the Helm repository and deploy:
helm repo add lumenvox https://lumenvox.github.io/helm-charts helm repo update helm search repo lumenvox helm install lumenvox lumenvox/lumenvox -f my-lumenvox-values.yaml -n lumenvox --create-namespace
Prerequisites:
- Kubernetes 1.33 or earlier
- Helm 3+
- External dependencies: PostgreSQL, MongoDB, Redis, and RabbitMQ
Minimum Testing Environment:
- 3 nodes with 8+ CPUs and 16GB RAM each
- MongoDB: 16GB memory, 300GB storage
- PostgreSQL: 8GB memory, 30GB storage
- RabbitMQ: 1 CPU, 2GB memory
- Redis: 5GB memory
For detailed installation guidance, see the Getting Started section of this Knowledge Base.
Developer Integration
For developers building voice-enabled applications, the platform provides SDKs and APIs to integrate speech capabilities into your solutions.
Go SDK
The Go SDK provides a comprehensive client library for building voice-enabled applications. It handles gRPC connectivity, session management, audio streaming, and result processing.
Repository: https://github.com/lumenvox/go-sdk
Installation:
go get github.com/lumenvox/go-sdk
Requirements: Go 1.26 or later, plus access to a Capacity Private Cloud API endpoint.
Supported Capabilities:
- Speech Recognition: ASR and transcription (streaming, batch, continuous)
- Speech Synthesis: Text-to-speech with inline text or SSML URLs
- Language Understanding: NLU operations including summarization and sentiment analysis
- Audio Analysis: Answering machine detection, call progress analysis, speaker identification, and language detection
- Text Processing: Normalization, redaction, grammar parsing, and intent detection
The repository includes 32 complete examples covering transcription, ASR, TTS, NLU tasks, and specialized analyses like diarization and call screening.
MRCP Connectivity
For organizations requiring MRCP (Media Resource Control Protocol) integration with existing telephony infrastructure, the MRCP API provides a Docker-based media server that bridges MRCP operations with the Capacity Private Cloud API.
Repository: https://github.com/lumenvox/mrcp-api
Quick Start:
cd docker docker-compose up -d
Key Configuration:
MEDIA_SERVER__DEPLOYMENT_ID- Your deployment IDMEDIA_SERVER__NUM_CHANNELS- Concurrent channel capacity (default: 200)MEDIA_SERVER__SERVER_IP- Host machine IP for client routingMEDIA_SERVER__LUMENVOX_API_ADDRESS- API service endpoint
Features:
- OAuth support for secure API connections
- Management API for runtime control (logging, maintenance mode)
- Audio recording with save-waveform functionality
- Custom certificate support for encrypted communications
The MRCP API enables seamless integration with media servers such as FreeSWITCH, Asterisk, and commercial platforms that support MRCP.
Additional Resources
- Getting Started Guide - Platform deployment and configuration
- Integration Documentation - Detailed integration guides
- Product Glossary - Definitions of terms used in this documentation
For technical support inquiries, see When to Contact Support.
