Prometheus metrics available

Observability is a cornerstone of any production-grade speech platform. Capacity Private Cloud exposes a comprehensive set of Prometheus metrics across every microservice, giving operations teams deep visibility into system health, resource utilization, and request-level performance. Whether you are building Grafana dashboards, configuring alerting rules, or capacity planning for scale, these metrics provide the telemetry foundation you need.

The metrics below follow Prometheus conventions: counters track cumulative totals that only increase, gauges represent point-in-time values that rise and fall, histograms capture request duration and size distributions across configurable buckets, and gauge vectors expose labeled gauge values partitioned by response type or status.

A downloadable PDF reference is also available: Prometheus Metrics Reference (PDF)


Deployment

MeasurementDescriptionType
deployment_active_countNumber of active deploymentsgauge
deployment_total_responses_returnedNumber of responses returned by the containergauge vector
deployment_active_requestsTotal number of active requestsgauge
deployment_average_request_process_timeDistribution of average request processing timehistogram
deployment_max_requestsMaximum number of simultaneous deployment requestsgauge
deployment_total_requestsTotal number of deployment requestscounter

VAD (Voice Activity Detection)

MeasurementDescriptionType
vad_audio_streams_currentActive current VAD streamsgauge
vad_audio_streams_maxMaximum concurrent number of VAD streamsgauge
vad_audio_streams_totalTotal number of completed VAD streamscounter
vad_audio_timeout_totalTotal number of VAD stream timeoutscounter
vad_active_requestsTotal number of active requestsgauge
vad_total_cpa_requestsTotal number of CPA requests receivedcounter
vad_active_cpa_requestsTotal number of active CPA requestsgauge
vad_average_cpa_request_process_time_distDistribution of average CPA request processing timehistogram
vad_total_cpa_responses_returnedNumber of CPA responses returnedgauge vector
vad_total_amd_requestsTotal number of AMD requests receivedcounter
vad_active_amd_requestsTotal number of AMD active requestsgauge
vad_average_amd_request_process_time_distDistribution of average AMD request processing timehistogram
vad_total_amd_responses_returnedNumber of responses returnedgauge vector
vad_total_asr_requestsTotal number of ASR requests receivedcounter
vad_active_asr_requestsTotal number of active ASR requestsgauge
vad_average_asr_request_process_time_distDistribution of average ASR request processing timehistogram
vad_total_responses_returnedNumber of ASR responses returnedgauge vector
vad_total_transcription_requestsTotal number of transcription requests receivedcounter
vad_active_transcription_requestsTotal number of active transcription requestsgauge
vad_average_transcription_request_process_time_distDistribution of average transcription request processing timehistogram
vad_total_transcription_responses_returnedNumber of transcription responses returnedgauge vector
vad_stream_subscribe_durationHistogram of latencies for Redis stream subscriptionhistogram
vad_transcoding_durationHistogram of latencies for transcoding audio chunkshistogram
vad_processing_durationHistogram of latencies for engine processing of audio chunkshistogram

Session

MeasurementDescriptionType
session_total_requestsTotal number of requests receivedcounter
session_active_requestsTotal number of active requestsgauge
session_average_request_process_time_distDistribution of average request processing timehistogram
session_total_responses_returnedNumber of responses returned by the containergauge vector

ASR (Automatic Speech Recognition)

MeasurementDescriptionType
asr_total_asr_requestsTotal number of requests receivedcounter
asr_active_asr_requestsTotal number of active requestsgauge
asr_active_europa_requestsTotal number of active backend engine requestsgauge
asr_average_asr_request_process_time_distDistribution of average request processing timehistogram
asr_max_asr_requestsMaximum number of simultaneous ASR requestsgauge
asr_total_asr_responses_returnedNumber of responses returned by the containergauge vector
asr_total_grammar_requestsTotal number of grammar load requests receivedcounter
asr_active_grammar_load_requestsTotal number of active grammar load requestsgauge
asr_average_asr_stream_request_process_time_distDistribution of average stream request processing timehistogram
asr_max_concurrent_grammar_load_requestsMaximum number of simultaneously active grammar load requestsgauge
asr_average_grammar_load_request_process_time_distDistribution of average grammar load request processing timehistogram
asr_total_grammar_load_responses_returnedNumber of grammar load responses returned by the containergauge vector
asr_total_transcription_requestsTotal number of transcription requests receivedcounter
asr_active_transcription_requestsTotal number of active transcription requestsgauge
asr_total_transcription_responses_returnedNumber of transcription responses returnedgauge vector
asr_average_asr_batch_request_process_time_distDistribution of average batch request processing timehistogram
asr_average_transcription_batch_request_process_time_distDistribution of average transcription batch request processing timehistogram
asr_average_transcription_stream_request_process_time_distDistribution of average transcription stream request processing timehistogram
asr_max_transcription_requestsMaximum number of simultaneous transcription requestsgauge
asr_max_active_grammarsMaximum number of simultaneously active grammarsgauge
asr_max_active_parsesMaximum number of simultaneous active SISR parsesgauge
asr_active_decodesActive number of decodes being processedgauge
asr_active_grammarsActive number of grammars being processedgauge
asr_active_parsesActive number of SISR parses being processedgauge
asr_average_sisr_parse_text_request_process_time_distDistribution of average SISR parse request processing timehistogram
asr_sisr_parse_text_requests_totalTotal number of SISR parse requests receivedcounter
asr_total_sisr_parse_text_responses_returnedNumber of SISR parse responses returnedgauge vector
asr_total_ms_audio_pushedTotal milliseconds of audio pushed into ASRcounter
asr_cache_entriesNumber of entries currently present in the ASR grammar cachecounter
asr_cache_size_bytesCurrent size in bytes of the grammar cachecounter
asr_active_ms_audio_processingTotal milliseconds of audio currently being processed by ASRgauge
asr_fine_tuned_resultsASR fine-tuned model results (if enabled)gauge vector

TTS (Text-to-Speech)

MeasurementDescriptionType
tts_total_requestsTotal number of requests receivedcounter
tts_active_requestsTotal number of active requestsgauge
tts_average_request_process_time_distDistribution of average request processing timehistogram
tts_total_responses_returnedNumber of responses returned by the containergauge vector
tts_average_pending_queue_timeAverage time of requests queued for processinghistogram
tts_max_queue_size_synthesis_requests_tts1Maximum number of simultaneous TTS1 synthesis requestsgauge
tts_active_queue_size_synthesis_requests_tts1Current number of simultaneous TTS1 synthesis requestsgauge
tts_max_pending_requests_tts1Maximum number of pending TTS1 synthesis requestscounter
tts_preprocess_load_cache_resultsUsed internally for testinggauge
tts_postprocess_load_cache_resultsUsed internally for testinggauge
tts_max_queue_size_synthesis_requestsMaximum TTS requests per containergauge
tts_first_result_time_maxMaximum time between client making synthesis request and receiving first audio packethistogram
tts_first_result_time_minMinimum time between client making synthesis request and receiving first audio packethistogram

Resource Manager

MeasurementDescriptionType
resource_active_asr_installsActively installing ASR packagesgauge
resource_asr_download_attempts_counter_totalTotal number of ASR download attemptscounter
resource_asr_download_failure_counter_totalTotal number of failed ASR downloadscounter
resource_asr_download_success_counter_totalTotal number of successful ASR downloadscounter
resource_asr_language_packages_configuredNumber of ASR packages configured for the systemgauge
resource_tts_active_installsActively installing TTS packagesgauge
resource_tts_download_attempts_counter_totalTotal number of TTS download attemptscounter
resource_tts_download_failure_counter_totalTotal number of failed TTS downloadscounter
resource_tts_download_success_counter_totalTotal number of successful TTS downloadscounter
resource_tts_voice_packages_configuredNumber of TTS packages configured for the systemgauge
resource_active_vb_active_installsActively installing VB-Active packagesgauge
resource_vb_active_download_attempts_counter_totalTotal number of VB-Active download attemptscounter
resource_vb_active_download_failure_counter_totalTotal number of failed VB-Active downloadscounter
resource_vb_active_download_success_counter_totalTotal number of successful VB-Active downloadscounter
resource_vb_active_language_packages_configuredNumber of VB-Active packages configured for the systemgauge
resource_dnn_active_installsActively installing DNN packagesgauge
resource_dnn_download_attempts_counter_totalTotal number of DNN download attemptscounter
resource_dnn_download_failure_counter_totalTotal number of failed DNN downloadscounter
resource_dnn_download_success_counter_totalTotal number of successful DNN downloadscounter
resource_dnn_voice_packages_configuredNumber of DNN packages configured for the systemgauge
resource_itn_active_installsTotal number of ITN resource installscounter
resource_itn_download_attempts_counter_totalTotal number of ITN download attemptscounter

Licensing

MeasurementDescriptionType
license_invalid_check_ops_totalTotal number of unsuccessful license check eventscounter
license_sync_fail_ops_totalTotal number of unsuccessful license sync eventscounter
license_sync_ops_totalTotal number of attempted license sync eventscounter
license_sync_success_ops_totalTotal number of successful license sync eventscounter
license_valid_check_ops_totalTotal number of successful license check eventscounter
license_valid_licencesNumber of valid license deploymentsgauge
license_invalid_licencesNumber of invalid license deploymentsgauge

Configuration

MeasurementDescriptionType
configuration_total_requestsTotal number of requests receivedcounter
configuration_active_requestsTotal number of active requestsgauge
configuration_max_requestsMaximum number of configuration requestsgauge
configuration_average_request_process_time_distDistribution of average request processing timehistogram
configuration_total_responses_returnedNumber of responses returnedgauge vector

Binary Storage

MeasurementDescriptionType
binary_storage_total_requestsTotal number of requests receivedcounter
binary_storage_active_requestsTotal number of active requestsgauge
binary_storage_max_requestsMaximum number of binary storage requestsgauge
binary_storage_average_request_process_time_distDistribution of average request processing timehistogram

Admin Portal

MeasurementDescriptionType
admin_portal_total_requestsTotal number of admin portal requestscounter
admin_portal_average_request_process_time_distDistribution of average request processing timehistogram

Archive

MeasurementDescriptionType
archive_total_requestsTotal number of requests receivedcounter
archive_active_requestsTotal number of active requestsgauge
archive_average_request_process_time_distDistribution of average request processing timehistogram
archive_total_responses_returnedNumber of responses returnedgauge vector
archive_active_executionTotal number of active archive requests currently being executedgauge
archive_requests_maxMaximum number of archive requests receivedgauge
archive_total_executeTotal number of archive requests executedcounter

Deployment Portal

MeasurementDescriptionType
deployment_portal_total_requestsTotal number of deployment portal requestscounter
deployment_portal_active_requestsTotal number of active deployment portal requestsgauge
deployment_portal_requests_maxMaximum number of deployment portal requestsgauge
deployment_portal_average_request_process_time_distDistribution of average request processing timehistogram
deployment_portal_total_responses_returnedNumber of responses returnedgauge vector

Reporting

MeasurementDescriptionType
reporting_total_requestsTotal number of requests receivedcounter
reporting_active_requestsTotal number of active requestsgauge
reporting_average_request_process_time_distDistribution of average request processing timehistogram
reporting_requests_maxMaximum number of reporting requestsgauge

LumenVox API

MeasurementDescriptionType
lumenvox_api_total_requestsTotal number of initial requests receivedcounter
lumenvox_api_total_requests_within_sessionsTotal number of API requests within sessionscounter
lumenvox_api_active_requestsTotal number of active requestsgauge
lumenvox_api_total_responses_returnedNumber of responses returned by the containergauge vector
lumenvox_api_rmq_messages_receivedNumber of LumenVox API RabbitMQ messages receivedcounter
lumenvox_api_rmq_messages_sentNumber of LumenVox API RabbitMQ messages sentcounter

ITN (Inverse Text Normalization)

MeasurementDescriptionType
itn_request_timesITN request timeshistogram
itn_requests_currentActive ITN requestsgauge
itn_requests_maxMaximum number of simultaneous ITN requestsgauge
itn_requestsTotal ITN requestscounter

NLU (Natural Language Understanding)

MeasurementDescriptionType
nlu_average_request_process_time_distDistribution of average request processing timehistogram
nlu_active_requestsTotal number of active requestsgauge
nlu_total_requestsTotal number of initial requests receivedcounter
nlu_total_responses_returnedNumber of responses returned by the containergauge vector

MRCP

MeasurementDescriptionType
mrcp_total_requestsTotal number of calls (sessions) receivedcounter
mrcp_active_requestsTotal number of active requestsgauge
mrcp_average_request_process_time_distDistribution of average request processing timehistogram
mrcp_total_responses_returnedNumber of responses returnedgauge vector
mrcp_max_callsMaximum simultaneous number of calls processedgauge
mrcp_sip_callsTotal number of SIP calls processedcounter
mrcp_sip_tcp_connectionsTotal number of SIP TCP calls processedcounter
mrcp_rtsp_callsTotal number of RTSP calls processedcounter
mrcp_garbage_collection_callsTotal ended calls that are in the process of garbage collectiongauge

Diarization

MeasurementDescriptionType
diarization_average_request_process_time_distDistribution of average request processing timehistogram
diarization_active_requestsTotal number of active requestsgauge
diarization_total_requestsTotal number of initial requests receivedcounter

Language ID

MeasurementDescriptionType
lid_average_request_process_time_distDistribution of average request processing timehistogram
lid_active_requestsTotal number of active requestsgauge
lid_total_requestsTotal number of initial requests receivedcounter

Neuron

MeasurementDescriptionType
neuron_total_requestsTotal number of Neuron requests receivedcounter
neuron_active_requestsTotal number of active Neuron requestsgauge
neuron_active_requests_process_time_distDistribution of average Neuron request processing timehistogram
neuron_first_result_time_distDistribution of time to first audio byte in millisecondshistogram
neuron_first_result_time_maxMaximum time to first audio byte in millisecondsgauge
neuron_first_result_time_minMinimum time to first audio byte in millisecondsgauge
neuron_max_queue_size_requestsMaximum simultaneous Neuron requests at any one time since startupgauge
neuron_total_responses_returnedNumber of responses returned including typegauge vector

Filestore

MeasurementDescriptionType
filestore_active_deploymentsCurrent number of active deployments managed by the file-store servicegauge
filestore_cache_memory_bytesCurrent memory consumption of the file-store cache in bytesgauge
filestore_cache_sizeCurrent number of items in the file-store cachegauge
filestore_cache_reconciliationTotal number of cache reconciliation eventscounter
filestore_deployment_cache_sizeCurrent number of cached deployment artifacts/entriesgauge
filestore_http_request_duration_secondsDistribution of HTTP request latency in secondshistogram
filestore_http_requests_totalTotal number of HTTP requests receivedcounter
filestore_http_response_size_bytesDistribution of HTTP response body sizes in byteshistogram

Related Articles


Was this article helpful?