The Oxide Rack includes infrastructure for collecting resource utilization metrics and operational data from hardware and software components. These metrics provide the foundation for operator features such as capacity forecasting, power management, and hardware maintenance planning in future product releases. Control plane metrics also inform sizing and tuning decisions based on rack usage patterns, and give Oxide useful feedback for product improvements.
In the initial version of the Oxide Rack, a limited set of metrics is captured in the ClickHouse time-series database:
Telemetry type | Accessible by | Access method |
---|---|---|
Capacity utilization | operators | API, Console |
Disk I/O | project users | API, Console |
Switch | Oxide support | Technician tools |
Datalink | Oxide support | Technician tools |
The types of metrics and the methods available for system integration will be substantially expanded in future product releases.
Capacity Utilization Metrics
virtual_disk_space_provisioned
: virtual disk space allocated to provisioned instances, including both attached and detached disks but excluding snapshots and images

cpus_provisioned
: total number of CPU cores allocated to provisioned instances, excluding stopped instances

ram_provisioned
: total memory allocated to provisioned instances, excluding stopped instances
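
As an illustration of how these three metrics might feed capacity forecasting, the following minimal sketch turns one reading into per-resource utilization fractions. The `CapacitySample` class, the `utilization` function, the total-capacity arguments, and the byte units are all assumptions made for this example; they are not part of the Oxide API.

```python
from dataclasses import dataclass


@dataclass
class CapacitySample:
    """One reading of the three capacity utilization metrics (hypothetical container)."""
    virtual_disk_space_provisioned: int  # bytes (unit assumed for illustration)
    cpus_provisioned: int                # CPU cores
    ram_provisioned: int                 # bytes (unit assumed for illustration)


def utilization(sample, total_disk_bytes, total_cpus, total_ram_bytes):
    """Return the provisioned fraction of each resource.

    The totals are supplied by the caller; how they are obtained is outside
    the scope of this sketch.
    """
    return {
        "disk": sample.virtual_disk_space_provisioned / total_disk_bytes,
        "cpu": sample.cpus_provisioned / total_cpus,
        "ram": sample.ram_provisioned / total_ram_bytes,
    }


# Example with made-up numbers: half the disk and CPU, a quarter of the RAM.
sample = CapacitySample(
    virtual_disk_space_provisioned=500,
    cpus_provisioned=64,
    ram_provisioned=256,
)
fractions = utilization(sample, total_disk_bytes=1000, total_cpus=128, total_ram_bytes=1024)
```

Because `cpus_provisioned` and `ram_provisioned` exclude stopped instances, fractions computed this way reflect running workloads rather than everything that has been created.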
Disk Metrics
activated
: volume activation count

flush
: flush (cache-to-disk) count

read
: read I/O operations count

read_bytes
: read bytes count

write
: write I/O operations count

write_bytes
: write bytes count
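
Since the disk metrics are counts, a common way to work with them is to convert two readings into a rate. The sketch below assumes each metric is a cumulative counter sampled with a timestamp; that assumption, the `rate_per_second` helper, and the sample values are all hypothetical and should be checked against the data actually returned.

```python
from datetime import datetime, timedelta


def rate_per_second(samples):
    """Average rate of change between the first and last sample.

    `samples` is a list of (timestamp, count) tuples sorted by timestamp,
    where `count` is assumed to be a cumulative counter such as read_bytes.
    """
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    (t0, c0), (t1, c1) = samples[0], samples[-1]
    elapsed = (t1 - t0).total_seconds()
    if elapsed <= 0:
        raise ValueError("timestamps must be strictly increasing")
    if c1 < c0:
        # A cumulative counter should never decrease; treat this as a reset.
        raise ValueError("counter went backwards; it may have been reset")
    return (c1 - c0) / elapsed


# Made-up read_bytes readings taken 60 seconds apart.
start = datetime(2024, 1, 1, 0, 0, 0)
read_bytes = [
    (start, 1_000_000),
    (start + timedelta(seconds=60), 7_000_000),
]
throughput = rate_per_second(read_bytes)  # 6_000_000 bytes over 60 s = 100000.0 bytes/s
```

The same helper applies to `read`, `write`, and `flush` to obtain operations per second. Rejecting a decreasing counter keeps a reset from being reported as a negative rate.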