Say goodbye to blind spots, guesswork, and swivel-chair monitoring. With Splunk Observability Cloud and AI Assistant, correlate all your metrics, logs, and traces automatically and in one place.
The shift to cloud-native infrastructure has transformed how we collect, store, and analyze metrics — bringing massive increases in data volume and cardinality.
In this article, we explore the concept of cardinality and its growing importance in modern monitoring and observability systems, especially in cloud-native environments.
Cardinality refers to the number of unique values in a dataset or field. A field or dataset may be either high cardinality or low cardinality:
In the context of observability, cardinality determines how finely we can slice and analyze our metrics. High cardinality provides rich insight but introduces challenges around scalability, performance, and cost. (More on these challenges later.)
See how low vs. high cardinality compares:
This infographic illustrates the difference between high and low cardinality in monitoring metrics. High cardinality involves many unique values — such as user IDs or transaction types — enabling granular insights but increasing system load. Low cardinality metrics, like server status or boolean flags, offer broader trends with less overhead. Understanding the trade-offs is key to designing efficient observability systems.
In monitoring and observability systems, the term “cardinality metrics” refers to metrics that track the number of unique combinations of a metric name and its associated labels or dimensions.
Here’s an example: A metric might be something like http_requests_total. Labels/dimensions are additional data points that give context, like:
Each unique combination of these label values creates a new time series. The more unique combinations, the higher the cardinality.
So, let’s say a simple app has 10 HTTP methods, 50 status codes, and 20 regions, you could have 10,000 unique time series, just for a single metrics.
High cardinality fields enable granular tracking of individual records or events. In monitoring, high cardinality helps to:
Observability takes things further — combining logs, metrics, and traces to deliver a complete picture of system behavior.
High cardinality is critical for debugging distributed systems because the higher the detail, the easier it is for engineers to trace individual user requests across microservices, identify anomalies within specific regions or clusters, and analyze vast datasets without losing crucial details.
We can sum up how high cardinality is so valuable, because it delivers:
Despite its benefits, high cardinality introduces significant operational challenges:
Modern observability platforms like Splunk Observability Cloud are built to support high cardinality at scale. These platforms:
Platforms like Splunk handle high cardinality and high dimensionality data, allowing users to freely query and filter on any attribute regardless of its cardinality.
Workload monitoring. Moving beyond system metrics like CPU or memory, workload monitoring focuses on application-specific metrics such as:
These metrics are often high cardinality due to their association with specific services, endpoints, or user actions.
(Related reading: common SRE metrics and monitoring for RED rate, errors, and duration.)
E-commerce. In an online store, each customer and order has unique identifiers. High-cardinality metrics allow monitoring by user ID or order ID, enabling use cases like:
Without this level of detail, troubleshooting and incident resolution would be limited to general trends, making it harder to identify the root cause.
Together, all of this makes a better customer experience.
Cardinality metrics are essential to modern monitoring and observability. They empower teams to diagnose issues with precision and maintain performance across complex, cloud-native architectures. But with great detail comes great responsibility — managing high cardinality requires thoughtful system design, efficient tooling, and disciplined metric instrumentation.
Understanding and harnessing cardinality can make the difference between reactive firefighting and proactive performance management.
See an error or have a suggestion? Please let us know by emailing splunkblogs@cisco.com.
This posting does not necessarily represent Splunk's position, strategies or opinion.
The world’s leading organizations rely on Splunk, a Cisco company, to continuously strengthen digital resilience with our unified security and observability platform, powered by industry-leading AI.
Our customers trust Splunk’s award-winning security and observability solutions to secure and improve the reliability of their complex digital environments, at any scale.