Observability Blogs
Latest Articles
APM: Not an Infrastructure Monitoring Strategy
Infrastructure monitoring fills a large gap not previously addressed by APM (or log management): intelligent and timely alerting on service-wide issues and trends across the environment, whether in the cloud, on-prem, or a mix of legacy and new architectures.
Splunk Insights EOL: Infrastructure and AWS Cloud Monitoring
Announcing the end of life for Splunk Insights for AWS Cloud Monitoring and Splunk Insights for Infrastructure on June 30, 2020.
Monitoring Amazon EC2 with Splunk Infrastructure Monitoring
Explore the top 12 challenges of monitoring Amazon EC2 when dealing with larger scale production deployments in part one of this two-part blog series.
12 Top Things to Monitor in Amazon EC2
Despite Amazon EC2's resilience and elasticity, meeting ongoing objectives requires close tracking of capacity, predictability, and interdependence. Splunk Infrastructure Monitoring offers a dashboard out of the box that shows you the most important EC2 metrics at a glance.
Survivorship Bias in Observability
Observability is a powerful concept in microservice-based applications, but be careful you aren’t biased toward the surviving data.
Monitoring Docker Containers: What Does It Take to Get Started?
Operationalizing Docker means more complexity and a greater need for monitoring and alerting on production environments. Learn Docker container monitoring best practices.
A Deep Dive into Splunk APM Alerts
Learn about some of the statistical underpinnings of Splunk APM Alerts, and how to use them.
Global Restart: CIOs Need to Simplify in the Face of Complexity
The global restart of economies derailed by the coronavirus pandemic is challenging organizations across the board. And from one industry to the next, IT must be a central player in establishing a new normal.
A Deep Dive Into Built-In Anomaly Detection: How the Algorithm Works
Discover how Built-in Alert Conditions and Alert Preview in Splunk Infrastructure Monitoring allow cloud operations to exploit the full power of our real-time analytics engine in a way that is both intuitive and flexible.
Controlling Trace Metadata in Splunk APM
Distributed tracing shows the details of a particular transaction or operation, with the potential for very detailed metadata. Learn how you can protect or remove sensitive customer data from traces using Splunk APM.
Monitor Microsoft Azure Functions in Real-Time
Discover how we've extended our Splunk Infrastructure Monitoring analytics capabilities to our Microsoft Azure customers so they too can monitor their functions in real time.
Mirrored Dashboards: Efficient Dashboard Management for Enterprise-Scale Monitoring
Mirrored Dashboards help organizations establish best practices in monitoring by enabling the broad distribution of standard dashboards, keeping them up to date over time, and allowing localized customizations, all while keeping proliferation in check.
Strategies for Monitoring Docker and Kubernetes Environments
Explore some of the challenges with scale, churn, and correlation in container environments, and some strategies for overcoming them.
Monitoring Amazon RDS with Splunk Infrastructure Monitoring
Amazon’s Relational Database Service (RDS) is one of the most popular database services in the world, used by 47% of companies on AWS according to 2nd Watch’s 2015 AWS Scorecard. In part one of this blog series, I described the top 10 challenges of monitoring Amazon RDS when dealing with larger scale production...
10 Top Things to Monitor in Amazon RDS
Splunk Infrastructure Monitoring offers a dashboard out of the box that shows you the most important RDS metrics at a glance.
Get the HEC Out of Splunk App for Infrastructure
The Splunk App for Infrastructure (SAI) has changed the game when it comes to IT Operations monitoring and alerting on metrics and logs.
How We Monitor and Run Kafka at Scale
Learn from our experience with Kafka at scale: what to monitor and alert on, troubleshooting, and capacity planning. Splunk Infrastructure Monitoring offers a dashboard out of the box that shows you the most important Kafka metrics at a glance.
Ask an Expert: How Splunk is Addressing New Technology Challenges
Key takeaways from Splunk's blog series, Ask an Expert, addressing the technology challenges organizations are managing in the face of the COVID-19 pandemic.
Alerts to Incident Response in Three Easy Steps
Get Splunk alerts escalated to the right teams and people with a mobile app notification, SMS message, or a live phone call with VictorOps.