Splunk, a Cisco company, is building a safer and more resilient digital world with an end-to-end full stack platform made for a hybrid, multi-cloud world. Leading enterprises use our unified security and observability platform to keep their digital systems secure and reliable. Our customers love our technology, but it's our caring employees that make Splunk stand out as an amazing career destination. No matter where in the world or what level of the organization, we approach our work with kindness. So bring your work experience, problem-solving skills and talent, of course, but also bring your joy, your passion and all the things that make you, you. Come help organizations be their best, while you reach new heights with a team that has your back.
Role Summary
At Splunk, we strive to be a reliable partner for our customers. Splunk Cloud Platform is monitored 24x7 worldwide by our Network Operations Center (NOC). Splunk monitors the service with goals to detect issues, restore service as quickly as possible, and keep customers and their stakeholders informed about outages. We have made great strides in building world class observability, but there is room to grow.
That's where we need you. We're looking for an accomplished engineering leader to champion an atmosphere of continuous improvement by serving as a leader, mentor, and technical advisor for managers and engineers on our frontline observability and network operations center teams.
You will help design and implement processes, tools, and systems that improve the operational experience overall for Splunk engineering and cloud customers. You will lead creating the process of producing insights from NOC alert data and turning that into feedback to engineering teams about what is breaking, the impact of those breakages, and influence where the business should invest in improving reliability.
Your ability to ideate, gain buy-in, and effectively roll out changes will be pivotal in shaping the future of our engineering culture and the experience felt by each and every customer.
Meet the Team
Splunk is a 20-year old / multi-million LOC product suite with over 2,000 engineers working across it.
The organization you are joining is responsible for the end-to-end feedback cycle of code to customer back to developer. We provide services for producing and consuming logs, metrics, and telemetry as well as generating meaningful data-driven insights. You would be the Engineering Manager leading teams that create automation and respond to alerts for Splunk cloud customer environments. The network operations team is a group of system-minded engineers handling cloud alert remediation 24x7 across three shifts. The responsibility of these teams are to monitor and resolve issues that affect the availability and performance of Splunk for our cloud customers. As the authority on our customer’s experience, our team is the frontline of defense in making sure each of our customers have an outstanding experience.
Your peers will be other Engineering Managers that lead other parts of developer observability (Platforms, Insights, and Enablement) as well as Principal Engineers who drive data ingestion quality, usability, and SRE standards.
What we offer you
- A constant stream of new things to learn. You'll learn how our whole stack works, from code compilation to log/metric observation in the wild. We're also always expanding into new areas like bringing in open source projects and contributing back, exploring new technologies, and seeking new ways to make our ecosystem more developer-friendly.
- Impact. We give our leaders an environment in which they can contribute from day one while also providing opportunities for learning and growth.
- Skilled and dedicated peers, all the way from engineering to product management and customer support. We are an engineering and product-focused company. Our engineers take a leading role in designing, architecting, building and testing our product.
- Growth and mentorship. We believe in growing engineering leaders through partnership and mentorship opportunities.
- A stable, collaborative and supportive work environment. We are totally remote friendly. You can choose to work from a Splunk location or you can be in any US time zone and work with the rest of the team around the globe.
- Work-life balance. We don't expect people to work 12-hour days. We want you to have a successful time outside of work too. We trust our colleagues to be responsible with their time and commitment, and believe that balance helps cultivate an outstanding environment.
What you'll get to do
- Bring and share your diverse ideas and lived experiences to set the standard for what good observability and reliability could look like.
- Orchestrate the creation of innovative approaches for expanding our operations center and empower automation and data-driven insights.
- Collaborate closely with customer support and engineering teams to understand their needs and challenges with our products.
- Partner with sister teams across the company (from developers to customer support and more) to create a cohesive story of how a product change gets into our customer's hands.
Must-have Qualifications
- Extensive background (12+ years) in engineering with at least 5 in management focusing on service observability for enterprise software.
- Deep knowledge and ability to use observability tooling.
- Experience tuning alerts and understanding best practices related to alerting.
- Background in maintaining 24x7 production systems for customers.
- Experience supporting customers in a SaaS environment.
- Excellent communication skills and a history of collaborating effectively with cross-functional teams.
Nice-to-have Qualifications
We’ve taken special care to separate the must-have qualifications from the nice-to-haves. “Nice-to-have” means just that: Nice. To. Have. So, don’t worry if you can’t check off every box. We’re not hiring a list of bullet points–we’re interested in the whole you.
- Participated in crafting and delivering software as a service and working with cloud infrastructure services such as AWS EC2, S3, Kubernetes, etc. is desirable but not required.
- Experience working with multiple cloud providers (like AWS, GCP, and Azure).
- Implemented multiple types of observability (logs, metrics, events, telemetry) for large-scale software deployments.
Splunk is an Equal Opportunity Employer
At Splunk, we believe creating a culture of belonging isn’t just the right thing to do; it’s also the smart thing. We prioritize diversity, equity, inclusion, and belonging to ensure our employees are supported to bring their best, most authentic selves to work where they can thrive. Qualified applicants receive consideration for employment without regard to race, religion, color, national origin, ancestry, sex, gender, gender identity, gender expression, sexual orientation, marital status, age, physical or mental disability or medical condition, genetic information, veteran status, or any other consideration made unlawful by federal, state, or local laws. We consider qualified applicants with criminal histories, consistent with legal requirements.
Note:
Base Pay Range
SF Bay Area, Seattle Metro, and New York City Metro Area
Base Pay Range: $203,200.00 - 279,400.00 per year
California (excludes SF Bay Area), Washington (excludes Seattle Metro), Washington DC Metro, and Massachusetts
Base Pay Range: $182,880.00 - 251,460.00 per year
All other cities and states excluding California, Washington, Massachusetts, New York City Metro Area and Washington DC Metro Area.
Base Pay Range: $162,560.00 - 223,520.00 per year
Splunk provides flexibility and choice in the working arrangement for most roles, including remote and/or in-office roles. We have a market-based pay structure which varies by location. Please note that the base pay range is a guideline and for candidates who receive an offer, the base pay will vary based on factors such as work location as set out above, as well as the knowledge, skills and experience of the candidate. In addition to base pay, this role is eligible for incentive compensation and may be eligible for equity or long-term cash awards.
Benefits are an important part of Splunk's Total Rewards package. This role is eligible for a competitive benefits package which includes medical, dental, vision, a 401(k) plan and match, paid time off and much more! Learn more about our next-level benefits at https://splunkbenefits.com.