In sync with the upcoming release of Splunk’s Machine Learning Toolkit 5.2, we have launched a new release of the Deep Learning Toolkit for Splunk (DLTK) along with a brand new “golden” container image. This release includes several new and exciting algorithm examples, which I will cover in part 2 of this blog post series: forecasting with Prophet, graph analysis with NetworkX, distributed machine learning with DASK, a new hybrid CPU/GPU example for PyTorch, and a very recent contribution from the Splunk community for an NLP library for Japanese.
Up until DLTK version 3.0, Docker was the only target environment for running containers for advanced machine learning and deep learning use cases. Docker was useful at the beginning and still is for local development, but many enterprise customers now want to connect to their OpenShift or Kubernetes stack, either on-prem or in the cloud.
The latest version 3.1 spells good news for those deployment scenarios: the Deep Learning Toolkit now allows you to connect to Kubernetes and OpenShift with a range of authentication options. The following matrix shows all options at a glance, and the short sketch after the table illustrates what token-based authentication looks like in practice:
| Splunk> | Docker | Kubernetes | OpenShift |
|---|---|---|---|
| Authentication Options | Docker Client Standard | Certificate & Key, User Token, Username & Password, AWS IAM | Certificate & Key, User Token, Username & Password, AWS |
| Ingress Type | Docker Networking Standard | Load Balancer, Node Port | Route |
| Cluster Connection Examples | unix://var/run/docker.sock, tcp://remotehost:2375 | https://ec2-xx-xx-xxx-xxx.compute-1.amazonaws.com:16443 | https://dltk.openshift.my |
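To make the “User Token” option more concrete, here is a minimal sketch of token-based authentication against a Kubernetes cluster using the official Python client. DLTK handles this for you through its setup page; the endpoint, token value, CA file path and namespace below are placeholders for illustration, not DLTK internals:

```python
# Minimal sketch: token-based authentication against a Kubernetes API server.
# Endpoint, token and CA file are placeholder values for illustration only.
from kubernetes import client

configuration = client.Configuration()
configuration.host = "https://ec2-xx-xx-xxx-xxx.compute-1.amazonaws.com:16443"
configuration.api_key = {"authorization": "Bearer " + "<your-service-account-token>"}
configuration.ssl_ca_cert = "/path/to/ca.crt"  # cluster CA certificate

api = client.CoreV1Api(client.ApiClient(configuration))

# List pods in the namespace the toolkit would deploy its containers into
for pod in api.list_namespaced_pod(namespace="default").items:
    print(pod.metadata.name, pod.status.phase)
```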
With the new setup options you can even connect to Docker and Kubernetes/OpenShift in parallel. This can be handy when you develop in a local Docker environment first and then move over to Kubernetes to put models into production. In any case, please make sure your DLTK setup is appropriately secured, as it involves network communication between your Splunk search head and your container environment.
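As a companion to the Kubernetes sketch above, here is a minimal example, assuming the docker Python package, of the two Docker endpoint styles from the matrix: the local Unix socket for development and a TCP endpoint for a remote daemon (the hostname is a placeholder):

```python
# Minimal sketch of the two Docker connection styles from the matrix above.
# Expose tcp:// endpoints only with TLS or inside a trusted network.
import docker

# Local development: talk to the Docker daemon over its Unix socket
local_client = docker.DockerClient(base_url="unix://var/run/docker.sock")
print("local daemon reachable:", local_client.ping())

# Remote daemon over TCP (placeholder hostname, default plain-text port)
remote_client = docker.DockerClient(base_url="tcp://remotehost:2375")
print("remote daemon reachable:", remote_client.ping())
```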
As you may have read in Anthony’s blog post, you can easily build your own custom container images for DLTK using the bits from GitHub. Even more conveniently, you can simply pull the existing, prebuilt images from Docker Hub to work with frameworks like TensorFlow, PyTorch or NLP libraries. However, you might ask: “Why are there 4 different images?” And you are right, there should be a much simpler way, which is exactly what the new “Golden Image GPU (3.1)” provides.
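If you prefer scripting over the UI, pulling a prebuilt image is a one-liner with the docker Python package. The repository and tag below are placeholders; check Docker Hub or the DLTK documentation for the exact image names:

```python
# Minimal sketch: pull a prebuilt DLTK image from Docker Hub.
# Repository and tag are placeholders; look up the exact names on Docker Hub.
import docker

client = docker.from_env()
image = client.images.pull("your-dltk-repository/golden-image-gpu", tag="3.1")
print("pulled:", image.tags)
```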
To improve security, this image was further modified to run model code as a non-root user in the container, which is a must-have for most production deployments. All libraries and frameworks have been updated to the most current stable versions and consolidated into a unified image that supports both CPU and GPU execution. On top of that, Jupyter Lab has been extended to run TensorBoard and DASK directly as widgets in its UI.
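Because the golden image covers both CPU and GPU execution, model code can pick its device at runtime. Here is a minimal, generic PyTorch sketch of that pattern, not code taken from the DLTK examples themselves:

```python
# Minimal sketch of CPU/GPU-agnostic PyTorch code: the same script runs in
# the golden image whether or not a GPU is available to the container.
import torch

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print("running on:", device)

model = torch.nn.Linear(10, 1).to(device)   # toy model for illustration
x = torch.randn(32, 10, device=device)      # batch of dummy inputs
y = model(x)                                # forward pass on CPU or GPU
print(y.shape)
```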
With the upcoming .conf20 and the recently opened 'Call for Papers', I want to encourage you to submit your amazing machine learning or deep learning use cases by May 20. Let me know if you have any questions or need support.
Last but not least, I would like to thank all the colleagues and contributors who helped make this release happen. Special thanks to Robert Fujara for his continued support of DLTK and for making Kubernetes a reality today!
Read about all the exciting new examples in part 2 of this blog post.
Happy Splunking,
Philipp