Heroes examples
Automate Ansible Deployments with Open Source Ansible Tower
- Automate instalation of AWX (open source Ansible Tower)
- Infrastructure as Code using ansible playbooks and collections
- Write ansible playbooks and collections
- Design and test AWX backup and recovery (TBD)
Design Progressive Delivery with Argo Rollouts
- Investigate different open source solutions of progressive delivery
- Implement and test Blue/Green deployment using Argo Rollouts
- Write background pre-promotion and post-promotion analysis templates
- Test automated and manual rollback and promotion
- Write how-to for developers and system administrators
Design and Implement Vulnerability Detection and Monitoring
- Compare different open source scanning tools like Clair and Trivy
- Automate vulnerability detection throughout Jenkins pipelines
- Continous vulnerability monitoring withing Kubernetes clusters
- Scannig for vulnerabilities in container images, file systems, OS packages
- Shift Left, explain developers how to remediate critical vulnerabilities
Design and Implement Linting Solution for helm charts and Kubernetes manifests
- Compare different open source linting solutions like config-lint, kubeval, kube-score
- Implement YAML linting solution within Jenkins pipelines
- Create well-defined linting policies for helm charts and Kubernetes manifests
- Write how-to for developers, e.g. resources requests and limits of Java applications
Design Status Page automation with Google Cloud Function and PagerDuty
- Use Monitoring Services to automate creation of Status Page incidents and announcements
- Setup the PagerDuty API trigger to run a workflow which integrates with the Google Cloud
- Create a handler in Sensu Monitoring Service
- Write a Google Cloud Function with Python
Design Rundeck Self-Service Operations using Runbook Automation
- Create role based access control policies for Dev, Ops, QA and Customer Support teams
- Write a custom puppet module and add it to Infrastructure as Code
- Create and test Rundeck database backup and restore procedure
- Design a Rundeck upgrade procedure including a database backup
Transition Nagios Monitoring to Infrastructure as Code
- Write a custom puppet module for nagios-core, check_mk and custom scripts
- Move performance data from nagios rrd database to graphite whisper database
- Upgrade nagios from version 3 to version 4
- Upgrade python checks from version 2.7 to version 3.4
Explore ChatOps solutions
- Investigate different chat bots like hubot, errbot or lita
- Choose a team messaging app (Slack, HipChat, MatteryMost, etc...)
- Write a custom puppet module for hubot
- Integrate chatops with other DevOps Tools like Rundeck, PagerDuty, Slack, HipChat
Investigate High Availability Solution for Redis NoSQL Database
- Choose between Redis Sentinel and Redis Cluster
- Demonstrate the implementation of HAProxy in front of a Redis master and slave (agent) replication
- Ensure that client and application connections are always redirected to the master server
- Perform a Rolling Update of all Redis Nodes with zero downtime
Implement High Availability Solution for InfluxDB
- Configure all involved services
- Write a custom puppet module for High Availability Solution for InfluxDB
- Create Grafana Dashboards
- Add all services to monitoring to ensure HA is in place