Site Reliability Engineer (f/m/d) (SRE): PaaS @ United Internet in Germany

With its business applications, IONOS is one of the leading hosting and cloud applications providers in Europe. With our state-of-the-art technologies, we convince more than 8 million customers in many different countries every day.

Your Tasks

As a team we have three areas of responsibility: platform operations, supporting services for the product teams and telemetry. We are responsible for the continued operation of our Platform as a Service offerings, including incident handling. We work with the product development teams to establish and maintain our service offerings and provide a tight feedback loop on their services’ performance. Accordingly, we provide monitoring, logging, metrics and other cross-product infrastructure on Kubernetes, so our product teams don’t have to worry about it. Finally, we gather a Site Reliability Engineer (f/m/d) (SRE): PaaS required metrics to enable data-driven decision making for our platforms and services. We are a development focussed team. While we absolutely need to work with all available tools to react to incidents, solutions should first and foremost result in code and automation. Our weapons of choice are Ansible, GoLang, GitOps and CI/CD - not root shells and bash scripts.

You will be responsible for the following tasks:

  • Running our Kubernetes and service infrastructure.
  • Building software and systems to manage platform infrastructure and applications.
  • Developing monitoring and alerting rules for symptoms and not outages.
  • Participating in system design consulting, platform management, and capacity planning.
  • Balance feature development speed and reliability with well-defined service level objectives.
  • Improving the deployment process to make it as uncomplicated as possible.
  • Be on an on call rotation to respond to availability incidents and provide support for service engineers with customer incidents.

We appreciate

  • Agile mindset and experience with modern development practices.
  • A proactive approach to spotting problems, areas for improvement, and performance bottlenecks.
  • Ability to program (structured and OO) with one or more high level languages, such as Golang, Python, Java and JavaScript.
  • Profound experience with cloud environments and Kubernetes.
  • Profound experience with the Linux operating system.
  • Experience with network fundamentals.

Apply here


I would be glad to assist you.

To discuss further in detail kindly reach me at or Skype me: cis.garry

Looking forward for your response.




If you are interested in this job please apply using the application link. It is a full time job which requires a permanent residence in Germany. You should be ready to relocate.

Regards, Pawel