We are looking for a Site Reliability Engineer, a creative thinker who loves to be on the cutting edge and to solve problems through technology.
Our product itself is DevOps oriented )
As a company, we are doubling down on SaaS product which raises the bar in terms of availability and maintainability (compared to on-prem software). This requires us to re-imagine the way we deploy, maintain and manage our infrastructure to deliver a stable and reliable SaaS experience for users who make our SaaS platform a critical part of their production infrastructure management.
Terraform provisions and manages the lifecycle of infrastructure. Scalr builds a management layer on top of Terraform, which helps DevOps scale to their entire organization. As an engineering organization, we follow a DevOps approach as well, researching cloud services, adopting best practices, and using Terraform throughout, which helps us better understand problems and use-cases.
By joining our team, you’ll face the challenges and tradeoffs of building a highly reliable production system composed of modern microservices and the previous monolithic approach. The principal stack is Terraform for Google Cloud Platform, Linux, Docker, Kubernetes, experience in Python scripting, CI platforms and monitoring and logging tools
At Scalr, we believe that the best software is produced when engineers take pride and ownership of the code they write, which is why engineering is expected to provide customer support. We value troubleshooting skills and customer empathy because at the end of the day, writing good code and helping customers be successful is what lays the foundation of building great companies.
Python (experience in Python scripting is enough)
Terraform (for GCP)
Linux (RHEL/Debian, bash scripting)
Google Cloud Platform
Experience with monitoring and logging tools such as Grafana, Prometheus, Datadog, New Relic etc.
Experience with CI platforms such as GitHub Actions, Drone, CircleCI etc.
Would be a plus:
Leading SRE teams or initiatives
Experience with GitOps, Argo CD, Flux CD or similar
Chef, Omnibus, Ruby
As part of our team, you will work on:
Own and maintain production infrastructure in GCP and Kubernetes
Implement and maintain Infrastructure as Code in Terraform
Take part in rolling out new releases and improving the efficiency and reliability of releases
Assist customers with on-prem installations of our product
Work with developers to ensure customer data security and isolation in Docker
Take ownership of system monitoring, logging and alerting
Own and maintain complex CI pipelines
Maintain a self-service test environment platform
A leadership role with a great deal of impact and responsibility
Our product itself is DevOps oriented
Migration to Kubernetes
Working with complex CI pipelines that involve cross-project end-to-end tests and continuous delivery (Drone and GitHub Actions)
What does Scalr offer?
Work with interesting product in an enjoyable environment
The opportunity to see how your ideas and visions are realized
Attractive compensation and benefits package
Long-term contract and tax compensations
Remote work or in the office
20 working days of paid vacation and unlimited paid sick leaves