DevOps

Site Reliability Engineer Full-time


SaM Solutions is an international IT-services and software solutions provider with 30 years of experience. We focus on IT consulting and custom software engineering services for both European and the U.S. markets, leveraging global resources. The geographical presence of SaM Solutions covers the USA, Germany, the Netherlands and countries in the Eastern Europe – Latvia, Lithuania, Poland.


SaM Solutions is looking for an experienced AWS System/DevOps Engineer with a good knowledge of English and in-depth knowledge of AWS to join our team as a Site Reliability Engineer (SRE) to work on a procurement project.


Responsibilities:


  • You will design, implement and manage a highly available and scalable cloud infrastructure on AWS, using best practices to optimize performance, security and cost;


  • You will work with software development teams to ensure applications are designed and built with reliability and observability in mind;


  • You will set up and maintain automated deployment, monitoring and alerting systems to proactively identify and resolve potential issues;


  • Perform performance testing, capacity planning and system tuning to ensure optimal performance of AWS services and applications;


  • Continuously seek process automation and improvement opportunities to increase the efficiency of our SRE operations.


Requirements:


  • In-depth knowledge of AWS services, including EC2, S3, RDS, key management and IAM, Systems Manager, OpenSearch Service, SQS. Hands-on experience with AWS cloud networking concepts VPC, WAF, Route53 and Elastic Load Balancers as well as monitoring tools such as CloudWatch, Prometheus, Grafana or similar;


  • Proficiency with Infrastructure-as-Code (IAC) tools such as Terraform or AWS CloudFormation for automated infrastructure provisioning and management;


  • Experience with containerisation technologies such as Docker and orchestration tools such as Kubernetes;


  • Strong scripting and automation skills using languages such as Python, Bash or PowerShell;


  • Hands-on experience with Git, JIRA and Confluence;


  • Practical experience and expertise in managing domains, DNS settings and maintaining an email infrastructure;


  • Sound knowledge of algorithms and techniques for encrypting data in transit/at rest. Experience with RDBMS (PostgreSQL) in a 24x7 production environment;


  • Knowledge of backup and recovery procedures and technologies, data storage and retention requirements. Experience in cost optimisation and budget management.


What we offer:


  • Official employment;


  • A flexible schedule, the possibility of remote work;


  • Paid time off and paid sick leave days;


  • English and German language classes;


  • Сorporate trainings, seminars, activities within internal PMO;


  • Possible business trips to customer's location;


  • For more information see our company description page.

Overview

  • Employer: SaM Solutions
  • Job Title: Site Reliability Engineer
  • Published: 7 months, 4 weeks ago
Apply For This Job