One of the biggest software development companies in iGaming is looking for an SRE to join the team in the Isle of Man or Barcelona. The Site Reliability Engineer will play a critical role in the day-to-day operations of services and products relied on across the organization, partnering with development engineering and operational technology teams to ensure maximum platform stability, availability, and reliability.
Responsibilities
- Lead investigations during live outages and repeat outages where the cause is unknown
- Partner with development teams to drive DevOps best practices through review of both processes and standards
- Collaborate and communicate effectively with all stakeholders across the business
- Measure and report on system performance (SLOs), with an eye toward moving our system capabilities forward, getting ahead of customer needs, and continually improving our service offering
- Identify best-of-breed tools and develop custom ones to aid in the execution of the SRE job function
- Participate in system design consulting, platform management, and capacity planning
- Provide technical leadership and consulting to key business projects where SRE services are required
- Drive quality Root Cause Analysis (RCA) investigations and provide technical oversight of incident postmortems
- Contribute to process runbooks to improve operational processes
- Have an urge to document all the things so you don't need to learn the same thing twice.
- Work across multiple disciplines in the company, from infrastructure to observability and development teams, to ensure reliability objectives are met
- Assist with system performance tuning and proactive issue detection
- Identify manual tasks and toil that can be automated, and make better use of automation and orchestration platforms to drive auto-remediation and self-healing
- Have an enthusiastic, go-for-it attitude. When you see something broken or inefficient, you can't help but fix it.
Requirements
- Infrastructure and cloud technologies (on-prem and cloud)
- Software development with any of the following: C#, JS, SQL
- Knowledge of how to implement "Infrastructure as Code"
- Scripting and automation skills are mandatory (Python is highly recommended)
- Observability and logging technologies such as Elastic (ELK), Graphite, APM, Splunk, Prometheus
- Bachelor’s degree in computer science or equivalent engineering discipline
- 5+ years of experience in software development or infrastructure
- SDLC and DevOps mindset
- Relevant industry technical certifications (Cloud, Terraform, etc.)
- Linux system experience is advantageous
Benefits
- Relocation package
- For the Isle of Man office: visa sponsorship and accommodation support
- International environment
- Flexible work schedule