Site Reliability Engineer

Location Barcelona, Spain and London, UK
Job Type Permanent
Salary Attractive
Reference 31210

Our client, the leading software development house in the online gaming world, is looking for a Site Reliability Engineer for its offices in Barcelona and London.

Key Deliverables

  • Provide primary ownership of stability, automation and reliability issues within the Load and Performance environment
  • Lead investigations working with the appropriate teams to drive issue resolution
  • Improve our ability to proactively address problem spaces
  • Utilize the observability stack to effectively carry out SRE role responsibilities
  • Engage with the SMEs of different infrastructure areas and discuss best practices for each area
  • Help develop system administration standards and procedures to maintain consistent practices
  • Develop tools to aid in the execution of the SRE and Load & Performance job functions
  • Drive efficiencies in systems and processes: capacity planning, configuration management, performance tuning, monitoring and root cause analysis

Technical Competencies 

  • Strong technical problem-solving skills and troubleshooting ability
  • Excellent cross-functional domain knowledge
  • Ability to design and implement tools and applications needed for both internal and external problem resolution
  • Ability to gather requirements, with a working knowledge of business analysis
  • High-level understanding of the following products/platforms and their configurations: networks, storage, hosting site setups, server infrastructure (on-premise and cloud)
  • Ability to read, understand and code in at least 2 languages or frameworks: e.g. C#, Ruby, Java, Scala
  • Experience with technologies: e.g. C#, SQL, Windows Server, Linux, RabbitMQ, Redis, F5
  • Knowledge of industry performance standards, performance bottlenecks and web performance measures
  • Previous experience with monitoring technologies: e.g. Grafana, Splunk, Azure Application Insights, Nagios, ELK