Site Reliability Engineer

London FULL TIME £55,000 - £75,000 / Year
(£4,583 - £6,250 / Month)

Job Description

We are seeking a Site Reliability Engineer to enhance our service reliability and performance. This role is crucial in optimising system uptime and managing scalable infrastructures. The ideal candidate will have a strong background in both operations and software engineering, ensuring our platforms run smoothly and efficiently.

Responsibilities

  • Oversee infrastructure deployments and manage system performance.
  • Develop strategies for reducing downtime and improving responsiveness.
  • Integrate monitoring tools to continuously assess system health.
  • Collaborate with cross-functional teams to optimize service performance.
  • Prepare technical documentation and operational playbooks.
  • Lead efforts to improve system automation and orchestration.
  • Ensure capacity planning and resource management is effectively implemented.

Requirements

Education
  • Bachelor's degree in Computer Science or related field
  • Master's degree in a related field preferred
Experience
  • 4+ years of experience in site reliability engineering or DevOps roles
Technical Skills
  • Scripting Languages
  • Docker
Soft Skills
  • Communication
  • Problem-solving
Certifications
  • Certified Kubernetes Administrator (CKA)
  • AWS Certified DevOps Engineer
Languages
  • English: Fluent

Advantageous

  • Experience with incident response and management: Proficient in handling incidents to minimize downtime.
  • Knowledge of Agile methodologies: Understanding of Agile processes and best practices.

Benefits

  • Comprehensive health, dental, and vision plans
  • Employee stock options
  • Flexible working arrangements
  • Training and development opportunities

Company Culture

  • Diversity and Inclusion: We are committed to diversity and inclusivity, welcoming talents from all backgrounds.
  • Work-life Balance: We promote a healthy work-life balance with flexible work arrangements.
Status: Closed