Site Reliability Engineer

London, United Kingdom



No further applications

We are a team on a mission, to put accessible and affordable healthcare in the hands of every person on earth. Our mission is bold and ambitious, and it’s one that’s shared by our team who shares our values, to dream big, build fast and be brilliant.

To achieve this, we’ve brought together one of the largest teams of scientists, clinicians, mathematicians and engineers to focus on combining the ever-growing computing power of machines, with the best medical expertise of humans, to create a comprehensive, immediate and personalized health service and make it universally available.

Babylon was included in WIRED’s 2016 Top 100 Hottest Startups in Europe and CB Insights 2017 Global ‘AI 100’ list. Fortune Magazine included Babylon in their 2017 list of ’50 Companies Leading the AI Revolution’, the only listed company using AI in healthcare delivery.

At Babylon our people aren’t just part of a team, they’re part of something bigger. We’re a vibrant community of creative thinkers and doers, forging the way for a new generation of healthcare. We’re only as good as our people. So, finding the best people is everything to us.

We serve millions, but we choose our people one at a time…

We are looking for Site Reliability Engineers to help scale our systems for AI systems. We are launching babylon across the globe and need to scale. As a Site Reliability Engineer you'll be working along side our chatbot, diagnostics or monitoring tems and help build a scalable, stable and safe systems. You'll work with every other team within engineering from product to data to help build and scale systems.

What will you do?

    • Work closely with teams from across the AI engineering teams to deliver world-class applications
    • Implement Monitoring, Alerting and testing solutions
    • Continually review and maintain the security of our applications and systems
    • Implement maintain, and scale our ever continuous integration and delivery pipeline
    • Work with developers and QA teams to optimise our deployment, monitoring and debugging processes
    • Participate in on-call rotation
    • Define and evangelize SRE best practices to improve reliability and performance
    • Help to scale our services globally
    • Automate everything!

Who fits the role?

    • 3+ years of experience in deploying and managing distributed applications
    • Good level in Python or equivalent languages
    • Kubernetes wizard
    • Passionate in building tools and automation to improve the quality of the infrastructure
    • Must be able to identify problems before they happen, dig deep for root causes, and implement solutions that prevent future occurrences
    • A natural team player who enjoys working multiple development collaboratively with colleagues
    • Focused on delivery with a passion for quality and innovation

Babylon believes it is possible to put an accessible and affordable health service in the hands of every person on earth. How? By combining the ever-growing computing power of machines with the best medical expertise of humans to create a comprehensive, immediate and personalised health service and making it universally available.