About the role
We are looking for a Staff Site Reliability Engineer, who will join our fantastic highly skilled platform team and will help us manage, develop and extend our robust cloud-based infrastructure.
Beat has numerous custom and complex systems running on Kubernetes clusters to provide the best experience for our customers.
We are looking for someone who loves Linux, who is intimately familiar with cloud infrastructure, and with a strong understanding of scaling and load management.
The successful candidate will be a team player, capable of collaborating in a fast paced, DevOps oriented environment.
Our mission is to build a self-service, reliable, efficient, and secure platform in an automated way. We’ll always improve it with user feedback, an effective business continuity plan, and be prepared against the variety of occurrences which can impact our business at any time.
What will you have to do day in day out :
Contribute into building our internal platform for running our Beat service
Write automation tools for system and application maintenance
Help solve business needs with technology by evaluating different technology options and vendor products
System troubleshooting and problem-solving across platform and application domains - will be expected to participate in on-call escalations to troubleshoot customer facing issues
What you need to have :
5+ years of experience running production workloads
3+ years experience designing and building distributed systems 3+ years of experience with Go, Python or Ruby
2+ year experience with Infrastructure as Code tools like Terraform
Experience managing container orchestration systems like Kubernetes
Strong understanding of Software Development Life Cycle, Test Driven Development, Continuous Integration and Continuous Delivery
Passion for improving process and products
Belief in automating away problems
What it's nice to have :
Experience on AWS, Google Cloud, or Azure
Experience with managing CI / CD systems
Experience with managing and utilising observability tools like Prometheus, Thanos, Loki, Tempo, Jaeger, Grafana, ELK
Experience in database systems like MySQL, MongoDB and Elasticsearch
Experience with Apache Kafka
Experience with writing end2end tests for Infrastructure
Experience with GitOps and Chaos Engineering
Knowledge of Agile methodologies
Behavioural & Leadership Competencies :
Ability to work independently and with little direct supervision
Demonstrate accountability and team leadership in the delivery of work
Mentoring peers and helping them grow
Driving company-wide projects by committing to them independently
Excellent verbal and written communication skills
Must be a team player
Superior problem-solving skills
Bonus points for a sense of humor, empathy, and curiosity
What's in it for you :
Competitive full-time salary
Flexible working hours, top Line tools
Working in a hyper-growth environment, you will enjoy numerous learning and career development opportunities
Breakfast, high-quality daily lunch on a very low cost, fruit and snacks all day long
Commuter Benefits Program
A great opportunity to grow and work with the most amazing people in the industry
Being part of an environment that offers challenging goals, autonomy and mentoring, which creates incredible opportunities, both for you and the company.
As part of our dedication to the diversity of our workforce, Beat is committed to Equal Employment Opportunity without regard for race, colour, national origin, ethnicity, gender, disability, sexual orientation, gender identity, or religion.