Site Reliability Engineer

Job No: GOSREBNE1
Location: Brisbane

The opportunity

We are currently looking for a Site Reliability Engineer to join our growing team in Brisbane.

You’ll be responsible for monitoring, identifying, troubleshooting and reporting platform problems to product engineers (or fixing the code yourself) to ensure a stable and reliable service.

We’re looking for people who are just as passionate about troubleshooting issues with distributed systems as they are about automation, code and collaborating to solve problems.

 

Who are we looking for?

  • You have at least five years’ experience working in infrastructure and/or development environments

  • You have at least three years’ experience using a public cloud (AWS, GCP or Azure)

  • Experience working in Linux environments

  • Understanding of Docker and container orchestration tools such as Kubernetes

  • You have experience with PHP, Go or Node.js, and preferably a DevOps background

  • You are comfortable writing software to automate API-driven tasks at scale

  • You have used Terraform, Ansible, Puppet, Chef or another config management suite, know where it falls short, and are open to trying alternatives

  • Healthy knowledge of Linux (you have compiled your own kernel at some point, know how to trace syscalls, understand TCP, care about the difference between sysvinit/runit/systemd, etc.)

  • Relentless desire to automate and build software tools

  • Desire to represent work in git, driven by a workflow through issues and pull requests

  • Problem solving - you will encounter many problems you will not have seen before. You can research an issue and find a solution

  • Communication - you can work with both technical and non-technical people to solve issues for internal and external users

  • You have strong knowledge of software delivery processes

  • Strong exposure to agile project delivery environments with rapid design, prototyping and feature iteration

  • Strong technical writing and documentation skills

  • High alignment with the GO1 Values

 

Your Responsibilities

  • You will participate in 24x365 on-call schedules
  • Track and report on frequency and severity of incidents
  • You will monitor the GO1 platform and cloud infrastructure, respond to incidents, correct and improve systems to prevent recurrence, and plan capacity
  • You will report and solve problems within the GO1 infrastructure services and collaborate on issues with product engineers
  • You will manage Cloud provider infrastructure, system deployments and product releases
  • You will participate in SRE software engineering, writing code that continually reduces human intervention in operational tasks and automates processes
  • Implement and maintain a highly available production hosting environment
  • Implement and maintain monitoring systems for high availability
  • Implement and maintain low-latency event streaming infrastructure
  • Implement security layers and monitoring in all production environments (e.g. firewalls, IDS)
  • Provide recommendations regarding future technology and strategy

 

Meet GO1

A graduate of Y Combinator, GO1 is disrupting workplace learning and changing the way organisations across the world train their teams.

With the goal of being the most used source for professional learning by 2020, GO1 merges technology and learning to help millions of users learn and grow in their careers.

GO1 employees enjoy a relaxed office environment and remote working options. You will also benefit from company-wide team-building events and a diverse, supportive team that helps you advance your career.

 

Why you will love working with us

  • Vibrant & Dynamic – We love what we do and we work hard at it, but we’re flexible and we have fun in the process

  • Smart & Authentic – We’re a clever bunch and we love to share knowledge and experiences with each other – we’re also not afraid of some robust discussion

  • Great work space – Enjoy free coffee, snacks and regular celebrations on site. We were a 2018 Australian HR Awards finalist in the Employer of Choice category

  • Passionate about Learning – Enjoy unlimited access to the GO1 Learning platform, where thousands of training modules are waiting for you. Our Engineering teams also regularly participate in hackathons

  • Serious growth – GO1 is on a stable, rapid growth curve and you’ll have the opportunity to grow with us

 
