Job Description

Job Header

Lead Site Reliability Engineer

Advertiser: Rezdy3 out of 53.0 overall rating (3 employee reviews) More jobs from this company

Job Information

Job Listing Date
22 Oct 2019
Location
Sydney, CBD, Inner West & Eastern Suburbs
Work Type
Full Time
Classification
Engineering, Management

Lead Site Reliability Engineer

Rezdy is the world’s leading independent B2B cloud-based booking and distribution platform in the experiences sector. Our mission is to power the growth of the experiences industry through tools and connections that make life easier. 

Chances are, if you’ve ever booked a tour or activity, you’ve interacted with Rezdy’s platform. On the one hand, we help ‘experience’ suppliers, such as whale watching tours, cooking schools and amusement parks manage their bookings online and connect with a wide distribution and resale network. On the other hand, we help those looking for experiences and inventory to resell to expand their own customer offering, such as online travel agencies (OTAs), tourist information centres, hotel concierges and local travel agents. 

Rezdy has been focused on growing over the last several months which has created a new opportunity for a Lead Site Reliability Engineer to join our equally scaling Engineering team. 

We want to continue to go faster but we don’t want to compromise on quality and performance. As the Lead Site Reliability Engineer, you’ll be responsible for setting, communicating and delivering a clear technical vision for infrastructure and platform health across our software engineering squads, building operational tooling, automating operational workflows, performing architecture/design reviews, investigating failures, outages and service degradation. 

Improvement is your middle name as you’ll focus on enhancing and optimising Rezdy’s monitoring infrastructure, defining SLIs / SLOs  with the teams responsible for various systems and services, and the flow of our development efforts. As we believe we achieve more together, the Lead Site Reliability Engineer will actively partner with our engineering squads with the ultimate goal of crafting products and services that make life easier for our customers. 

What you’ll be doing: 
  • Ensuring strategies include provisions for security, including reporting and alerting
  • Craft, implement and support the automated tools and systems necessary for multiple teams to deploy many times a day, across multiple environments 
  • Building resilient and self-scaling systems/environments so that you can sleep through the night
  • Design and own the strategy as well as the implementation and management of 24x7 support of systems/platforms for the SRE team
  • Network engineering and system administration
  • Coaching, mentoring, training, developing and managing the SRE team and building a high performance team and culture 
  • On-call support for our critical systems as well as lead incident response post-mortem analysis and review of incidents
  • Contributing to the technical design and architecture of software/platforms
  • Driving quality by conducting code reviews, unit testing and other automated tests

Who you are: 
  • You’ve had experience in a similar role and/or you’re looking for your next step up working within and scaling agile and devOps development environments
  • You have a deep knowledge of AWS cloud infrastructure and services (inc. CloudFormation, ECS, EC2, SNS, SQS, Aurora / RDS, DynamoDB, Cognito, ElasticSearch, S3, EventBridge, ALBs, API Gateway, Lambda, etc), CI/CD pipelines, practical experience in cloud application/platforms software engineering and best practices
  • You’re comfortable reading, writing and debugging code, and preferably have hands-on experience with JavaScript, TypeScript, Java or PHP
  • You possess  demonstrable experience in network management and diagnostics
  • You are skilled at configuration management, automation, and infrastructure-as-code tools and techniques
  • You’ve experienced working with modern, cloud based architectures, RESTful APIs, event driven services and serverless tools
  • You’re hungry to develop your knowledge about web technologies with an ability to evangelise to the team internally and externally
  • You’re passionate about leadership, both in yourself and others - growing and mentoring others is reward 
  • Ever-curious and adventurous, you love learning new things. You’re analytical and detail oriented with a strong drive and innovative and collaborative approach
  • You’re down-to-earth, ever-curious and thrive on being involved - you enjoy making things happen
  • You’re a great teammate who’s passionate about sharing with others and building the best possible product for our customers

What’s in it for you:
  • Your Rezdy anniversary day off to thank you for a great year!
  • How do you like the sound of company-wide social events, team outings, and exclusive discounts on tours and activities?
  • Work with a fascinating product in a high growth tech-thirsty industry 
  • Passionate team with opportunity to work across exciting projects. You can own it and make it happen
  • Opportunities to step up for high achievers
  • Diverse Scaleup culture 
  • Company wide innovative and collaborative hack events every quarter

Reasons to believe:
  • We’re a winning brand - Capterra Ease of Use winner 2018
  • Our marketing is recognised and respected amongst the big players https://bit.ly/2ke66xc
  • We’re a startup that’s going places - ranked 10th in Australia, and top 500 globally. www.startupranking.com/rezdy
  • Here’s a taste of what our customers say about us: https://youtu.be/wqLZv7q1SIU

Rezdy champions different ways of thinking. We are committed to establishing a team that represents a variety of backgrounds, perspectives and skills. The more inclusive we are, the better our work will be.
Come join us, apply now.
Rezdy is the world’s leading independent B2B cloud-based booking and distribution platform in the experiences sector. Our mission is to power the growth of the experiences industry through tools and connections that make life easier. 

Chances are, if you’ve ever booked a tour or activity, you’ve interacted with Rezdy’s platform. On the one hand, we help ‘experience’ suppliers, such as whale watching tours, cooking schools and amusement parks manage their bookings online and connect with a wide distribution and resale network. On the other hand, we help those looking for experiences and inventory to resell to expand their own customer offering, such as online travel agencies (OTAs), tourist information centres, hotel concierges and local travel agents. 

Rezdy has been focused on growing over the last several months which has created a new opportunity for a Lead Site Reliability Engineer to join our equally scaling Engineering team. 

We want to continue to go faster but we don’t want to compromise on quality and performance. As the Lead Site Reliability Engineer, you’ll be responsible for setting, communicating and delivering a clear technical vision for infrastructure and platform health across our software engineering squads, building operational tooling, automating operational workflows, performing architecture/design reviews, investigating failures, outages and service degradation. 

Improvement is your middle name as you’ll focus on enhancing and optimising Rezdy’s monitoring infrastructure, defining SLIs / SLOs  with the teams responsible for various systems and services, and the flow of our development efforts. As we believe we achieve more together, the Lead Site Reliability Engineer will actively partner with our engineering squads with the ultimate goal of crafting products and services that make life easier for our customers. 

What you’ll be doing: 
  • Ensuring strategies include provisions for security, including reporting and alerting
  • Craft, implement and support the automated tools and systems necessary for multiple teams to deploy many times a day, across multiple environments 
  • Building resilient and self-scaling systems/environments so that you can sleep through the night
  • Design and own the strategy as well as the implementation and management of 24x7 support of systems/platforms for the SRE team
  • Network engineering and system administration
  • Coaching, mentoring, training, developing and managing the SRE team and building a high performance team and culture 
  • On-call support for our critical systems as well as lead incident response post-mortem analysis and review of incidents
  • Contributing to the technical design and architecture of software/platforms
  • Driving quality by conducting code reviews, unit testing and other automated tests

Who you are: 
  • You’ve had experience in a similar role and/or you’re looking for your next step up working within and scaling agile and devOps development environments
  • You have a deep knowledge of AWS cloud infrastructure and services (inc. CloudFormation, ECS, EC2, SNS, SQS, Aurora / RDS, DynamoDB, Cognito, ElasticSearch, S3, EventBridge, ALBs, API Gateway, Lambda, etc), CI/CD pipelines, practical experience in cloud application/platforms software engineering and best practices
  • You’re comfortable reading, writing and debugging code, and preferably have hands-on experience with JavaScript, TypeScript, Java or PHP
  • You possess  demonstrable experience in network management and diagnostics
  • You are skilled at configuration management, automation, and infrastructure-as-code tools and techniques
  • You’ve experienced working with modern, cloud based architectures, RESTful APIs, event driven services and serverless tools
  • You’re hungry to develop your knowledge about web technologies with an ability to evangelise to the team internally and externally
  • You’re passionate about leadership, both in yourself and others - growing and mentoring others is reward 
  • Ever-curious and adventurous, you love learning new things. You’re analytical and detail oriented with a strong drive and innovative and collaborative approach
  • You’re down-to-earth, ever-curious and thrive on being involved - you enjoy making things happen
  • You’re a great teammate who’s passionate about sharing with others and building the best possible product for our customers

What’s in it for you:
  • Your Rezdy anniversary day off to thank you for a great year!
  • How do you like the sound of company-wide social events, team outings, and exclusive discounts on tours and activities?
  • Work with a fascinating product in a high growth tech-thirsty industry 
  • Passionate team with opportunity to work across exciting projects. You can own it and make it happen
  • Opportunities to step up for high achievers
  • Diverse Scaleup culture 
  • Company wide innovative and collaborative hack events every quarter

Reasons to believe:
  • We’re a winning brand - Capterra Ease of Use winner 2018
  • Our marketing is recognised and respected amongst the big players https://bit.ly/2ke66xc
  • We’re a startup that’s going places - ranked 10th in Australia, and top 500 globally. www.startupranking.com/rezdy
  • Here’s a taste of what our customers say about us: https://youtu.be/wqLZv7q1SIU

Rezdy champions different ways of thinking. We are committed to establishing a team that represents a variety of backgrounds, perspectives and skills. The more inclusive we are, the better our work will be.
Come join us, apply now.

Report this job advert

Be careful- Don’t provide your bank or credit card details when applying for jobs. If you see something suspicious .

Share this role