DE Jobs

Search from over 2 Million Available Jobs, No Extra Steps, No Extra Forms, Just DirectEmployers

Job Information

Robert Half Site Reliability Engineer III in Alpharetta, Georgia

Description

This role is designed for a professional with a focus on AWS and Site Reliability Engineering (SRE). The ideal candidate will handle basic reliability enhancements and efforts to reduce operational toil. They will monitor system performance, set up proactive alerting to maintain service levels, and join the on-call rotation. Additionally, they are expected to engage in disaster recovery exercises in production environments and may also be involved in training new team members.

Requirements

Scope and Responsibilities:

  • Develop monitoring queries and establish service level baselines.

  • Provide support to senior engineers during incidents.

  • Actively contribute to post-mortems and Root Cause Analyses (RCAs).

  • Participate in disaster recovery testing.

  • Implement automation and execute code in production environments.

  • Contribute to the documentation of SRE practices and knowledge.

Technical Competencies:

  • Observability: Capable of creating proactive alert rules, monitoring app performance through browser agents, scripting synthetic transactions, conducting advanced Application Performance Monitoring (APM), and formulating Service Level Objectives based on the Golden Signals.

  • Incident Management: Skilled in developing RCAs, leading scenario modeling, providing on-call support, and writing advanced automation scripts for incident management.

  • Design for Reliability: Able to make theoretical performance and capacity recommendations, with a strong understanding of DevOps practices including monitoring, cloud storage, and CI/CD.

  • Disaster Recovery: Proficient in on-call disaster recovery support, testing system component failover, and automating recovery using Infrastructure-as-Code.

  • Platforms and Automation: Experienced in identifying improvements for the developer experience, enhancing software delivery through automation, and maintaining secure cloud environments.

  • Reliability Culture: Able to contribute to the SRE knowledge base, analyze operational toil, and independently manage small toil reduction projects.

    Technology Doesn't Change the World, People Do.®

Robert Half is the world’s first and largest specialized talent solutions firm that connects highly qualified job seekers to opportunities at great companies. We offer contract, temporary and permanent placement solutions for finance and accounting, technology, marketing and creative, legal, and administrative and customer support roles.

Robert Half works to put you in the best position to succeed. We provide access to top jobs, competitive compensation and benefits, and free online training. Stay on top of every opportunity - whenever you choose - even on the go. Download the Robert Half app (https://www.roberthalf.com/us/en/mobile-app) and get 1-tap apply, notifications of AI-matched jobs, and much more.

All applicants applying for U.S. job openings must be legally authorized to work in the United States. Benefits are available to contract/temporary professionals, including medical, vision, dental, and life and disability insurance. Hired contract/temporary professionals are also eligible to enroll in our company 401(k) plan. Visit roberthalf.gobenefits.net for more information.

© 2024 Robert Half. An Equal Opportunity Employer. M/F/Disability/Veterans. By clicking “Apply Now,” you’re agreeing to Robert Half’s Terms of Use (https:///www.roberthalf.com/us/en/terms) .

DirectEmployers