
Manager, Site Reliability Engineer
Royal Caribbean Group, Mexico City



Job description

Position Summary


The Manager, Site Reliability Engineer will lead the SRE team supporting the Royal Caribbean website ($183M gross revenue in 2021), using application and user performance data to guide informed decision making.

The Lead SRE will use site performance metrics collected from various sources and tools to support tasks such as initial triage of critical production incidents, bug analysis, implementation of site reliability engineering best practices, infrastructure optimization, and seamless collaboration between internal teams and external service providers, among other operational initiatives.


Essential Duties and Responsibilities:

Critical Incident Support

  • Review ticket analysis and approve closure of tickets/incidents

  • Understand the architecture of the Royal Caribbean website and escalate incidents as needed to the appropriate team for further triage.

  • Synthesize and communicate incident details to the production team and stakeholders, including executive-level stakeholders.

  • Review postmortem / RCA documents and follow up

Collaboration with Cross-Functional Teams

  • Ensure all team members are informed about relevant updates and changes that may affect the website

Qualifications, Knowledge and Skills:

Experience

  • Minimum Years of Experience: 10+ years in Site Reliability Engineering (SRE), DevOps, or a related IT operations role.
  • Management Experience: At least 3 years of experience managing teams and collaborating with external service providers.

Skills and Abilities

  • Technical Expertise: Proficiency in cloud platforms such as AWS (including AWS Elastic Beanstalk).
    Understanding of API design principles: REST, SOAP, GraphQL.
    Advanced knowledge of monitoring and logging tools (AppDynamics, Datadog, Splunk, New Relic, etc.).


  • Problem-Solving Skills: Strong analytical and troubleshooting skills to diagnose and resolve complex production issues swiftly.

  • Communication and Collaboration: Excellent written and verbal communication skills for effective interaction with cross-functional teams and documentation.

  • Ability to collaborate with Development, QA, IT, and external managed service providers to ensure seamless operations.

Education

  • Bachelor’s Degree: In Computer Science, Information Technology, Engineering, or a related field.

Certifications

  • Preferred Certifications:
    Any certification in monitoring and alerting tools, or equivalent.
    AWS Certified Solutions Architect, Google Professional Cloud Architect, or Microsoft Certified: Azure Solutions Architect.
    ITIL Foundation or any other certification in, or equivalent knowledge of, IT service management.

Power Skills:

  • Action Oriented

  • Collaborates Effectively

  • Communicates Effectively

  • Drives Results

  • Situational Adaptability

Required Skill Profession

  • Operations Specialties Managers


