Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.
Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.
Design and develop designs, architectures, standards, and methods for large-scale distributed systems.
Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
We are looking for profiles with experience in:
Site Reliability Engineering (SRE) or similar roles with a focus on full stack ownership of critical services and technology areas.
Design and delivery of mission-critical systems, with a strong focus on security, resiliency, scalability, and performance.
Deep understanding of end-to-end configuration, technical dependencies, and production service characteristics.
Experience acting as a technical authority for end-to-end performance, operability, and scalability.
Collaboration with development teams to define and implement improvements in cloud service architectures.
Ability to articulate and guide on the technical characteristics of services and technology stacks.
Knowledge of automation and orchestration tools (DevOps, CI/CD, Terraform, Ansible, Kubernetes, .
Experience managing complex escalations and defining mitigations for distributed systems.
Understanding of the impact of architectural decisions on distributed systems and cloud services.
Strong professional curiosity and motivation to develop a deep understanding of advanced services and technologies.
 Keywords: 
SRE, Site Reliability Engineer, Cloud Architect, DevOps Engineer, Principal Engineer, Full Stack Engineer, Distributed Systems, Automation, Orchestration, Scalability, Resiliency, Kubernetes, Docker, Terraform, Ansible, CI/CD, Cloud (Oracle, AWS, Azure, GCP).
Career Level - IC3