Expert Site Reliability Engineer Job Vacancy at Finastra, Pune, Maharashtra
Full Details :
Company Name : Finastra
Location : Pune, Maharashtra
Position : Expert Site Reliability Engineer
Job Description :
About the Role
Finastra enables the financial services world to deliver the future of banking with applications that power financial institutions, marketplaces that accelerate industry & an open innovation platform for banks, fintechs & non-banks to connect and collaborate.
FusionOperate is Finastra’s DevOps self-service PaaS. It is transforming how we develop and operate, striving for software delivery and operational excellence, from commit to production and deployment frequency through to reliability, measured by our change success rate and mean time to repair. FusionOperate is a multi-cloud DevOps PaaS focused on Container Orchestration, Continuous Delivery, Observability, AIOps, Insights & Data.
As a Site Reliability Engineer, your mission is to protect and advance the software & systems behind Finastra’s Cloud hosted services running on FusionOperate for the biggest Financial Institutions in the world. Finastra believes in a blameless culture where the primary objective is continuous improvement. You’ll be treating operations as a software engineering problem, aiming to build reactive systems that self-heal and ensuring we keep revenue-critical systems up & running despite natural disasters, unexpected surges in traffic, and configuration errors.
Your day will vary from the fine-grained details of optimizing disk performance and authoring operational code for our applications to the big picture of reliability modelling. You will operate as part of a global scaled agile SRE team, applying your experience in Continuous Delivery.
Experience & Qualifications (relate to TP4 levels)
12+ years of experience in Computer Science
Application Development using Continuous Delivery for a SaaS or Managed Hosted application with operations experience
Authoring and consuming OpenAPI and gRPC-based APIs
Instrumenting metrics, logs & traces for applications & infrastructure you have worked on
Implementing Alerts/Logs Correlation and De-duplication for Noise Reduction
Designing and Implementing Self-Healing Scenarios
Integration with Monitoring tools, AIOps and Incident Management ecosystems
Implementing and Delivering robust Infrastructure as Code (IaC)
Designing, deploying and orchestrating microservices using Kubernetes
Appropriate RHEL, Kubernetes & Cloud Certifications a plus
Responsibilities:
Proactively identifying & eliminating excess operational work and poorly performing services
Authoring observability for applications and infrastructure using the RED & USE methods
Defining the required reliability of your service through service-level indicators (SLIs) and service-level objectives (SLOs), and using an error budget to balance the pace of innovation against reliability
Participating in continuous operations reviews and feedback loops to improve the effectiveness of monitoring
Implementing Resiliency Tests, Self-Healing & Circuit Breakers to handle chaotic conditions & ensure your service behaves reasonably even in the face of unexpected demand
Practicing Chaos Engineering in the pipeline, helping us implement and mature
Capacity Planning to determine resource requirements of your service for it to be scalable, efficient, and reliable
Leading Blameless Postmortem analysis for Incidents
Technology Stack Experience Required – Must have at least 5 years of relevant experience
Multi Cloud: Azure, AWS, GCP
Programming (Python, Node.js, NestJS, Golang, Java, JavaScript)
Kubernetes, Helm, ArgoCD & Serverless (OpenShift a plus)
Terraform, Ansible and/or Puppet
Moogsoft or similar AIOps Tools
Prometheus, Grafana, Loki and Tempo (OpenTelemetry a plus)
Data Services (Delta Lake, Knative, MongoDB, PostgreSQL/CockroachDB, Kafka, Spark, Camel)
*************************************************************************************************************
The above statements describe the general nature and level of work being performed by people assigned to this job. They are not intended to be an exhaustive list of all responsibilities, duties, and skills required. Reasonable accommodations may be made to enable qualified individuals with disabilities to perform the essential job functions. If you need assistance or an accommodation due to disability please contact your recruitment partner.
*************************************************************************************************************
Disclaimer : Hugeshout works to publish the latest job information only and is in no way responsible for any errors. Users must do their own research before joining any company.