Incident Management Reliability Engineer

海得拉巴, 印度 Regular 发布于 May. 25, 2026 申请截止于 Jun. 05, 2026

立即申请

Our Team:

Service Quality cultivates a culture of service excellence where quality is more than a benchmark – it's a shared purpose. Through synergistic collaboration, advanced monitoring, and empathetic customer advocacy, we strive to elevate every interaction and transform challenges into opportunities for growth.

Main responsibilities:

The Incident Management Reliability Engineer is responsible for ensuring the stability, resilience, and reliability of critical IT services. This role combines strong incident management expertise with reliability engineering principles to minimize disruptions, drive rapid recovery from major incidents, and continuously improve system performance and availability.

Incident Management
Lead the end-to-end management of Major Incidents (P1/P2), ensuring timely resolution and effective stakeholder communication.
Act as command centre lead during critical outages, coordinating across technical and business teams.
Ensure accurate and detailed incident documentation, including root cause, timeline and resolution steps.
Drive post-incident-reviews and ensure action items are implemented to prevent recurrence.
Maintain consistent communication and escalation processes aligned with ITSM best practices (e.g. ITIL)
Reliability Engineering
Collaborate with service owners and platform teams to enhance service reliability, observability, and fault tolerance.
Implement proactive monitoring, alerting, and automated recovery mechanisms.
Analyse incident trends and develop reliability improvement plans.
Participate in capacity planning, change reviews, and failure mode analysis to anticipate and mitigate risks.
Develop and track SLOs/SLIs/SLAs to measure service health and performance.
Continuous Improvement
Partner with problem management to identify recurring issues and lead root cause elimination initiatives.
Automate operational tasks and enhance service recovery using scripts, runbooks, and AIOps tools.
Contribute to the evolution of the Major Incident Process, ensuring best practices are embedded across the organization.
Key Performance Indicators
Mean Time to Resolve (MTTR) and Mean Time to Detect (MTTD).
Reduction in number and impact of recurring incidents.
Adherence to SLA/SLO targets.
Completion rate of post-incident actions.
Stakeholder satisfaction and transparency during incidents.

About you

Experience:
8+ years' experience.
Preferred Certifications:
ITIL v4 or Service Operations certification.
SRE Foundation / Practitioner certification.
Cloud certifications (AWS, Azure, or GCP).
Incident Command System (ICS) or equivalent leadership training in crisis response.
Soft skills:
Communication (verbal and written).
Technical skills:
Virtualization
Cloud Technologies
Database
Networking
Containerization
Automation
Middleware/Scheduling
Infrastructure as code
Languages:
English

Pursue progress, discover extraordinary

Better is out there. Better medications, better outcomes, better science. But progress doesn’t happen without people – people from different backgrounds, in different locations, doing different roles, all united by one thing: a desire to make miracles happen. So, let’s be those people.

At Sanofi, we provide equal opportunities to all regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, or gender identity.

Watch our ALL IN video and check out our Diversity Equity and Inclusion actions at sanofi.com!

null

追寻发展。探索菲凡。

进步需要我们每个人的参与——不论其背景、地域、或职业，我们都有一个共同的愿望：创造奇迹。你也可以成为其中的一员。我们不断追求变革，拥抱新思想，探索我们所能提供的一切机会。让我们一起追求进步。共同发现非凡。

在赛诺菲，不分种族、肤色、血统、宗教、性别、国籍、性取向、年龄、公民身份、婚姻状况、残疾或性别认同，我们为所有人提供平等的机会。

观看 “在赛诺菲的一天” ，并在官网 (sanofi.com) 上查看赛诺菲的多元化、公平与包容倡议！

立即申请

View All of Our Available Opportunities

您还没有查看任何职位。

您还没有保存任何职位。

体验可能性

Ama

Ama puts her project management techniques and ServiceNow knowledge to use to help advance Sanofi’s Digital Data operating model. Learn how our team connects data and AI to do what’s never been done before.
了解更多
Cambridge Crossing

We're bringing together 2,500 people from across our organization — R&D, Medical, Commercial and Global colleagues all working to realize the power of collaboration.
了解更多
Innovation in Action

Our flexible lab of the future will transform how we conduct research, while our innovation center will be fully integrated with existing R&D locations.
了解更多
Sanofi’s AI Centre of Excellence in Toronto

The Centre is focused on using leading technologies to develop world-class data and artificial intelligence (AI) products to create value for the health sector.
了解更多
Sanofi Canada's Philanthropic Efforts

By chasing the miracles of science to improve people’s lives, we surprise ourselves with what we can achieve. Our team is humbled by the impact our efforts make.
了解更多
Sustainable and Green

Our new facility was built to minimize the environmental impact — helping protect our planet and people. Using resources efficiently, we're providing greener, healthier workspaces.
了解更多
您保存的职位

了解更多
了解更多
心怀梦想，成就一番事业

我们希望您以饱满的热情投入到自己的工作岗位中，给全球数百万人带来美好生活。您的职业发展道路由您自己来掌控。您只管制定目标，我们会提供充足的培训机会和支持，让您得偿所愿。
了解更多
我们的故事

我们关注每一个员工的声音。因为，我们的未来取决于所有员工的付出与努力。正因为他们的助力，我们才能追求远大的理想。
了解更多

Incident Management Reliability Engineer

将此发送给 您认识的人

体验可能性

将此发送给
您认识的人