Site Reliability Engineer - Integrated Order Management

The Coca-Cola Company
Egypt
2 days ago
Egypt • Bulgaria • Greece
Information Technology
Hybrid
Experienced Professionals
Department: Consumer and Customer Platform and Digital Factory, Digital & Technology Platform Services.

The Site Reliability Engineer (SRE) for Integrated Order Management (IOM) is responsible for maintaining and improving the reliability, availability, performance, and scalability of the IOM platform and the services it provides. This includes designing, implementing, and maintaining the infrastructure, tools, and processes necessary to support platform operations. The SRE will work closely with development and operations teams, both those of the core IOM platform based on SAP S/4 Order to Cash (OTC, formerly SD) and cross-platform teams covering various sales order entry channels and integrations, to ensure that the platform is scalable, secure, and highly available. The SRE will also be responsible for monitoring the platform, identifying and resolving complex issues, and implementing improvements to prevent future incidents. The SRE is expected to have a strong understanding of SAP S/4 OTC configuration and systems operations, and experience with highly integrated order management landscapes, DevOps practices, and cloud infrastructure, with a focus on automating operations and resolving complex technical challenges.

YOUR KEY RESPONSIBILITIES:
  • Availability & Performance Monitoring:
o Monitor product systems health, availability, and performance to ensure services meet defined service-level objectives (SLOs) and service-level agreements (SLAs).
o Work on system reliability engineering by designing and implementing systems that are resilient, fault-tolerant, and capable of withstanding both expected and unexpected failures.
o Proactively identify, troubleshoot, and resolve issues affecting the stability of the platform and its integrations.
o Implement and maintain monitoring and alerting systems to provide visibility on performance, application health, and user experience.
o Leverage observability tools (e.g., Azure Monitor, Application Insights, Power Platform Monitor, Dynatrace) to enhance real-time visibility.
  • Problem Management & Root Cause Analysis:
o Lead troubleshooting efforts for complex issues (problems), diagnose root causes, and drive improvements to prevent recurrence in collaboration with Development teams and Product Manager partners. Ensure proactive management of alerts and participate in crisis management teams as required.
o Perform root cause analyses with the relevant Development & Operations teams to identify and resolve recurring issues, documenting findings, and driving systemic changes to prevent future failures. Lead post-incident reviews to document learnings and work with teams to implement long-term fixes.
  • Automation & Continuous Improvement:
o Implement processes and practices that continuously improve the reliability, performance, and scalability of the system.
o Develop & maintain automations for repetitive tasks to increase efficiency, reduce manual effort, and improve response times.
o Collaborate with DevOps teams to ensure that releases follow testing and quality assurance standards.
o Ensure the platform can scale efficiently, meet growing user demand, and plan for future capacity needs.
  • Collaboration & Cross-Functional Coordination:
o Collaborate with product development teams, cloud operations, integration, and other platform teams to define reliability requirements and prioritize improvement initiatives.
o Partner with the security team to implement best practices and maintain compliance within the environment.
o Provide training and support to team members and users on incident response protocols, monitoring tools, and troubleshooting techniques.
o Share insights and best practices with other SREs to foster knowledge sharing and improve reliability across the organization.
o Work with the SRE Manager and other Platform SREs, the SIAM (Service & Integration Management) team, and the SRE Community of Practice to leverage synergies and share best practices to ensure Operational Excellence.

ARE THESE YOUR SECRET INGREDIENTS?
  • Bachelor’s degree in Computer Science, Engineering, Information Technology, or a related field.
  • 5+ years of experience in Site Reliability Engineering, DevOps, or IT operations with a focus on application reliability and observability.
  • 3+ years of experience with SAP S/4 OTC/SD, D365 CRM, and order management interfaces in B2B or B2C settings, preferably in the FMCG industry, including an understanding of their architecture, configuration, and integrations.
  • Hands-on experience with systems monitoring, alerting, and observability tools, especially within the Microsoft ecosystem (e.g., xx-xx-xx).
  • Strong knowledge of cloud infrastructure, data structures, and algorithms.
  • Experience with SAP TMS and Azure DevOps CI/CD pipelines (ALM).
  • Excellent troubleshooting and problem-solving skills, with a focus on improving long-term reliability and efficiency.
  • Strong communication and collaboration skills, with the ability to work effectively across cross-functional teams. Fluent in written and verbal English.
  • Detail-oriented and capable of managing multiple priorities in a fast-paced environment.
  • English proficiency

ABOUT YOUR NEW TEAM:
We are Coca-Cola Hellenic, a growth-focused consumer goods business and strategic bottling partner of the Coca-Cola Company. We bottle, distribute and sell an unrivalled range of products in 29 markets in Europe, Africa and Eurasia. As we do, we create value for all stakeholders, support socio-economic growth and build a more positive environmental impact.
We bring together more than 30,000 people from over 70 nationalities, coming from five continents. The diversity of our markets, from mature to emerging economies, provides a wide range of attractive opportunities for growth.
We nurture our talents. We give opportunities to people across all functions and levels, as well as different geographies, backgrounds and education. We are willing to take a risk on the people we believe in, even if they don’t have the perfect experience. We have faith in what every person can be.
And although we have so much to be proud of, we always stay humble. We believe the real magic happens – for us and for you – when we OPEN UP.

AT COCA-COLA HBC, DIVERSITY HELPS US THRIVE
At Coca-Cola HBC, we are an inclusive employer that thrives on diversity. This means our environment provides equal opportunities for all, regardless of race, color, religion, age, disability, sexual orientation, or gender identity. Join us in nurturing a culture where everyone belongs and contributes to our collective success.