Role Overview: We are looking for a mid-level DevOps Engineer/Tech interested in building performant, stable, and resilient infrastructure. You will be responsible for architecting and building the infrastructure, and for coordinating with the teams responsible for other layers of the product infrastructure. Building stable infrastructure is a highly collaborative effort, so we need a strong team player with a commitment to perfection.
Key Responsibilities:
- Take responsibility for the scalability, stability and availability of our low-latency & mission-critical systems
- Enhance the CI/CD pipeline using GitHub Actions
- Generate and maintain Helm manifests for Kubernetes
- Maintain our Infrastructure as Code (IaC)
- Set up and maintain monitoring with New Relic, Prometheus, and Grafana
- Debug infrastructure and detect bottlenecks
- Take responsibility for the company-wide technical infrastructure across all environments
- Manage VPNs, load balancers, and firewalls
- Ensure high availability and disaster recovery
- Implement secure and stable infrastructure
- Drive continuous improvement and cost optimization
- Provide 24x7 monitoring, resolve infrastructure-tier incidents, and escalate to upper tiers where necessary
- Investigate incidents and document them in clear incident reports
Qualifications:
- Excellent understanding of the AWS cloud
- Knowledge of containerized environments with Kubernetes & Docker
- Ability to develop well-managed infrastructure setups
- Deep knowledge of Linux internals, network routing, and protocols
- Strong hands-on experience building high-availability infrastructure using EKS on AWS
- Strong understanding of IaC, specifically Terraform
- Strong understanding of GitOps, specifically ArgoCD
- Strong understanding of securing production infrastructure
Experience:
- Minimum 2 years of DevOps experience with production applications
- At least 1 year working with Terraform
- At least 1 year with monitoring tools (New Relic, Prometheus, Grafana, etc.)
- Experience designing and architecting high-performing applications
- Background in Agile development practices
- Experience in a fast-growing startup (preferred)
- AWS Certifications (preferred)