Work closely with various teams across the organization, including the security team, development team, and operations team for any AWS infrastructure requests.
Execute with minimal technical supervision, embrace reliability constraints, and be proactive in contributing improvements to the platform.
Encourage best practice policies. Adapt to various technologies and be willing to get involved with Kubernetes and AWS community as needed.
Assist in troubleshooting network, DNS, and storage issues and make recommendations for cost-saving options and growth.
Administrate our Kubernetes/ECS clusters and their lifecycle, in order to guarantee a high degree of reliability, security, scalability, and confidence at any given time.
Provide support, improve, and implement Kubernetes/ECS internal components and applications on top of multiple clusters. Troubleshoot and triage issues as they arise.
Coordinate and communicate with customers to articulate needs, gather requirements and engage in technical discussion as needed.
Participate in a shared on-call rotation Lead and mentor junior team members. Provide associated training as needed.
Basic Qualifications:
4+ Years’ experience in infrastructure design/implementation that directly aligns with the specific responsibilities for this position. (Required).
Should be able to Identify and correct the root cause of various system alarms. Recommend changes to avoid their recurrence
Should be able to resolve incidents escalated by L1 as per the agreed SLAs and timelines.
Experience with development on any modern languages (Java, .NET , Python, Go, etc.
Experience in configuration and management of databases such as MySQL, Mongo, etc.
Hands-on experience and knowledge of Kubernetes architecture and operations. · Expertise experience in container platform infrastructure in AWS (EKS, ECS, Fargate)
Should be able to perform an AWS Well-Architected
Review on AWS Accounts · Clear understanding of container technologies and the tools/challenges around them.
Ability to code/script in Python /Shell/PowerShell Language
Senior-level experience managing and working on Linux/Windows-based systems.
Extensive AWS Cloud Formation knowledge (YAML · Experience working with GitHub workflows and supporting CI/CD pipelines like Jenkins etc.
Experience delivering projects via Agile methodologies and have knowledge of SDLC · Excellent verbal and written communication skills.