Cloud Operations Engineer
YEPEESOFT PTE. LTD.
Full Time
Paya Lebar Air Base, East Region
Mid Level
Competitive
Description
Key Responsibilities
- Design and implement scalable, secure, and highly available cloud infrastructure across multi-region environments
- Manage and optimize Kubernetes clusters, ensuring reliability, performance, and scalability
- Develop and maintain CI/CD pipelines and automated deployment frameworks to improve release efficiency and reduce errors
- Lead cloud migration initiatives, ensuring zero downtime and data integrity
- Build and enhance observability frameworks (monitoring, logging, alerting) using tools such as Prometheus, Grafana, and ELK
- Drive system reliability improvements, including incident response, root cause analysis, and performance tuning
- Optimize infrastructure costs and resource utilization without compromising system stability
- Implement traffic management and capacity planning strategies for high-concurrency systems
- Develop tools and platforms to improve automation, operational efficiency, and developer productivity
- Collaborate with cross-functional teams to ensure high-quality system design and delivery
Requirements
- 5–8 years of experience in SRE, DevOps, or Cloud Engineering roles
- Strong hands-on experience with:
  - Kubernetes, Docker, and container orchestration
  - Cloud platforms (AWS, Alibaba Cloud, or similar)
  - CI/CD tools (Jenkins, GitHub Actions, etc.)
- Proficiency in infrastructure as code (e.g., Terraform, Ansible)
- Experience with monitoring and logging tools (Prometheus, Grafana, ELK stack)
- Strong knowledge of Linux systems, networking (VPC, DNS, CDN), and security best practices
- Programming/scripting experience in Go, Python, or Shell
- Experience with high-scale distributed systems and microservices architecture
Preferred Qualifications
- Experience in large-scale internet or e-commerce platforms
- Proven track record in cloud migration and cost optimization initiatives
- Exposure to multi-cluster Kubernetes management and automation platforms
- Experience in AI/ML platform infrastructure or data-intensive systems
- Leadership experience or ability to mentor junior engineers
Key Competencies
- Strong problem-solving and analytical skills
- Ability to work in fast-paced, high-availability environments
- Excellent communication and stakeholder management skills
- Proactive mindset with a focus on automation and continuous improvement