Site Reliability Engineer
6 days ago
Location:
- Ho Chi Minh City- Job Type:
- Permanent- Discipline:
- Software Engineering- Salary:
- Negotiable- Contact:
- Chelsea Phan**Site Reliability Engineer**
**Ho Chi Minh City**
**About NextWave**
NextWave Partners is the Recruitment Partner of choice within the Clean Energy, Sustainable Infrastructure, ESG, Impact Investment, Climate-Tech & Technology sectors. We are committed to supporting industries battling climate change towards a net-zero future and a sustainable economy.
**About the role**
**Roles and responsibilities**
- Implement and uphold a monitoring and alerting system to detect performance bottlenecks, system failures, and other issues.
- Create and assess disaster recovery plans to ensure the uninterrupted operation of the business.
- Comprehend and oversee production code through Infrastructure as Code (IaC).
- React to production incidents, and coordinate collaborative efforts across various teams and third-party partners to resolve issues.
- Lead root cause analyses (RCAs) and post-mortem assessments after incident resolution.
- Independently acquire proficiency in new tools and technologies as dictated by project requirements.
- Document solutions and tooling, disseminate knowledge, and provide training as needed.
- Formulate operational guides (run books) and scripts for automated issue resolution.
**Requirements**:
- A Bachelor's Degree or equivalent practical experience.
- More than 5 years of IT experience with a focus on Enterprise Cloud infrastructure (GCP, AWS, or Azure), with at least 4 years in AWS, particularly in a mission-critical environment.
- Extensive expertise in cloud architecture with a focus on resilience and security.
- Proficiency in identifying potential system bottlenecks and proposing enhancements.
- Sound comprehension of microservices, event-driven architectures, and the adoption of DevSecOps practices for supporting complex distributed systems.
- Background in maintaining and supporting cloud-related infrastructures, with a preference for experience in managing ECS and AWS Services.
- Hands-on experience in conducting resilience, chaos, and stress testing within a cloud context.
- Comprehensive understanding of various concepts, technologies, and frameworks.
- Solid knowledge of Unix/Linux operating systems, as well as proficiency in Python, Bash, Shell scripting, and SQL.
- Familiarity with engineering practices, including test automation, CI/CD, and release automation.
- Experience in incident response planning and automation.
- A self-reliant problem solver with a strong work ethic, capable of adhering to established architectural constraints.
- Willingness to participate in 24/7 on-call rotations (overtime or off-in-lieu provided).
- Proficiency in both written and spoken English.
- Profound knowledge of operating systems (RHEL, Ubuntu, Windows Server) with excellent debugging, troubleshooting, and problem-solving skills.
- Expertise in one of the following programming languages: Python, Shell, Golang, or JavaScript, emphasizing Site Reliability Engineering and the support of cloud services.
- Practical experience with cloud-based technologies and tools, particularly in deployment, monitoring, and operations, such as New Relic, Zabbix, CloudWatch, Snyk+Fugue, Grafana, and Prometheus.
- Strong familiarity with modern development technologies and tools, including Agile, CI/CD, Git, Terraform, and CircleCI.
- A solid understanding of networking protocols and cybersecurity best practices in a cloud environment.
- AWS certification is highly desirable
**Application**
**Keep in touch**
If you would wish to keep up to date with the latest NextWave opportunities and industry updates, please follow us on LinkedIn and create your profile on our website to receive a weekly newsletter in your inbox
**Our commitment**
Diversity is a core value at NextWave Partners, and we are proud to be partnering with equal opportunities employers. All qualified applicants will receive consideration for employment without regard to race, colour, religion, gender, gender identity or expression, sexual orientation, national origin, disability or age.
EA Registration No: R2199999
NextWave Partners Ltd. (EA License No: 16S8303 - UEN: 201602833E)
-
Sales Engineer
4 days ago
Ho Chi Minh City Metropolitan Area, Vietnam VPOWER RELIABILITY Full time $40,000 - $60,000 per yearCompany DescriptionVPOWER RELIABILITY is dedicated to providing a one-stop solution for improving reliability and ensuring safety. Driven by a commitment to connecting passion, sharing knowledge, and enhancing capabilities, VPOWER RELIABILITY strives to minimize downtime and optimize operational efficiency. Our mission is to empower businesses with reliable...
-
Site Reliability Engineer
6 days ago
Ho Chi Minh City, Ho Chi Minh, Vietnam HRS Group Full time $50,000 - $120,000 per yearHrs As a CompanyHRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay.ProcureTech digitally revolutionizes lodging procurement, connecting corporations and...
-
Site Reliability Engineer
4 days ago
Ho Chi Minh City, Vietnam Wizeline Full time**Site Reliability Engineer / DevOps**: Wizeline - Ứng Tuyển Cloud System Admin AWS - Đăng nhập để xem mức lương - 285 Cách Mạng Tháng 8, District 10, Ho Chi Minh- Xem bản đồ- Tại văn phòng- 4 giờ trước **3 Lý Do Để Gia Nhập Công Ty**: - Leading Technologies to Deliver Great Solutions - Enjoy Competitive &...
-
Senior Site Reliability Engineer
6 days ago
Ho Chi Minh City, Ho Chi Minh, Vietnam Zalopay Full time $40,000 - $120,000 per yearWe are seeking a Senior Site Reliability Engineer (SRE) with a strong DevOps mindset to drive automation, delivery excellence, and infrastructure scalability for our high-throughput payment platform. You will partner with engineering teams to streamline CI/CD pipelines, implement GitOps workflows, and build internal tools that improve developer productivity...
-
Site Reliability Engineer
6 days ago
Thành phố Hồ Chí Minh, Vietnam Pizza Hut Digital & Technology Full timePizza Hut Digital & Technology *** - Waseco Building - 10 Pho Quang Street, Ward 02, Tan Binh, Ho Chi Minh- Hybrid- Posted 11 minutes ago- Skills: - AWS English Azure **Top 3 reasons to join us**: - Flexible Friday afternoon - 18 Annual Leave + 5 Recharge Days/ Year - Hybrid working model **Job description**: **Role Overview** - As a site reliability...
-
Site Reliability Engineer
2 weeks ago
Ho Chi Minh City, Vietnam Tyme Full time**Site Reliability Engineer**: Tyme - Ứng Tuyển AWS Python DevOps - Đăng nhập để xem mức lương - HIU Tower, 215 Điện Biên Phủ, Phường 15, Binh Thanh, Ho Chi Minh- Xem bản đồ- Linh hoạt- 2 giờ trước **3 Lý Do Để Gia Nhập Công Ty**: - Excellent environment and team to help you grow. - Competitive salary and...
-
Sr. Site Reliability Engineer
2 weeks ago
Ho Chi Minh City, Ho Chi Minh, Vietnam HRS Group Full time $120,000 - $180,000 per yearHrs As a CompanyHRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay.ProcureTech digitally revolutionizes lodging procurement, connecting corporations and...
-
Sr. Site Reliability Engineer
2 hours ago
Ho Chi Minh City, Ho Chi Minh, Vietnam HRS Group Full timeHRS AS A COMPANY HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay.ProcureTech digitally revolutionizes lodging procurement, connecting corporations and...
-
Site Reliability Engineer
6 days ago
District , Ho Chi Minh City, Vietnam Moatable Full time $90,000 - $120,000 per yearWe are seeking an experienced and highly skilledSite Reliability Engineerto join our dynamic team. The ideal candidate will have a strong background in AWS, Jenkins, GitLab CI, and Infrastructure as Code (IaC). As a Senior DevOps Engineer, you will play a critical role in enhancing our CI/CD pipelines, automating infrastructure, and ensuring the reliability...
-
Senior Site Reliability Engineer
2 days ago
Ho Chi Minh City, Ho Chi Minh, Vietnam VNG Full time $30,000 - $120,000 per yearWe are looking for aSenior Site Reliability Engineer (SRE)with deep expertise in deploying, operating, and optimizing database systems on Kubernetes (K8s). In this role, you will play a critical part in ensuring the data infrastructure is highly reliable, high-performance, scalable, and proactively monitored through modern observability systems.Key...