Site Reliability Engineer
4 days ago
Hồ Chí Minh
Full-time
A Backend Reliability Engineer (BRE) at Zalo is responsible for ensuring the constant availability, optimal performance, and robust scalability of ZA's in-house backend systems. This position blends the skills of a traditional backend administrator with the principles of software engineering and site reliability engineering (SRE). BREs are proactive problem-solvers who leverage automation, deep technical expertise, and a collaborative mindset to build and maintain resilient and efficient data infrastructure.
**What you will do**:
- System Reliability and Availability: Design, build, and maintain highly available and fault-tolerant backend systems. Develop and implement strategies for disaster recovery, backup, and restore processes to minimize downtime and data loss;
- Performance and Scalability: Proactively monitor backend performance, identifying and resolving bottlenecks. Optimize queries, tune backend configurations, and plan for future capacity needs to ensure the system can handle growing data volumes and user loads;
- Automation and Tooling: Develop and implement automation for routine backend tasks, such as provisioning, configuration management, and patching. Build and maintain tools to improve the observability and manageability of the backend environment;
- Incident Response and Troubleshooting: Serve as a primary point of contact for backend-related incidents. Troubleshoot and resolve complex production issues, conducting root cause analysis to prevent recurrence. Participate in on-call rotations;
- Collaboration and Consultation: Work closely with software development teams to advise on backend design, schema changes, and query optimization. Collaborate with infrastructure and SRE teams to ensure the backend environment aligns with overall system architecture and reliability goals;
- Security and Compliance: Implement and maintain security best practices for backends, including access control, encryption, and auditing. Ensure compliance with relevant data protection regulations;
- Documentation and Knowledge Sharing: Create and maintain comprehensive documentation for backend architecture, processes, and procedures. Share knowledge and best practices with other engineering teams.
**What you will need**:
- Proven experience in backend administration, backend engineering, or a similar role. Experience with SRE principles is highly desirable;
- Experience with NoSQL databases such as MongoDB, Cassandra, Redis, Scylla, etc.;
- Proficiency in programming languages such as C++, Python, Java, etc.;
- Strong understanding of cloud platforms (AWS, Google Cloud, Azure) and their database services (e.g., RDS, Aurora, Cloud SQL);
- Experience with infrastructure-as-code tools like Terraform or Ansible;
- Knowledge of monitoring and observability tools (e.g., Prometheus, Grafana, Datadog);
- Familiarity with containerization and orchestration technologies (Docker, Kubernetes).
-
Site Reliability Engineer
2 weeks ago
Thành phố Hồ Chí Minh, Vietnam HRS Full time **City**: Ho Chi Minh **Job Function**: Tech **Job Area**: Product & IT **Seniority Level**: Mid-Senior level **Date**: Apr 23, 2025 **HRS AS A COMPANY** - HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech,...
-
Lead Site Reliability Engineer
1 week ago
Thành phố Hồ Chí Minh, Vietnam GFT Technologies SE Full time **Role Summary** We are seeking a highly skilled and motivated Lead Site Reliability Engineer (SRE) with strong AWS expertise to lead our Service Operations team. You will be responsible for driving SRE practices, ensuring the scalability, reliability, and performance of mission-critical systems for our digital banking clients. This role requires balancing...
-
Site Reliability Engineer
2 weeks ago
Ho Chi Minh City, Vietnam Wizeline Full time **Site Reliability Engineer / DevOps**: Wizeline - Cloud System Admin AWS - 285 Cách Mạng Tháng 8, District 10, Ho Chi Minh - At office - 4 hours ago **Top 3 Reasons to Join the Company**: - Leading Technologies to Deliver Great Solutions - Enjoy Competitive &...
-
Site Reliability Engineer
1 week ago
Ho Chi Minh City, Ho Chi Minh, Vietnam Techcombank (TCB) Full time Data Engineering Function - Data and Analytics Division. About the Role: We are seeking a highly skilled Site Reliability Engineer with experience applying Generative AI (GenAI) to automate and enhance the reliability of complex data platforms. You will be responsible for building self-healing infrastructure, AI-powered observability, and automating incident response...
-
Site Reliability Engineer
4 days ago
Thành phố Hồ Chí Minh, Vietnam Ninja Van Full time Ninja Van is a late-stage logtech startup that is disrupting a massive industry with innovation and cutting-edge technology. Launched in 2014 in Singapore, we have grown rapidly to become one of Southeast Asia's largest and fastest-growing express logistics companies. Since our inception, we’ve delivered to 100 million different customers across the region...
-
(Senior) Site Reliability Engineer
1 week ago
Thành phố Hồ Chí Minh, Vietnam GFT Technologies SE Full time **Role Summary** We are seeking an experienced and passionate (Senior) Site Reliability Engineer for the Service Operations team as we continue to grow our Operations-as-Service for our prime Digital banking client. This role comes with the opportunity to expand across other Digital Banking clients within our growing Vietnam delivery portfolio in the future...
-
Site Reliability Engineer
11 hours ago
Ho Chi Minh City, Vietnam Global Fashion Group Full time **Site Reliability Engineer**: Global Fashion Group - DevOps AWS Golang - Copac Square Building, 12 Tôn Đản, Ward 13, District 4, Ho Chi Minh - Flexible - 7 hours ago **Top 3 Reasons to Join the Company**: - International working environment - Highly scalable platforms...
-
Site Reliability Engineer
1 week ago
Ho Chi Minh City, Vietnam Tyme Full time **Site Reliability Engineer**: Tyme - Agile English DevOps - HIU Tower, 215 Điện Biên Phủ, Ward 15, Binh Thanh, Ho Chi Minh - Flexible - 2 hours ago **Top 3 Reasons to Join the Company**: - Excellent environment and team to help you grow. - Competitive salary and...
-
Sr. Site Reliability Engineer
1 week ago
Ho Chi Minh City, Ho Chi Minh, Vietnam HRS Group Full time **HRS AS A COMPANY** - HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay. ProcureTech digitally revolutionizes lodging procurement, connecting corporations and...
-
Senior Site Reliability Engineer
21 hours ago
Ho Chi Minh City, Ho Chi Minh, Vietnam HRS Group Full time **HRS AS A COMPANY** - HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay. ProcureTech digitally revolutionizes lodging procurement, connecting corporations and...