Principal Site Reliability Engineer
9 hours ago
Implementing SRE automation, developing automation across the stack, and optimizing operations hours by reducing manual operations.
- Eliminating toil by automation across all the layers - infrastructure provisioning, configuration management, deployment, testing, and operation on premise and public clouds (Google Cloud and AWS)
- Working on retooling our infrastructure to provide an agile, cloud based foundation that provides common infrastructure management and automation framework.
- Interfacing directly with senior staff members within the organization to discuss and assess compliance with IT policies, standards and procedures, suggest opportunities for improvement, and report on the status of specific. Work with development teams throughout the software life cycle ensuring sustainable software releases.
- Practicing sustainable incident response and blameless postmortems
- Help to build methodology to manage infrastructure and platform cost
- Train SRE junior members
- Manage small SRE team (4-6 members) to drive automation, scalability, high availability and performance of ZaloPay
**Yêu cầu**:
- Bachelor’s degree with five or more years of work experience.
- Six or more years of SRE relevant work experience.
- Experience in Systems Architecture, in-depth knowledge on SRE, IT Operations, Cloud, Coding and Scripting experience with Golang, Java, Python and automation tool: Terraform, Ansible,
- Strong experience with Google, AWS cloud environments, with working knowledge in standard cloud services, features and tool, with Certification in appropriate areas.
- Strong experience with automation provisioning dependency software on premises.
- Have experience building Disaster recovery solution is preferred
**Preferred**
- Five or more years of experience working on middle technologies like Kafka/ RabbitMQ, Springboot, REDIS, Elasticsearch MySQL, ETCD.
- Automation experience and ability to code or script at an advance level.
- Experience in Cloud & Container platform Strategies, Design, Architecture and Migration.
- Experience with designing and implementing CI/CD DevOps solutions using Jenkins pipelines using Python, Git, Shell, YAML, Kubernetes and Docker.
- Configuration Management experience with Chef, Puppet, Ansible or Python.
- Experience serving as both a mentor and advocate for your team.
- Experience performing analytics on previous incidents and usage patterns to better predict issues and take proactive actions.
-
Sales Engineer
2 weeks ago
Ho Chi Minh City Metropolitan Area, Vietnam VPOWER RELIABILITY Full time $40,000 - $60,000 per yearCompany DescriptionVPOWER RELIABILITY is dedicated to providing a one-stop solution for improving reliability and ensuring safety. Driven by a commitment to connecting passion, sharing knowledge, and enhancing capabilities, VPOWER RELIABILITY strives to minimize downtime and optimize operational efficiency. Our mission is to empower businesses with reliable...
-
Site Reliability Engineer
2 weeks ago
Ho Chi Minh City, Vietnam Wizeline Full time**Site Reliability Engineer / DevOps**: Wizeline - Ứng Tuyển Cloud System Admin AWS - Đăng nhập để xem mức lương - 285 Cách Mạng Tháng 8, District 10, Ho Chi Minh- Xem bản đồ- Tại văn phòng- 4 giờ trước **3 Lý Do Để Gia Nhập Công Ty**: - Leading Technologies to Deliver Great Solutions - Enjoy Competitive &...
-
Site Reliability Engineer
6 days ago
Ho Chi Minh City, Ho Chi Minh, Vietnam Techcombank (TCB) Full timeData Engineering Function - Data and Analytics DivisionAbout the RoleWe are seeking a highly skilledSite Reliability Engineerwith experience applying Generative AI(GenAI) to automate and enhance the reliability of complex data platforms. You will beresponsible for building self-healing infrastructure, AI-powered observability, and automatingincident response...
-
Site Reliability Engineer
2 weeks ago
Ho Chi Minh City, Vietnam NextWave Partners Full timeLocation: - Ho Chi Minh City- Job Type: - Permanent- Discipline: - Software Engineering- Salary: - Negotiable- Contact: - Chelsea Phan**Site Reliability Engineer** **Ho Chi Minh City** **About NextWave** NextWave Partners is the Recruitment Partner of choice within the Clean Energy, Sustainable Infrastructure, ESG, Impact Investment, Climate-Tech &...
-
Site Reliability Engineer
2 weeks ago
Thành phố Hồ Chí Minh, Vietnam Pizza Hut Digital & Technology Full timePizza Hut Digital & Technology *** - Waseco Building - 10 Pho Quang Street, Ward 02, Tan Binh, Ho Chi Minh- Hybrid- Posted 11 minutes ago- Skills: - AWS English Azure **Top 3 reasons to join us**: - Flexible Friday afternoon - 18 Annual Leave + 5 Recharge Days/ Year - Hybrid working model **Job description**: **Role Overview** - As a site reliability...
-
Site Reliability Engineer
1 week ago
Ho Chi Minh City, Vietnam Tyme Full time**Site Reliability Engineer**: Tyme - Ứng Tuyển Agile English DevOps - Đăng nhập để xem mức lương - HIU Tower, 215 Điện Biên Phủ, Phường 15, Binh Thanh, Ho Chi Minh- Xem bản đồ- Linh hoạt- 2 giờ trước **3 Lý Do Để Gia Nhập Công Ty**: - Excellent environment and team to help you grow. - Competitive salary and...
-
Sr. Site Reliability Engineer
1 week ago
Ho Chi Minh City, Ho Chi Minh, Vietnam HRS Group Full timeHRS AS A COMPANY HRS, a pioneer in business travel, aims to elevate every stay through innovative technology. With over 50 years of experience, their digital platform, driven by ProcureTech, TravelTech, and FinTech, transforms how companies and travelers Stay, Work, and Pay.ProcureTech digitally revolutionizes lodging procurement, connecting corporations and...
-
Site Reliability Engineer Lead
4 days ago
Ho Chi Minh City, Vietnam Aperia Solutions Vietnam Co Ltd Full time**Site Reliability Engineer Lead (Linux)**: Aperia Solutions Vietnam Co Ltd - Ứng Tuyển Linux System Admin English - Đăng nhập để xem mức lương - 12 BIMI Tower, Song Thao Street, Ward 2, Tan Binh, Ho Chi Minh- Xem bản đồ- Tại văn phòng- 5 giờ trước **3 Lý Do Để Gia Nhập Công Ty**: - 4/5 work from home days - Subsidizing...
-
Senior Site Reliability Engineer
2 weeks ago
Ho Chi Minh City, Ho Chi Minh, Vietnam VNG Full time $30,000 - $120,000 per yearWe are looking for aSenior Site Reliability Engineer (SRE)with deep expertise in deploying, operating, and optimizing database systems on Kubernetes (K8s). In this role, you will play a critical part in ensuring the data infrastructure is highly reliable, high-performance, scalable, and proactively monitored through modern observability systems.Key...
-
Site Reliability Engineer
6 days ago
Ho Chi Minh City, Ho Chi Minh, Vietnam Amanotos Full timeThông tin tuyển dụngVị tríNhân viên - Hạn nộp30/04/2025 - Số lượng cần tuyển1 người - Giới tínhKhông yêu cầu - Kinh nghiệmKhông yêu cầu kinh nghiệm - Bằng cấpKhông yêu cầu - Nơi làm việcTPHCM - Lĩnh vựcIT/CNTT - IT Phần mềmMô tả công việcJob descriptionCollaborate with development teams...