Officer, Site Reliability Engineering
4 hours ago
Job Purpose
Responsible for daily monitoring of the IT infrastructure, applications, and services behind critical systems (T24, ROC, COC, CARD, etc.), ensuring that these systems meet the SLAs committed to the business. The role also participates in handling alerts and incidents to restore services as quickly as possible and resolves outstanding issues to ensure the best service delivery for customers.
Key Accountabilities (1)
Participate In Monitoring And Handling System Alerts/Incidents/Problems
- Perform 24/7 monitoring and handle alerts across the entire IT infrastructure, applications, and services. If difficulties arise, escalate to L3 for coordinated handling.
- Ensure that project teams and specialized operations departments provide adequate alert/incident handling instructions for new services before they go live, and periodically review and update existing alert/incident handling instructions.
- Perform periodic reviews of issues/vulnerabilities in IT infrastructure/applications/services within the scope of responsibility.
- Participate in standardizing and developing relevant processes and regulations to ensure effective monitoring and handling of alerts/incidents.
- Coordinate with relevant units to promptly restore services/systems, investigate root causes, and propose and implement solutions.
- Participate in implementing changes across the software development environment, including on-premises and cloud.
Participate In Building And Optimizing Centralized Monitoring Tools
- Develop and publish standards for centralized monitoring tools (Dynatrace, Grafana, Splunk, etc.) and operate those tools.
- Integrate monitoring tools and support building monitoring dashboards for new IT infrastructure/applications/services.
- Ensure that project teams and specialized operations departments provide adequate monitoring indicators and thresholds for new services before they go live.
Key Accountabilities (2)
System Problem And Incident Management
- Manage the lifecycle of IT incidents, including identifying, classifying, coordinating, and resolving incidents according to SLAs.
- Act as the contact point during troubleshooting, ensuring effective communication between technical, operations, and sales departments.
- Perform root cause analysis (RCA) after each incident and recommend preventive measures and process improvements. Coordinate with relevant teams to minimize downtime and improve system availability.
- Participate in developing and maintaining incident management processes according to standards and best practices.
Key Accountabilities (3)
Responsibilities In Risk Management And Compliance
- Support controls and ensure that the unit's activities comply with issued policies, regulations, procedures, and instructions.
- Identify the unit's operational risks and coordinate with relevant units to develop methods to measure, evaluate, and minimize them.
- Report periodically to management and perform other tasks as directed by management.
Key Relationships - Direct Manager
Director / Senior Manager / Manager of ITSE
Key Relationships - Direct Reports
N/A
Key Relationships - Internal Stakeholders
Departments in IT and business
Key Relationships - External Stakeholders
Partners providing professional services
Success Profile - Qualification and Experiences
Qualifications
- Bachelor's degree or higher in Finance, Economics, Banking, Business Administration, or Computer Science.
Working Experience
- At least 3 years of relevant work experience.
- Minimum of 3 years in IT development and operations at a large enterprise, especially in banking.
- International certification in Systems.