Lead Site Reliability Engineer
2 weeks ago
**What do we do?**
**Who are we?**
Having started in Germany in 1987, GFT Technologies has grown to become a trusted Software Engineering and Consulting specialist for the international financial industry, counting many of the world’s largest and best-known Banks as our clients. We are an organization that empowers you to not only explore but raise your potential and seek out opportunities that add value. At GFT, diversity, equality, and inclusion are at the core of who we are. Ensuring a diverse and inclusive working environment for all communities is one of the main pillars of our diversity strategy, based on our core values and culture. We have been certified for 2022/23 as a ‘Great place to work’ in the APAC region. So, if you want to have the opportunity to work with an outstanding and progressive organization this position could be right for you.
**Role Summary**
In this role, you will make significant decisions with a huge impact on building modern banking technology. You would be part of a team, responsible for designing & architecting new solutions, finding creative ways to optimize existing solutions which will improve agility for managing hundreds of microservices infrastructure in a stable & reliable way.
**Note**: This is an **Individual Contributor (IC) role** focused on hands-on technical expertise.
**Key Responsibilities**
- A strong believer of automating DevOps & SRE aspects like infrastructure provisioning, deployment, observability, incident lifecycle, uptime SLA etc.
- Bold to challenge, open to get challenged, curious to learn & grow
**The day-to-day activities**:
- Working with Kubernetes clusters hosted in AWS
- Using Infrastructure As Code (IaC) tooling like Terraform, and Ansible to manage AWS, Azure & Kubernetes resources
- Engage with the development teams throughout the life cycle to help develop software for reliability and scale.
- Coaching teams SRE best practices
- Troubleshoot priority incidents, facilitate blameless post-mortems and ensure permanent closure of incidents
- Perform analytics on previous incidents and usage patterns to better predict issues and take proactive actions
- Build and drive adoption for greater self-healing and resiliency patterns
- Design automated software and product upgrades, change management, and release management solutions
- Design, code, test and deliver software to automate manual operational work. Own your tools and services end to end.
- Performance and cost optimization for infrastructure
- Be part of on-call rotation for the team’s tooling and 24x7 support coverage as needed
- Succeed, fail, and learn together with other talented people. We believe in an environment that provides an opportunity for growth and sees education as an outcome of failure that gets us closer to the next breakthrough
**Required Skills**:
- Bachelor's degree in information systems, information technology, computer science, or similar.
- 8 - 11 years of professional experience in software engineering
- Experience with administering Kubernetes cluster
- Experience with managing Infrastructure as code using Terraform
- Direct production operations experience in a cloud environment
- Experience contributing to technology and product strategy
- Experience leading capability-building initiatives across diverse areas such as infrastructure and operations automation, observability, incident management, architecting HA systems, and other core engineering
- Demonstrated experience in driving operational efficiency and transparency of a growing engineering organization
**What can we offer you?**
- Competitive salary
- 13th-month salary guarantee
- Performance bonus
- Professional English course for employees
- Premium health insurance
**About Us**:
We show commitment to our investors and stand for solid, long-term growth performance. Founded in Germany in 1987 and in American territory since 2008, GFT expanded globally to over 10,000 experts. And to more than 15 markets to ensure proximity to clients. With new opportunities from Asia to Brazil, the international growth story continues. We are committed to grow tech talents worldwide. Because our team’s strong consulting and development skills across legacy and pioneering technologies, like GreenCoding, underpin success. We maintain a family atmosphere in an inclusive work environment.
**Why Choose GFT?**:
- Competitive Compensation
- Benefits package including comprehensive medical, dental, vision and others
- Company Culture based on our Core Values
- Professional Development Training with Individual Development Plans to map out your career growth
- Opportunity to work in a global environment with diverse teams built with colleagues from around the world
- Opportunity to work with technology industry leaders in the financial services industry
- Opportunity to work for big name clients in capital markets, banking and other industries
-
Hà Nội, Vietnam Amazon Corporate Services Vietnam Company Limited Full timeBachelor's degree or above in electrical engineering, material engineering, mechanical engineering or related fields. - 5+ years experience as an engineering lead or engineering project manager. - Good understanding of the principles and basic structures of measuring instruments such as HALT, chambers, oscilloscopes, multimeters etc. - Strong technical...
-
Hà Nội, Vietnam Amazon Full timeDESCRIPTION Amazon develops innovative consumer-centric product solutions. As a reliability program engineer you will be part of an exciting team developing, testing, and delivering new products. Your primary responsibility will be the development and implementation of methodologies/techniques to enhance product reliability. You will work closely internal...
-
Senior Officer, Site Reliability Engineering
7 days ago
Hà Nội, Vietnam Techcombank Full time5 May 2025 **Senior Officer, Site Reliability Engineering (40001670)**: - Category: Technology Division - Job Type: - Facility: Technology **Job Purpose**: **Key Accountabilities (1)**: - 'Participate in monitoring and handling system alerts/incidents/problems: - Ensure projects/specialized operations departments provide adequate warning/incident...
-
Hardware Reliability Engineer
2 weeks ago
Hà Nội, Vietnam Google Full time**Minimum qualifications**: - Bachelor's degree in Hardware Engineering or equivalent practical experience. - 4 years of experience in reliability engineering with consumer products. - Experience in building and leading testing (e.g., reliability, Quality Control (QC) etc.) and validation efforts to support hardware products. **Preferred...
-
Site Reliability Engineer
2 weeks ago
Hà Nội, Vietnam OpenCommerce Group Full time**Top 3 reasons to join us**: - Làm việc với các đội ngũ trẻ tài năng và máu lửa - Môi trường làm việc thoải mái, năng động - Sản phẩm của người Việt chinh phục toàn cầu **Job description**: - Manage and improve system reliability through SLO, SLI, and SLA practices. - Design and implement observability...
-
Site Reliability Engineer
3 days ago
Hà Nội, Vietnam UpBase Full time**UpBase***: Đồng hành cùng doanh nghiệp phát triển bền vững trên Thương mại điện tử - Company type - IT Service and IT Consulting - Company industry - IT Services and IT Consulting - Company size - 1-50 employees - Country - Vietnam - Working days - Monday - Friday - Overtime policy - No OT - At office - Skills: - DevOps System...
-
Site Technical Manager, Global Manufacturing
5 days ago
Hà Nội, Vietnam Amazon Corporate Services Vietnam Company Limited Full time* Bachelor degree or above in engineering, with 8+ years’ experience in Production or Manufacturing Engineering. - Deep knowledge of manufacturing processes, production concepts, and high-volume manufacturing - Desired experience of managing suppliers covering plastics, mechanical parts Metal Forming, and product final assembly testing and packaging. -...
-
Site Architect
2 weeks ago
Hà Nội, Vietnam CapitaLand Development (Vietnam) Full time**Mô tả công việc**: (Mức lương: Thỏa thuận) - Lead all functions rated to architectural matters at site and supervise team members in coordinating drawings and updating information on project. - Coordinate with other departments/relevant parties for updates and communications of contractor's shop drawings and construction drawings. - Monitor...
-
Site Technical Manager
5 days ago
Hà Nội, Vietnam Amazon Corporate Services Vietnam Company Limited - K62 Full timeBachelor of engineer degree. - 5+ years’ experience in manufacturing and process engineering of CE industrial. - Familiar with electronic circuit and/or hardware test. - Excellent analytical and problem solving skills, including experience in leading root cause and corrective action investigations. - Strong understanding of manufacturing and test...
-
Senior/lead Data Engineer
2 weeks ago
Hà Nội, Vietnam FPT Software Ha Noi Full timeKey Responsibilities: - In charge of the design, development, and optimization of scalable data pipelines and ETL processes. - Architect and manage data storage solutions, including databases and data warehouses. - Collaborate with cross-functional teams to understand data requirements and deliver robust data solutions. - Implement and enforce data quality...