Senior Software Engineering Lead, Resilience and Chaos Engineering
GOOGLE ASIA PACIFIC PTE. LTD. Pasir Ris Full-time
Role
In this role, you will establish and lead an engineering team dedicated to the stability and endurance of our robotics software platform. You will design systems that proactively identify vulnerabilities within our APIs, SDKs, web interfaces, and cloud-to-edge communication layers. By simulating scenarios such as AI model inference timeouts, high network latency, data pipeline congestion, and malformed input, you will ensure the platform maintains a safe and predictable state even when the environment is not. You will guide a specialized team in developing automated frameworks that replicate real-world disruptions, thus providing a dependable infrastructure for the developers building the next generation of AI-driven robotics.
This responsibility includes developing the necessary monitoring tools to gain deep insights into overall system health. As a key technical leader, you will collaborate with world-class engineering teams in Mountain View and Munich to synchronize resilience strategies and set global standards for software reliability.
How your work moves the mission forward
- Create automated resilience tests focusing on service boundaries and hybrid environments (on-prem and Cloud).
- Bolster the robustness of AI integrations by implementing failure injection within data pipelines.
- Deploy fuzzing and property-based testing techniques platform-wide to guarantee graceful degradation.
- Enhance the stability of developer tools and frontend systems against latency and service interruptions.
- Cultivate a culture of reliability through engineer mentorship in defensive programming and by spearheading global "Game Day" exercises.
- Construct observability tools to monitor and analyze holistic system health.
- 4-year degree in Computer Science or equivalent professional experience.
- At least 5 years of experience in software engineering.
- Demonstrable experience with cloud computing.
- Proven ability to lead a team, providing architectural guidance and fostering professional growth for other engineers in a global setting.
- Experience with one of Go, Python, or C++.
- Strong communication skills.
- Distributed Systems Architecture: Strong experience building and debugging hybrid software environments where local runtimes interact with cloud-hosted services.
- Reliability Engineering Patterns: Deep understanding of software patterns for resilience, such as circuit breaking, retries with backoff, and bulkhead isolation.
- Automated Testing Proficiency: Expertise in creating frameworks for fault injection, property-based testing or coverage-guided fuzzing.
- Full-Stack Technical Knowledge: Competency in systems-level languages and an understanding of how to build resilient frontend interfaces.
- Infrastructure & Tooling: Hands-on experience with container orchestration (Kubernetes/Docker) and CI/CD pipelines to automate failure simulations.
- Robotics Software Experience: Hands-on experience with robotics frameworks or complex hardware-interfacing software.