Systems Engineer - Observability & Infra Automation (Central Infrasecurity)

apartmentSynapxe placeQueenstown scheduleFull-time calendar_month 

Company description:

Synapxe is the national HealthTech agency inspiring tomorrow's health. The nexus of HealthTech, we connect people and systems to power a healthier Singapore.

Together with partners, we create intelligent technological solutions to improve the health of millions of people every day, everywhere. Reimagine the future of health together with us at www.synapxe.sg

Job description:

Position Overview

This role sits within the Observability Platform Engineering team, focusing on managing and scaling an Elastic SaaS-based observability platform to enhance infrastructure resiliency. The engineer designs, deploys, and optimizes telemetry pipelines for logs, metrics, and traces using tools like Elastic Stack, Logstash, and supporting infrastructure.

Responsibilities emphasize platform availability (99.99% SLO), security, performance, and self-service capabilities for operations teams.

Candidates should have a strong interest in automation and a proactive attitude towards embracing new technologies and challenges.

This is a 2 year direct contract role.

Role & Responsibilities
  • Manage Elastic SaaS availability, security (RBAC, ILM policies), performance tuning, autoscaling, upgrades, DR testing, patching, and cost optimization via tiered storage
  • Operate Logstash servers, Elastic Agents/policies, and supporting infrastructure such as Nutanix hypervisors and F5 load balancers
  • Maintain observability pipelines for data ingestion, parsing, normalization, dashboards, alerting rules, and Kibana visualizations
  • Integrate with automation platforms (e.g., Ansible), ServiceNow for incident workflows, and AIOps for event correlation and self-healing remediation
  • Troubleshoot ingestion issues, conduct root cause analysis, and collaborate on resiliency metrics and SLOs
  • Collaborate with development and operations teams to instrument applications and infrastructure for better visibility
  • Document processes, configurations, and best practices to ensure knowledge sharing and continuity
Requirements
  • Degree in Computer Science, Information Technology, or a related field
  • Min 5-7 years equivalent practical experience in IT
  • Hands-on experience with modern observability tools and frameworks is preferred but open to candidates with prior experience in monitoring solutions who are looking to transition into modern observability practices (I.e. ELK)
  • Have some Infra/ incident monitoring / data intelligence (e.g. AI, machine learning) background before
  • Experience with cloud-native environments, Nutanix, F5, and integrations for enterprise observability
  • Candidate should have a strong foundation in monitoring and troubleshooting
  • Proficiency in Linux/Unix, scripting (Ruby, Shell, Painless)
  • Strong problem-solving skills and a proactive attitude towards learning new technologies
  • Excellent communication and teamwork skills, with the ability to work effectively in a collaborative environment

Apply Now

NOTE: It only takes a few minutes to apply for a meaningful career in HealthTech - GO FOR IT!!

#LI-SYNX08

starFeatured

Systems Engineer

apartmentCLOUDFLARE, PTE. LTD.placeBukit Merah, 4 km from Queenstown
About the Role Cloudflare is looking for an experienced Systems Engineer to join our Singapore team. In this mid-to-senior level role, you will be responsible for designing, building, and scaling the core infrastructure that powers Cloudflare’s...
apartmentANTlabsplaceKallang, 9 km from Queenstown
We are seeking a proactive and skilled Systems Engineer to join our Service Delivery team. You will be responsible for the deployment of Linux-based applications, systems and related infrastructure for our projects and customers. This role requires...
electric_boltImmediate start

Senior Systems Engineer

apartmentRSM STONE FOREST IT PTE. LTD.placeBukit Merah, 4 km from Queenstown
Job Description:  •  Ensure day-to-day IT system performance, security, stability, and reliability.  •  Provide support to engineers in resolving endpoint and system-related issues.  •  Manage and resolve reported incidents within agreed timelines...