Site Reliability Engineer (SRE)
Esker is a worldwide leader in AI-driven process automation software, helping financial and customer service departments digitally transform their source-to-pay (S2P) and order-to-cash (O2C) cycle. Companies use our cloud-based solutions to drive greater efficiency, accuracy, visibility and cost savings throughout their S2P and O2C processes.
POSITION : Site Reliability Engineer (SRE)
We’re seeking a Site Reliability Engineer to grow our SRE team in Malaysia, part of a global SRE organization of 35+ professionals responsible for the reliability, scalability, and performance of our multi-tenant SaaS platform.
In this role, you’ll work hands-on with large-scale production systems, collaborating with experienced engineers while helping drive automation, observability, and reliability improvements across the platform. This is an exciting opportunity for someone moving from systems administration into SRE, or for an early-career SRE looking to deepen their expertise across a broad and modern technology stack—all while supporting a platform used by millions of users worldwide.
Responsibilities
As a member of the operations team for the Asia region, you will take part in first- and second-level incident handling on the Esker On Demand SaaS platform. Your role includes qualifying incidents affecting customers in Asian time zones, resolving them when possible, or escalating to teams in France or the United States.
Your main focus will be platform operations:
Ensure the operational stability of the Esker On Demand SaaS platform, with a target of 24/7 availability
Monitor production systems (applications, databases, servers, etc.)
Maintain the security of all platform services
You will also contribute to ongoing platform improvements:
Design and implement infrastructure to support growth
Improve and automate administration and monitoring tools
Work closely with Ops teams in France and the US, as well as Support and Development teams
Environment
You'll work in a hybrid cloud environment with infrastructure spanning Azure and on-premises data centers. The SRE role at Esker is versatile and may involve working with all technologies of our stack:
Azure, Hyper-V, Windows Server, Red Hat Linux, Fortinet, F5 BigIP
PostgreSQL, Elasticsearch, Redis, Prometheus, Kibana
Ansible, Terraform, Kubernetes, Azure DevOps, PowerShell
AI-based tools for decision support and development assistance
You will work in an international environment with daily interactions in English. We do not expect you to know all these technologies. We are looking for someone curious, with solid fundamentals in systems and networking, and an interest in automation and Infrastructure as Code.
Requirement/Qualification
Bachelor's degree in Computer Science or related field preferred (equivalent experience considered)
Experience operating and administering Windows and Linux servers
Interest in improving infrastructure through automation and scripting
Open-minded, pragmatic, and willing to learn
Customer-oriented, with attention to service quality
Proficiency in English and Bahasa Malaysia is mandatory. Additional language skills such as Chinese, Cantonese or French are advantages.
- Division
- Esker Asia
- Locations
- Kuala Lumpur, Malaysia
- Remote status
- Hybrid
- Employment type
- Full-time