Platform Monitoring Engineer
We are seeking a Platform Monitoring Engineer to ensure the optimal performance, reliability, and observability of our infrastructure and applications. The ideal candidate will have strong expertise in monitoring tools and database performance, with a proactive approach to identifying and resolving issues. Experience with Kubernetes and hands-on DevOps tasks is a plus.
Responsibilities:
- Design, implement, and maintain platform monitoring systems using Grafana, Loki, Mimir, and Tempo.
- Monitor and analyze Azure SQL database performance, optimize queries, and identify bottlenecks.
- Develop dashboards, alerts, and reports to ensure real-time visibility into system health and performance.
- Collaborate with development and infrastructure teams to enhance observability and incident response processes.
- Investigate and troubleshoot platform and application performance issues.
- Continuously improve monitoring strategies to ensure scalability and reliability.
- Support and monitor Kubernetes clusters and related workloads, ensuring smooth operation and performance.
Skills and Qualifications:
- Strong experience with Grafana, Loki, Mimir, Tempo, and Azure SQL.
- Solid understanding of performance monitoring and query optimization.
- Familiarity with Kubernetes is a plus.
- Hands-on experience in DevOps tasks such as workstation setup, environment configuration, and network support.
- Excellent problem-solving and collaboration skills.
- Ability to work proactively to ensure platform stability and observability.
Workplace: Suthep, Chiang Mai
Salary: Depends on skills and experience.