What are the responsibilities and job description for the Senior SRE Engineer position at ApTask?
Title: Senior SRE Engineer
Job Description:
- We are seeking a highly skilled and experienced Senior Site Reliability Engineering (SRE) Engineer to lead our SRE team in ensuring the reliability, scalability, and performance of our production systems.
- The ideal candidate will have a strong background in cloud infrastructure, automation, and system monitoring, with excellent leadership and communication skills to collaborate across teams and foster a culture of operational excellence
Responsibilities:
- Design and develop enterprise-grade APIs and configuration solutions.
- Contribute to enterprise and application architecture design.
- Lead observability initiatives including monitoring, alerting, and incident response.
- Build and maintain dashboards and alerting systems using Grafana, Prometheus, Splunk, etc.
- Create and maintain detailed runbooks for operational procedures and incident handling.
- Define and monitor SLAs, SLOs, and KPIs for critical services.
- Collaborate with architecture, development, and security teams to ensure system reliability.
- Evaluate and adopt new technologies to improve system performance and maintainability.
Requirements:
- Strong background in IT infrastructure, cloud platforms (AWS, Azure, GCP), and SRE practices.
- Experience in enterprise and application architecture.
- Proven experience in building APIs and backend services.
- Hands-on experience with tools:
- Monitoring & Observability: Grafana, Prometheus, Splunk
- ITSM & Operations: ServiceNow, OpsRamp
- Project & Incident Tracking: JIRA
- Experience in building alerts, dashboards, and operational runbooks.
- Experience managing distributed systems and large-scale production environments.
- Strong leadership, communication, and problem-solving skills.
- Ability to quickly learn and adapt to new technologies and environments.
Preferred, but not required:
- Exposure to OpenShift and Azure cloud platforms.
- Certifications: SRE Foundation, ITIL, or relevant cloud certifications.
Salary : $75 - $80
Senior Platform Engineer (DevOps/SRE/Infra)
coast -
New York, NY