Israel Job Openings
Millennium Management
Site Reliability Engineer
Ramat Gan
August 28, 2024
Site Reliability Engineer
We are seeking an experienced Site Reliability Engineer (SRE) specialized in the Observability space to join our team. This role will be responsible for the design and implementation of observability solutions that ensure the reliable, performance, and scalable infrastructure. In addition, this role will involve reviewing our current observability stack, planning for future enhancements, implementing new solutions, and collaborating with developers to create actionable insights through effective dashboards and automated alerting systems. The ideal candidate will have a strong background in analytics and experience with advanced monitoring techniques to help us achieve metrics baselining, anomaly detection, and enhanced correlation and causation analysis.
Our Israel office is located in the Bursa area of Ramat Gan and offers a hybrid work model.
As a global firm, English proficiency is a must.
Responsibilities
Required Skills:
We are seeking an experienced Site Reliability Engineer (SRE) specialized in the Observability space to join our team. This role will be responsible for the design and implementation of observability solutions that ensure the reliable, performance, and scalable infrastructure. In addition, this role will involve reviewing our current observability stack, planning for future enhancements, implementing new solutions, and collaborating with developers to create actionable insights through effective dashboards and automated alerting systems. The ideal candidate will have a strong background in analytics and experience with advanced monitoring techniques to help us achieve metrics baselining, anomaly detection, and enhanced correlation and causation analysis.
Our Israel office is located in the Bursa area of Ramat Gan and offers a hybrid work model.
As a global firm, English proficiency is a must.
Responsibilities
- Conduct thorough reviews of our existing observability stack to identify areas for improvement and optimization
- Collaborate with the team to plan and design the next version of our observability infrastructure
- Assist in the implementation of the new observability stack, ensuring seamless integration and minimal disruption
- Create and maintain insightful and actionable dashboards that provide clear visibility into system performance without adding unnecessary noise
- Review existing alerts and work closely with developers to automate alert handlers for self-healing systems
- Utilize your experience in analytics to perform metrics baselining and anomaly detection, ensuring our systems are operating optimally
- Explore and integrate AI tools to enhance our correlation and causation analysis capabilities
- Develop and maintain necessary components such as metrics exporters and self-service tools
Required Skills:
- Demonstrated experience as a Site Reliability Engineer, Observability Engineer, or similar role in software development
- Hands-on experience with the Prometheus ecosystem
- Ability to design and develop code in Python or Go
- Acute drive to automate manual operations and processes
- Strong understanding of Linux operating systems
- Hands-on experience with configuration management tools such as Ansible, Salt Stack, or Terraform
- Experience in managing and scaling distributed systems
- Strong sense of ownership and integrity, demonstrated through clear communication and collaboration
- Excellent troubleshooting and problem-solving skills
- Ability to communicate complex concepts clearly with both stakeholders and developers
New Job Alerts
Teva Pharmaceuticals
Vice President Transformation, Work Force Planning and Analytics
Tel Aviv-Yafo
November 6, 2024
View Job DescriptionLooking for similar job?
Veeva Systems
Site Reliability Engineer
Tel Aviv-Yafo
FULL TIME
September 11, 2024
View Job DescriptionVeeva Systems
Principal Site Reliability Engineer
Tel Aviv-Yafo
FULL TIME
September 11, 2024
View Job DescriptionSiemens Energy
Site Maintenance Manager
Rosh Ha`Ayin
FULL TIME
September 17, 2024
View Job DescriptionSee What’s New: Millennium Management Job Opportunities
Millennium Management
Quantitative Researcher (Treasury)
Tel Aviv-Yafo
October 13, 2024
View Job DescriptionMillennium Management
Quantitative Developer - Python
Tel Aviv-Yafo
October 7, 2024
View Job DescriptionMillennium Management
Full Stack Developer (C#)
Tel Aviv-Yafo
September 26, 2024
View Job DescriptionNew Job Alerts
Teva Pharmaceuticals
Vice President Transformation, Work Force Planning and Analytics
Tel Aviv-Yafo
November 6, 2024
View Job Description