Please scroll down, To apply

Performance Manager (Site Reliability Engineering Manager) with Security Clearance

hiring now
New job

Insight Xcite LLC 215000.00 US Dollar . USD Per annum

2024-09-22 06:35:11

Job location Herndon, Virginia, United States

Job type: fulltime

Job industry: I.T. & Communications

Job description

Pay
$175,000.00 - $215,000.00 per year
Job Type
Full-time On-Site Benefits
Health/Dental/Vision insurance - Employer Covered
Paid time off/Flex Time (unlimited PTO)
401(k) 5% salary - no matching requirement, immediately vested
Flexible schedule
Referral program
Professional development assistance - Employer Covered
Bonus/Straight Pay
Long/Short Term Disability Insurance - Employer Covered
Life Insurance - Employer Covered Job Title: Performance Manager Location: Herndon, VA Company Overview: Insight Xcite is a government contractor who specializes in delivering data driven solutions to senior leaders and executives for informed decisions making. We are committed to delivering high-impact solutions that enable our customers to reach new levels of achievement and expectations. As a Performance Manager, you will play a critical role in ensuring the availability, scalability, and efficiency of our infrastructure. Responsibilities: Performance Measurement Framework:
Develop and implement a comprehensive performance measurement framework.
Define service performance indicators (SLIs), service level objectives (SLOs), and service level agreements (SLAs) related to system availability, response time, and reliability.
Collaborate with cross-functional teams to collect relevant data and establish baseline metrics. Monitoring and Analysis:
Monitor system performance using tools such as Prometheus, Grafana, and New Relic.
Analyze performance data to identify bottlenecks, inefficiencies, and areas for improvement.
Work closely with SREs to ensure alignment with site reliability engineering principles. Capacity Planning:
Forecast system capacity requirements based on historical data and growth projections.
Optimize resource allocation to meet performance goals.
Provide recommendations for scaling infrastructure as needed. Incident Response and Root Cause Analysis:
Participate in incident response activities during system outages or performance degradation.
Conduct thorough root cause analysis to prevent recurrence.
Collaborate with SREs to implement preventive measures. Documentation and Reporting:
Document performance-related processes, procedures, and best practices.
Generate regular performance reports for stakeholders.
Communicate performance insights to technical and non-technical audiences. Requirements:
Proven experience as a Performance Manager or similar role.
Strong understanding of site reliability engineering (SRE) principles.
Familiarity with monitoring tools and performance analysis techniques.
Excellent communication and collaboration skills. Join our team and contribute to building a robust performance measurement framework that ensures the reliability and availability of our critical systems. If you're passionate about performance optimization and enjoy working at the intersection of technology and operations, we'd love to hear from you.

Inform a friend!

<!– job description page –>
Top