Site Reliability Engineer with Security Clearance
Karsun Solutions Llc
2024-11-07 10:45:41
salary: 155000.00 US Dollar . USD Annual
Herndon, Virginia, United States
Job type: fulltime
Job industry: I.T. & Communications
Job description
Find Your Next at Karsun Solutions and transform your career with the company transforming possible for the US Government. At Karsun, collaboration drives our community. We're committed to building an environment where team members from diverse backgrounds can innovate, learn and grow with us. Here at Karsun, the only limit to your potential is the limit of your curiosity. And because we know well-being empowers us to thrive, we offer robust and comprehensive benefits including: Health, Life & Disability Insurance - Medical, Dental, Life and Disability coverage is paid for by Karsun for full time employees.
Paid Parental Leave
401k Retirement Plan - with pre-tax and post-tax ROTH contribution offerings and immediate vesting with a per pay period match
Generous time off programs including 11 paid holidays per year
Supplemental plans such as Vision, Pet Insurance and 529 Savings Plan
Employee Assistance Program with behavioral health, physical wellness and financial advice
Employee Discounts & Perks
In-house Technical/Skills Training
Join Team Karsun and Find Your Next. Karsun Solutions is an Equal Employment Opportunity (EEO) employer. It is the policy of the Company to provide equal employment opportunities to all qualified applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, age, protected veteran or disabled status, or genetic information. Karsun does not accept unsolicited resumes through or from search firms or staffing agencies. All unsolicited resumes will be considered the property of Karsun and Karsun will not be obligated to pay a placement fee. As a Site Reliability Engineer, you will help build out and run production environments, automate operations and maintain and support infrastructure. Drive and establish Service level objectives (SLOs) and metrics to meet reliability expectations of multiple applications Deploy and manage applications into Kubernetes container platforms such as AWS EKS, or OpenShift
Monitor systems and applications, proactively identifying and resolving any performance bottlenecks or availability issues.
Develop and maintain monitoring tools, alerts, and dashboards to provide visibility into system health and performance.
Implement and support integrated CI/CD pipelines for on-premises and/or cloud assets using tools such as Jenkins, GitHub/Bitbucket, Nexus/Artifactory
Conduct post-incident analyses to identify root causes and implement preventive measures to avoid future incidents
Implement, deploy and maintain infrastructure as code (IaC) for provisioning infrastructure using AWS CloudFormation or Terraform
Maintain, monitor, and improve application configurations using tools such as Ansible, Packer, Puppet, or Chef Designs, deploys, monitors, and manage cloud solutions in public environments such as AWS or Azure, or private environments.
Design, build, and maintain automated monitoring and notification services to support fault tolerant and highly available systems and metrics using tools such as AWS CloudWatch, EFK, and Prometheus Required: Bachelor's degree in computer science, Engineering, or a related field and 8-10 years of relevant experience
5+ years of experience supporting operations and maintenance for cloud-native applications in production that are fault-tolerant, self-healing, scalable and high available,
Deep understanding of cloud computing platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Kubernetes).
Experience with monitoring, logging, and observability tools like DataDog, AWS Cloudwatch, ELK, Prometheus, Splunk etc. Knowledge of infrastructure as code tools (e.g., Terraform, Ansible, ArgoCD) and CI/CD pipelines.
Experience deploying enterprise software within AWS Services such as EKS, RDS, EC2, Elastic Load Balancers, Lambda, DynamoDB, multi regions, and API Gateway
Strong problem-solving and analytical skills, with a keen attention to detail.
Certifications such as AWS Certified DevOps Engineer or Google Professional Cloud DevOps Engineer are a plus.
Ability to obtain and maintain a Public Trust clearance.
Preferred: Understanding of modern architecture, e.g. micro-services, EDA, etc., and cautious against overcomplexity and overengineering
Experience with monitoring and metrics platforms, e.g. New Relic, Prometheus, InfluxDB, Grafana, Splunk, etc
Experience designing and operating distributed systems and cloud infrastructure at scale
Candidates in the eastern, central or mountain time zones
Experience supporting US federal government contracts In accordance with pay transparency guidelines, the proposed salary range for this position is $120,000 to $155,000. Final salary will be determined based on various factors such as relevant skills, experience and certifications.