Chaos Engineer with Security Clearance
McBride Consulting LLC (MCB)
2024-11-07 16:40:30
Boston, Massachusetts, United States
Job type: fulltime
Job industry: I.T. & Communications
Job description
McBride is a certified Minority-Owned Business Enterprise (MBE), Service-Disabled Veteran-Owned Small Business (SDVOSB), and Disadvantaged Business Enterprise (DBE) management and IT consulting firm with significant professional services experience that includes grants management, business strategy development, program management, business process improvement, change management, human capital management, and operations. McBride has an exciting opportunity for a Chaos Engineer providing support to the Air Force Life Cycle Management Center/PEO Digital Directorate (AFLCMC/HB). The Air Force Program Execution Office for PEO Digital (AFPEO/HB) has the collective Air Force Materiel Command (AFMC) mission responsibility to manage and execute the modernization, development, testing, production, fielding, and sustainment of the PEO Digital portfolio, which includes over 130 programs for the United States Air Force (USAF) and foreign allies. AFLCMC/PEO Digital Directorate is headquartered at Hanscom Air Force Base, MA with geographically separated operating units CONUS and OCONUS. The Chaos Engineer shall provide Chaos Engineering support that will assist and support all the performance and load test related clarifications and anchoring the product team to conduct their isolated testing and integrated load test to support production migration. The Contractor shall be responsible for working with multiple agile teams, across divisions to help resolve obstacles and ensure successful, high quality software is delivered to the customer. The Contractor shall also provide Chaos Engineering support that will assist in the design, operation, and sustainment of systems and components that cover tasks throughout the acquisition life cycle (requirements analysis through system disposal). Tasks may also include supporting Foreign Military Sales. This is a full time position located in Boston, MA Responsibilities
Duties include, but not limited to: Assist in the design and implementation of an integrated performance and chaos engineering strategy, process, tools, and execution
Assist in the definition and help manage the load testing roadmap for applications
Work with multiple product teams to ensure application availability and set-up of test equipment, test data and execution environments Assisting in redefining the test management framework and ensure that the infrastructure is scalable to evolving products and services
Assist in application go/no go decisions by providing quality metrics by covering different types of test scenarios
Work closely with product managers and key stakeholders to plan and conduct application testing efforts
Coordinate different product teams and assist them in assessing deployment readiness Understanding Application Programming Interfaces, Google Remote Procedure Call (gRPC), abstractions and integration patterns to solve challenging distributed system problems.
Assist in the design, implement, and execute chaos experiments from conception to production deployment
Analyze complex problems in the application space relating to resilience
Assist in performing chaos testing on a distributed system to build confidence in the system's capability to withstand turbulent conditions in production. Assist in the development of automated chaos testing in pre-production systems Design & Architectures Conduct and support all engineering and technical activities required during the material solution analysis, technology, engineering, and manufacturing development, production and deployment, and operation and support phases of the systems acquisition process. Develop/prepare engineering and technical documentation inherently part of the activities listed above (e.g. Analysis of Alternatives (AoA), Systems Engineering Plan (SEP) ), systems review and analysis with written reporting, briefing, and artifact development as required for all of the technical specialty areas defined in this section; and the planning and execution of all necessary technical reviews, assessments, and interchange meetings (e.g. Critical Design Review(CDR), Manufacturing Readiness Assessment (MRA), Reliability Technical Interchange Meeting (TIM and associated artifacts (e.g. Critical Design Review entry and exit criteria) Conduct systems architectural analyses to include system design/design feasibility and state-of-the-art assessment Produce architectures, document planned and existing systems, determine deficiencies, develop alternatives, recommend solutions, and support development of a plan to implement the chosen solution per acquisition and sustainment system engineering processes. This may include traditional architecture development or Model Based Systems Engineering processes. Track and ensure system architecture and interoperability requirements for the baselined system is maintained Detail operational requirements and functional interactions within the system and with other systems, considering life cycle costs, logistics supportability, human factors, et al, to determine if the basic building blocks of the system are defined in sufficient detail to support a development program effort Translate DoD user requirements into system requirements for designing, developing, testing and evaluating systems and subsystems using USAF and DoD approved acquisition methods. Provide engineering analysis of system training, training equipment and simulator requirements; prepare technical documentation; and provide engineering evaluation of equipment and simulator designs
Acquisition
Understand and implement robust DoD acquisition and sustainment systems engineering processes to address the life-cycle requirements of each product support area Prepare and review program specifications for items, processes and materials. Develop and utilize analytical, simulation, MBSE and digital engineering analysis techniques to optimize system performance and/or system effectiveness; establish system sensitivities, define system parameters, evaluate hardware and validate flight test results in terms of overall system relationships and applications. Develop, prepare, review, and integrate selected milestone review documentation, periodic reports, and certifications as identified in current DoD and AF acquisition policy Review, evaluate and monitor acquisition Contractor drawings for completeness, currency, technical accuracy and conformance to requirements, specifications, regulations and quality control to ensure compliance with ASME technical Data Package format and requirements Technical Management
Complete and support requirements definition and analysis, architecture design, implementation, integration, verification and validation, transition, decision analysis, requirements management, risk management, interface management, and configuration management Facilitate the transfer of information, lessons learned and best practices across all customers and contractors serviced under this task order within constraints of proprietary rights Perform market research and trade studies to assess current and future systems to ensure they that meet mission needs/requirements. Conduct, define and evaluate system cost, schedule, and technical performance. Perform reviews, studies (including root cause analysis), audits, evaluations and analyses and provide technical recommendations to the Government. Conduct engineering performance, effectiveness, cost effectiveness, cost performance, life cycle cost, producibility, reliability and maintainability, risk and scheduling trade-off studies Develop USAF approved Test and Evaluation Master Plans (TEMP), test plans and procedures, prepare test readiness reviews and associated test documentation using DoD and USAF standard processes, execute developmental tests and assist in operational test planning/execution, and evaluate tests in support of acquisition and sustainment programs. Participate in integrated logistics support efforts, provisioning and source coding, maintenance data collection, technical order management, configuration management, evaluation of quality deficiency reports, and/or identification of alternate sources of supply for diminishing manufacturing/materiel sources preferred. Training in information technology, computer sciences or engineering Experience in of information technology (e.g., product management, business analysis, project management, performance and scalability consultant role) Experience in performance and load testing tools (Load Runner, JMeter, Visual Studio) Experience in cloud systems (e.g. Azure, Amazon Web Services, Pivotal Cloud Foundry and Google Cloud Platform) Experience with chaos engineering tools like Chaos Monkey as well as supporting tools like Dynatrace, Azure App monitor tool as well as Grafana Experience in programming languages like Java, C#, and Python Strong analytical and problem-solving skills Strong presentation skills with the ability to articulate complex information concisely and clearly Experience drafting effective project or product management documentation including but not limited to status reports and issues/risks logs Working knowledge with virtual machines and microservices Experience in product management using Agile methodology Experience with Agile Central(Rally) and/or Jira McBride Consulting is an Equal Opportunity Employer. McBride Consulting does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need.