Site Reliability Engineer SRE Job at System One, Virginia

bVEzMk1oYXU4TGQyVHg3alByUStDRVFnRGc9PQ==
  • System One
  • Virginia

Job Description

Site Reliability Engineer (SRE)
Active Secret Clearance Required
Direct Hire
Onsite at Langley AFB, Hampton, VA
Various Shifts Available

ALTA IT Services has direct hire openings for two Site Reliability Engineers SRE’s to support mission-critical DOD programs, onsite at Langley AFB, Hampton, VA. Various shifts are available. An active Secret clearance is required.

In this role, you will focus on ensuring the availability, reliability, and performance of a multi-tenant, microservices application suite. You will collaborate closely with cross-functional teams to troubleshoot issues, automate processes, and build scalable, resilient systems. You will learn the nuances of the entire suite of applications and their infrastructure, which will facilitate your missions of 24/7/365 tier 2/3 outage response and improve the efficiencies of the program.

Key Responsibilities:
  • Contribute to performance tuning and scalability improvements across the application stack.
  • Document incident responses and contribute to a knowledge base to foster a culture of continuous improvement.
  • Participate in an on-call rotation to provide 24/7/365 support for mission-critical systems.
  • Monitor system health, define Service Level Indicators (SLIs), and ensure adherence to Service Level Objectives (SLOs).
  • Respond promptly to outages, conduct root cause analyses, and implement durable solutions to prevent recurrence.
  • Collaborate with development and DevOps teams to optimize and maintain Kubernetes environments and CI/CD pipelines.
  • Develop and refine automation scripts to enhance system reliability, including automated recovery and self-healing capabilities.
  • Build and maintain observability frameworks, integrating metrics, logging, and tracing tools for proactive issue identification.
Qualifications:
  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
  • A minimum of 5 years of experience in site reliability, systems engineering, or DevOps roles.
  • Proficiency in one or more programming/scripting languages (e.g., Python, Go, Java, Bash).
  • Strong understanding of distributed systems, microservices architecture, and RESTful API design.
  • Hands-on experience with Kubernetes and container orchestration.
  • Familiarity with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, ELK stack, or Datadog). Experience with Elastic will be highly helpful with this position.
  • Hands-on experience with incident response, including designing and improving incident management processes.
  • Expertise in Observability practices, including metrics, logs, traces, and understanding of distributed tracing tools (e.g., OpenTelemetry).
  • Strong problem-solving skills with a focus on building resilient, fault-tolerant systems.
  • Excellent communication skills and a collaborative mindset.
  • Have to have SEC+ or higher certification or ability to obtain it within six months from hire.
  • Must be willing to do shift work to provide 24/7/365 coverage.
Preferred Qualifications:
  • Experience with cloud platforms (e.g., AWS) and their associated managed services.
  • Knowledge of database management and optimization for systems (e.g., PostgreSQL)
  • Familiarity with Infrastructure as Code (IaC) tools (e.g., Terraform,  CloudFormation).
  • Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
Please send an updated resume to Melissa McNally via mmcnally@altaits.com for consideration.

System One, and its subsidiaries including Joulé, ALTA IT Services, and Mountain Ltd., are leaders in delivering outsourced services and workforce solutions across North America. We help clients get work done more efficiently and economically, without compromising quality. System One not only serves as a valued partner for our clients, but we offer eligible employees health and welfare benefits coverage options including medical, dental, vision, spending accounts, life insurance, voluntary plans, as well as participation in a 401(k) plan.

System One is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, age, national origin, disability, family care or medical leave status, genetic information, veteran status, marital status, or any other characteristic protected by applicable federal, state, or local law.

#M-MM1
#LI-MM1
#DI-MM1

Ref: #855-IT Baltimore

Job Tags

Local area, Shift work,

Similar Jobs

Rubber and Road Creative

Project Manager part-time/freelance and/or contract Job at Rubber and Road Creative

 ...and Road Creative is looking for a project based or part time freelance Project/Account Manager to work with the Client Services team facilitating smooth execution...  ...) Qualifications Must be US based, role is remote but works primarily with an internal team on PST and... 

Claire Myers Consulting

Precision Grind Operator/ Technician Job at Claire Myers Consulting

 ...Job Title: Precision Grind Operator/ Technician Location: Buellton, CA 93427 Company: Excelta About Us: Our client is a family-owned distributor and manufacturer of tweezers, pliers, cutters, and other small assembly hand tools. We prioritize excellence... 

Adecco

Senior CNC Machine Programmer (Sheet Metal - Maritime) Job at Adecco

To expedite consideration, please email your resume to ****@*****.*** and ****@*****.*** PLEASE NOTE: THIS IS NOT A SOFTWARE PROGRAMMER POSITION - no need to apply. Applicants must also be U.S. citizens due to requirements for access...

Radiant Digital

Senior Mechanical Designer Job at Radiant Digital

 ...businesses across the USA, Canada, the Middle East, and Southeast Asia. On the federal side, we collaborate with agencies such as NASA, the Department of State (DOS), the IRS, ACL, ACF, USDA, and many others, as well as numerous state and local government entities.... 

John H. Carter Company, Inc.

Fleet Manager - Entry Level Job at John H. Carter Company, Inc.

 ...requirements to remain in good standing. Education and/or Work Experience ~ High School or Equivalent ~2+ years of previous fleet...  ...accepting unsolicited assistance from search firms/employment agencies for this employment opportunity. Please, no phone calls or...