Job Requirements
The Site Reliability Engineer II implements reliable infrastructure solutions according to the strategic direction of the team. They enable efficient delivery of value to our customers through effective infrastructure and fast pipelines, empowering application developers to deliver at the speed desired for the project. The Site Reliability Engineer II also supports operations of the production environments including observability and troubleshooting issues (sometimes outside of normal business hours if mandated by the contract). The Impact You Will Create:
- In this role, you will focus on ensuring the availability, reliability, and performance of our multitenant, microservices application suite. You will collaborate closely with cross-functional teams to troubleshoot issues, automate processes, and build scalable, resilient systems. You will learn the nuances of the entire suite of applications and its infrastructure, which will facilitate your missions of 24/7/365 tier 2/3 outage response, and improving the efficiencies. This role requires an active Secret clearance.
Your Responsibilities
- Monitor system health, define Service Level Indicators (SLIs), and ensure adherence to Service Level Objectives (SLOs).
- Respond promptly to outages, conduct root cause analyses, and implement durable solutions to prevent recurrence.
- Collaborate with development and DevOps teams to optimize and maintain Kubernetes environments and CI/CD pipelines.
- Develop and refine automation scripts to enhance system reliability, including automated recovery and self-healing capabilities.
- Build and maintain observability frameworks, integrating metrics, logging, and tracing tools for proactive issue identification.
- Contribute to performance tuning and scalability improvements across the application stack.
- Document incident responses and contribute to a knowledge base to foster a culture of continuous improvement.
- Participate in an on-call rotation to provide 24/7/365 support for mission-critical systems.
- Decomposes tasks into discrete objectives to serve the strategic direction of the team.
- Implements infrastructure and pipeline solutions with minimal direction in accordance with the project/organization technical standards.
- Resolves technical issues in software infrastructure like software-defined networks, databases, and compute resources.
- Advises the team on specific implementation options that meet business requirements.
- Designs and automates log collection, storage, and analysis.
- Automates security processes and vulnerability remediation.
- Contributes actively in team Agile processes through discussion and/or preparation.
- Manages the security posture of the system and helps maintain compliance with government regulations.
- Provides feedback to improve the team's technical procedures.
- Delivers scripts and tooling that interacts with external APIs.
- Collaborates with other engineers and designers to implement features that meet design specifications and deliver business value.
Work Experience
Technical Skills:
- 6 years of experience in site reliability, systems engineering, or DevOps roles.
- Proficiency in one or more programming/scripting languages (e.g., Python, Go, Java, Bash).
- Strong understanding of distributed systems, microservices architecture, and RESTful API design.
- Hands-on experience with Kubernetes and container orchestration.
- Familiarity with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, ELK stack, or Datadog). Experience with Elastic will be highly helpful with this position.
- Hands-on experience with incident response, including designing and improving incident management processes.
- Expertise in Observability practices, including metrics, logs, traces, and understanding of distributed tracing tools (e.g., OpenTelemetry).
- Strong problem-solving skills with a focus on building resilient, fault-tolerant systems.
- Experience with cloud platforms like AWS, Azure, or Google Cloud.
- Strong proficiency with Linux systems (especially RHEL) and scripting in Bash or Python. Skilled in managing infrastructure using Terraform, AWS CloudFormation, and configuration tools like Chef, Ansible, or Puppet.
- Deep understanding of application, network, and infrastructure security principles. Experience with distributed systems and scalable application architecture.
- Proficient in Git, including common and advanced commands (e.g., branching, merging, rebasing).
- Comfortable integrating with external APIs using REST, JSON, and authentication tokens. Familiarity with container orchestration platforms such as Kubernetes or OpenShift.
Soft Skills:
- Excellent communication skills and a collaborative mindset.
- Strategic thinking and strong problem-solving under pressure.
- Effective communication and collaboration across cross-functional teams.
- Self-motivated with initiative to drive personal and technical growth.
- Ability to mentor others and influence team practices through expertise and leadership.
Clearance:
- An ACTIVE and MAINTAINED "SECRET" Federal or higher
Other Requirements:
- Must be willing to do shift work to provide 24/7/365 coverage.
- Must be within 45 minutes drive of an IL6 workstation location (e.g., SIPR cafe, SCIF)
Education:
- Bachelor's degree in Computer Science, Engineering, or related field. Master's degree
- Have to have SEC+ or higher certification or ability to obtain it within six months from hire.
Physical Requirements: Ability to sit for extended periods while working on a computer or during meetings. Must be able to travel occasionally to client sites or company meetings. Ability to communicate effectively via phone, email, and in-person, requiring clear speech, listening, and written communication skills. Ability to move within an office environment, including reaching for files, using office equipment, and occasional light lifting (up to 10 pounds).
Benefits
Life at Fearless We're a digital integration consultancy on a mission to build a better tomorrow. At Fearless, we combine technology, people, and organizational development to solve meaningful problems. Through iterative development, we deliver smart, user-friendly solutions that make tech work better-for everyone. But great tech is just part of the story. What really makes us Fearless is our Purple Culture. What Makes Us Purple? Being Purple means you:
- Are valued as a whole person-not just a job title
- Get matched with work that plays to your strengths and passions
- Are supported by coaches, not micromanagers
- Have the autonomy and clarity to make decisions and drive impact
- Join a community that celebrates equity, curiosity, and innovation
- Do work that matters-every day
We believe in flexibility, growth, and balance. Our benefits and culture are designed to support you in doing your best work-while making space for what matters to you outside of it. We're proud to be an equal opportunity employer. At Fearless, we're building a workplace that welcomes and respects everyone-across race, gender, age, religion, identity, background, and ability. Compensation at Fearless Fearless is committed to providing a competitive compensation package that will meet your current and future needs. Our philosophy is aimed at rewarding team member contributions, supporting long-term financial growth and security, and overall well-being. We believe in paying people fairly, so we've established a compensation model aimed to ensure everyone at Fearless - regardless of race, ethnicity, gender, sexual orientation, disability, religion, age, nationality, or willingness/ability to negotiate - is consistently paid fairly based on alignment to the needs and requirements of the role. The salary range for this position is: Minimum Salary: $91,554 Salary Midpoint: $119,020 Maximum Salary: $146,486 Hiring Range: $91,554 - $135,000 Benefits at Fearless At Fearless, we take care of our team-because when you're supported, you can do your best work. We offer a flexible, family-friendly environment with benefits designed to support your health, growth, and life outside of work. For Full-Time Team Members (Starting Day One):
- Flexible, life-friendly schedules
- 100% coverage for our medical HSA plan + HSA contributions
- Dental & vision covered 100% for you and your dependents
- Competitive premiums for HMO/PPO and dependent coverage
- 401(k) with 4% match & immediate vesting
- Paid Parental Leave and 12 weeks paid FMLA
- Generous PTO, 11 Federal Holidays, a Birthday Holiday, and Sick Leave
- Up to 15 days for Jury Duty and Bereavement Leave
- Education, wellness, and tech allowances
- Referral bonus: $6K-$12K for each successful referral
- Pet insurance & discount plans
- Employee Assistance Program (EAP)
- Legal support, life insurance, disability coverage
- Part-Time & Interns:
- 8.75 days of safe & sick leave annually
- Eligible for our 401(k) plan with employer contributions
Reasonable Accommodations Fearless is committed to providing reasonable accommodations for applicants and candidates with disabilities. If a reasonable accommodation is needed to participate in the job application or interview process, please contact the Human Resources Department at hr@fearless.tech. So, What's Next? We've refined our hiring approach to make sure every team member is a great fit for Fearless-and that we're a great fit for you, too. If there's alignment, we'll reach out to kick off the interview process. Depending on the role or project, your experience may vary slightly, but it typically includes: Introductory Interview You'll connect with a recruiter to:
- Build rapport and get to know each other
- Review your experience and skills
- Talk through salary expectations and role details
- Set expectations for the rest of the process
Skills + Business Fit Interview This is where we dig deeper to:
- Review findings from any technical assessments
- Walk through situational and values-based questions
- Explore how your approach aligns with Fearless culture and project needs
Some roles may also include customer interviews based on specific project requirements in addition to background check and security clearance requirements.
|