We use cookies. Find out more about it here. By continuing to browse this site you are agreeing to our use of cookies.
#alert
Back to search results
New

Site Reliability Engineer, GovCloud Incident Response (GIR)

salesforce.com, inc.
parental leave, 401(k)
United States, Colorado, Denver
Feb 18, 2026

To get the best candidate experience, please consider applying for a maximum of 3 roles within 12 months to ensure you are not duplicating efforts.

Job Category

Software Engineering

Job Details

About Salesforce

Salesforce is the #1 AI CRM, where humans with agents drive customer success together. Here, ambition meets action. Tech meets trust. And innovation isn't a buzzword - it's a way of life. The world of work as we know it is changing and we're looking for Trailblazers who are passionate about bettering business and the world through AI, driving innovation, and keeping Salesforce's core values at the heart of it all.

Ready to level-up your career at the company leading workforce transformation in the agentic era? You're in the right place! Agentforce is the future of AI, and you are the future of Salesforce.

Applications will be accepted until 08/22/2026.

Join our team and contribute to the operational excellence of the Salesforce GovCloud!

Are you passionate about ensuring the reliability and performance of mission-critical cloud services? Salesforce is seeking a talented Site Reliability Engineer to join our dynamic team in our Denver, CO, location, supporting our GovCloud environment. As a key member of our Site Reliability organization, you'll play a vital role in maintaining 99.99% uptime for customer-facing services, proactively addressing issues, and ensuring the security of our data. We foster a collaborative and innovative culture, where you'll work alongside skilled engineers to solve complex problems and drive continuous improvement.

Please Note: This position requires a successful background investigation and the ability to obtain and maintain a specific level of U.S. government background clearance. Details will be provided during the interview process.

Shift Requirements: This role involves shift work, including night shifts, as part of a 24/7 support team. We provide a rotating schedule and ensure adequate compensation for shift differentials.

About the Role:

The Site Reliability team at Salesforce is the backbone of our cloud operations, working around the clock to keep our services available and our customers protected. You will be a crucial part of the GovCloud Incident Response (GIR) team, which maintains the current infrastructure through day-to-day alert response, smart hands support, and comprehensive incident management, including retrospectives and long-term remediation.

Your Responsibilities:

  • Ensure 99.99% uptime for customer-facing services by proactively monitoring and maintaining the health of supporting systems, contributing directly to customer satisfaction and trust.

  • Act in key support roles during major incidents (e.g., Sev0, Sev1) and participate in technical incident reviews for problem management.

  • Contribute to Problem Management by populating and participating in Root Cause Analyses (RCAs) and handing them off to the Global Solutions team.

  • Ensure all work carried out by the Site Reliability team aligns with the company's internal compliance policies and directives.

  • Collaborate with technical staff to solve complex technical issues and customer concerns.

  • Lead and mentor other team members in staying abreast of industry innovations and technologies, and assist in team development growth.

  • Thrive in a fast-paced environment, solving sophisticated issues quickly and successfully balancing multiple priorities.

  • Automate the detection and resolution of recurring issues in the production environment.

  • Help create and improve current processes to reduce operational and engineering toil, including the implementation of AI-driven automation for routine tasks.

Basic Requirements:

  • Citizenship: U.S. citizen (U.S. born or naturalized) who does not hold dual citizenship. You agree to complete a Minimum Background Investigation (MBI) for a Moderate Public Trust position with the U.S. federal government or other clearances as deemed appropriate for the role.

  • Education: Bachelor's degree in Computer Science, Engineering, Information Technology, or a related technical field.

  • Experience: Systems engineering experience in enterprise-scale internet service engineering or support role.

  • Technical Skills:

    • Expertise in TCP/IP related technologies (networking protocols, network programming, etc.).

    • Expertise in CLI enterprise support of Unix variants (Linux/Solaris/BSD), with significant exposure to Red Hat Enterprise Linux and Solaris.

    • Strong understanding of monitoring security systems and administration.

    • Experience provisioning, operating, and running AWS/C2S based infrastructure and systems.

    • Proficiency in scripting with Python, Go, or other languages.

  • Communication: Strong written and oral communication skills.

  • Incident Management: Past experience in Incident Management and a good understanding of ITIL service operations.

  • Availability: Ability to participate in a 24/7 on-call rotation supporting large data center operations and be available for shift work.

Preferred Qualifications:

  • Prior experience with Chef/Puppet or automated deployment. (This helps streamline our infrastructure management.)

  • Prior experience with Jenkins/Bamboo/Spinnaker pipeline execution. (This aids in our continuous integration and deployment processes.)

  • Experience supporting and maintaining monitoring and alert systems. (Ensures proactive issue detection.)

  • Experience supporting and maintaining Java applications. (Supports our application stack.)

  • Hands-on experience configuring and running AWS (Amazon Web Services) using the CLI/SDKs. (Essential for our cloud infrastructure.)

  • Certifications in Linux+, RedHat, and AWS. (Validates technical expertise.)

  • Experience supporting and leading Kubernetes-based applications and services. (Supports our containerized environment.)

  • Familiarity with Agile Process and DevOps practices. (Enables efficient workflow and collaboration.)

  • Experience participating in blameless retrospectives, learning from incidents, and conducting post-incident investigations, with an interest in how AI can assist in root cause analysis and pattern identification. (Promotes a culture of continuous improvement.)

  • Working knowledge of and interest in resilience engineering, including concepts such as Safety II and proactive problem prevention, leveraging AI for proactive risk identification and system optimization. (Enhances system reliability.)

  • Experience with AI/ML concepts and tools for operational insights, predictive maintenance, or intelligent automation.

  • Familiarity with data analysis and visualization tools to interpret AI-generated insights.

This candidate must be a U.S. citizen (U.S. born or naturalized) who does not hold dual citizenship and agrees to complete a U.S. federal government Minimum Background Investigation (MBI) for a Moderate Public Trust position.

Apply now to join our dynamic team and help us drive incident response efficiency and system resilience.

Unleash Your Potential

When you join Salesforce, you'll be limitless in all areas of your life. Our benefits and resources support you to find balance and be your best, and our AI agents accelerate your impact so you can do your best. Together, we'll bring the power of Agentforce to organizations of all sizes and deliver amazing experiences that customers love. Apply today to not only shape the future - but to redefine what's possible - for yourself, for AI, and the world.

Accommodations

If you require assistance due to a disability applying for open positions please submit a request via this Accommodations Request Form.

Posting Statement

Salesforce is an equal opportunity employer and maintains a policy of non-discrimination with all employees and applicants for employment. What does that mean exactly? It means that at Salesforce, we believe in equality for all. And we believe we can lead the path to equality in part by creating a workplace that's inclusive, and free from discrimination. Know your rights: workplace discrimination is illegal. Any employee or potential employee will be assessed on the basis of merit, competence and qualifications - without regard to race, religion, color, national origin, sex, sexual orientation, gender expression or identity, transgender status, age, disability, veteran or marital status, political viewpoint, or other classifications protected by law. This policy applies to current and prospective employees, no matter where they are in their Salesforce employment journey. It also applies to recruiting, hiring, job assignment, compensation, promotion, benefits, training, assessment of job performance, discipline, termination, and everything in between. Recruiting, hiring, and promotion decisions at Salesforce are fair and based on merit. The same goes for compensation, benefits, promotions, transfers, reduction in workforce, recall, training, and education.

In the United States, compensation offered will be determined by factors such as location, job level, job-related knowledge, skills, and experience. Certain roles may be eligible for incentive compensation, equity, and benefits. Salesforce offers a variety of benefits to help you live well including: time off programs, medical, dental, vision, mental health support, paid parental leave, life and disability insurance, 401(k), and an employee stock purchasing program. More details about company benefits can be found at the following link: https://www.salesforcebenefits.com. At Salesforce, we believe in equitable compensation practices that reflect the dynamic nature of labor markets across various regions. The typical base salary range for this position is $117,200 - $176,700 annually. The range represents base salary only, and does not include company bonus, incentive for sales roles, equity or benefits, as applicable.
Applied = 0

(web-54bd5f4dd9-cz9jf)