Site Reliability Engineer (SRE)

Spectraforce

US

2 hours ago

Similar Jobs

Spectraforce

2 hours ago

Job Description

Title: Site Reliability Engineer (SRE)
Location: Remote
Duration: 6 months

Top Skills Required:
1. SRE Monitoring
2. Dynatrace
3. Azure Kubernetes Services

Role Overview:

We are seeking a highly skilled Site Reliability Engineer (SRE) to own the overall health, availability, performance, and resilience of our enterprise platform.
The platform spans SQL Server, .NET, Java, React.js, Microservices, Kafka, and operates in a hybrid cloud environment on Azure and On Premises.
The SRE will lead reliability engineering practices across the stack, manage infrastructure deployment pipelines using Terraform, drive application deployments through GitHub and Azure DevOps, ensure timely remediation of security vulnerabilities, and implement world class observability using Dynatrace and Splunk.

Key Responsibilities
Platform Reliability & Operations

Own the end to end health, uptime, performance, and reliability of the platform across cloud (Azure) and on prem environments.
Ensure resilience across application layers: .NET, Java, React.js, Microservices, and backend systems such as SQL Server and Kafka.
Lead incident management, root cause analysis, and post incident reviews with a focus on continuous improvement.

Infrastructure Engineering & Automation

Design, implement, and maintain cloud and on prem infrastructure using Terraform (IaC).
Own and optimize CI/CD pipelines for infrastructure and applications in:
GitHub Actions
Azure DevOps
Improve deployment automation, reliability, and release processes across all teams.

Observability, Monitoring & Proactive Operations

Implement and enhance monitoring, alerting, dashboards, and analytics using:
Dynatrace (APM, RUM, synthetic monitoring, logs, metrics)
Splunk (log search, correlation, alerting)
Build proactive monitoring workflows to detect issues before they impact customers.
Own SRE metrics such as SLOs, SLIs, Error Budgets, MTTR, MTBF, availability KPIs, and system productivity metrics.
Performance tuning of the database / application services.

Security & Compliance

Ensure all platform and application security vulnerabilities are identified and remediated on time.
Partner with cybersecurity to ensure compliance with enterprise standards and policies.
Automate security scans and integrate them into CI/CD pipelines.

Performance & Scalability

Conduct performance analysis, load testing, and tuning across:
Microservices
SQL Server databases
Kafka clusters
Front end React.js applications
Partner with engineering teams to design scalable, reliable system architectures.

Collaboration & Leadership

Collaborate with development, architecture, infrastructure, and security teams.
Advocate for SRE and DevOps culture—automation, reliability engineering, blameless postmortems.
Mentor developers and engineers on reliability best practices and tools.

Required Qualifications:

5+ years of experience in SRE, DevOps, or Platform Engineering roles.

Strong expertise in:

SQL Server administration and performance tuning
.NET, Java, Microservices architectures
React.js fundamentals

Hands on experience with:

Azure Cloud services (VMs, AKS, App Services, Networking)
On prem servers and hybrid integrations
Terraform (writing, testing, maintaining modules)
CI/CD with GitHub and Azure DevOps

Proficiency with observability tools:

Dynatrace (preferred)
Splunk
Experience with Kafka (producers, consumers, performance, tuning).

Strong understanding of SRE fundamentals:

SLO/SLI design
Error budgets
Distributed systems concepts
Incident response

Preferred Qualifications

Experience with containerization and Kubernetes (AKS or on prem K8s).
Experience with service mesh, API gateway technologies, or event driven architectures.
Knowledge of secure coding practices and integrating security in CI/CD.
Familiarity with enterprise networking, firewalls, and hybrid connectivity.

Soft Skill

Strong communication and collaboration abilities.
Analytical mindset with strong problem solving skills.
Ability to handle pressure in high severity incidents.
Passion for automation, simplification, and continuous improvement.

Applicant Notices & Disclaimers

For information on benefits, equal opportunity employment, and location-specific applicant notices, click here

At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position’s starting pay is: $ 40.00/hr.

Site Reliability Engineer (SRE)

Spectraforce

US

2 hours ago

Job Description

Similar Jobs

Staff Civil Site Engineer - Aerospace & Industrial

Spectraforce

Chesapeake, Virginia

7 days ago

Sr. Reliability Engineer

Spectraforce

Milpitas, California

21 days ago

Test and Reliability Engineer IV (Hardware)

Spectraforce

Salt Lake City, Utah

a month ago

Test and Reliability Engineer IV (Hardware)

Spectraforce

Salt Lake City, Utah

a month ago

Experience +10 years

Site Reliability Engineer II

Spectraforce

Alpharetta, Georgia

a month ago