Software Architect - Site Reliability Engineer
Spectraforce
US
Remote
4 hours ago
Job Description
Job Title: Software Architect - Site Reliability Engineer
Duration of project: 6+ months
Location: 100% Remote
Description:
As a Platform Engineer / Site Reliability Engineer (SRE) with Client, you will play a critical role in designing, building, and operating the infrastructure and automation that support our suite of cloud-native enterprise applications in the rapidly evolving healthcare technology landscape. You will be part of a collaborative engineering team focused on keeping our platforms scalable, secure, reliable, and efficient, so that clinicians and patients can access technology that sustains and improves health outcomes.
Responsibilities:
- Design, implement, and maintain scalable, secure, and highly available cloud infrastructure using Infrastructure-as-Code (IaC) tools and modern DevOps practices.
- Build and optimize CI/CD pipelines using tools such as Jenkins, GitHub Actions, Maven, and JFrog to ensure fast, reliable, and repeatable software delivery.
- Develop and manage containerized application environments with Kubernetes, ensuring optimal deployment, scalability, and service reliability.
- Automate infrastructure provisioning, configuration, and deployment processes to improve operational efficiency and reduce manual interventions.
- Design and implement comprehensive monitoring, alerting, and observability solutions to ensure system health, performance, and reliability.
- Collaborate closely with development, product, QA, and security teams to design robust platform solutions aligned with business and technical requirements.
- Participate actively in Agile ceremonies such as daily stand-ups, sprint planning, demos, and retrospectives, driving continuous improvement and team collaboration.
- Contribute to operational support processes, including incident response, root cause analysis, capacity planning, and performance optimization for large-scale distributed systems.
Required Experience:
- Infrastructure as Code & Automation: Minimum of 5 years of hands-on experience building and managing infrastructure as code with tools such as Terraform and Ansible, and automating CI/CD with tools such as Jenkins, GitHub, Maven, and JFrog.
- Cloud Platforms (GCP – Must Have): Proven hands-on experience deploying and managing infrastructure and services on Google Cloud Platform (GCP) is required. Strong knowledge of GCP networking, IAM, security, cost optimization, and core services (e.g., Cloud Run, Pub/Sub, GKE, Firestore) is essential. Experience with AWS or Azure is a plus but not a substitute.
- Containerization & Orchestration: Strong experience deploying, scaling, and managing containerized applications using Kubernetes, including service mesh, auto-scaling, and rolling update strategies.
- Programming & Scripting: Proficiency in one or more programming or scripting languages such as Python, Go, Java, C++, Perl, Ruby, or SQL, with the ability to automate workflows and optimize infrastructure operations.
- Operational Excellence: Experience designing and implementing operational support models, including incident response, root cause analysis, monitoring, and alerting strategies.
- DevOps & SRE Practices: Demonstrated experience with reliability engineering practices such as capacity planning, fault tolerance, and resilience design.
- Agile Development Practices: Experience working in Agile environments, including sprints, daily stand-ups, planning sessions, and retrospectives.
- Education: Bachelor’s degree in Information Systems, Information Technology, Computer Science, Engineering, or a related field, or equivalent work experience.
Applicant Notices & Disclaimers
- For information on benefits, equal opportunity employment, and location-specific applicant notices, click here
At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position’s starting pay is $75.00/hr.