Position Title: Bioinformatics Software Engineer
Work Location: South San Francisco, CA (Hybrid, 2-3 days onsite)
Assignment Duration: 8 months (possibility of extension)
Position Summary: We are seeking a highly motivated Bioinformatics Software Engineer to support software systems and data workflows within a Next Generation Sequencing (NGS) laboratory environment. The successful candidate will develop, maintain, and support applications and pipelines used for sequencing data processing, laboratory workflow management, and data analysis.
Background & Context: This role requires a strong background in software development, database management, and workflow automation, with experience supporting scientific data systems. The candidate will collaborate closely with bioinformaticians, scientists, and laboratory staff to ensure reliable data processing and scalable analysis pipelines.
Key Responsibilities:
• Design, develop, test, and maintain Java and Java EE web applications used to support sequencing lab workflows and laboratory data systems.
• Develop software tools to support sequencing data processing, data tracking, and laboratory operations.
• Participate in software design discussions and contribute to scalable and maintainable system architecture.
Workflow & Pipeline Development
• Develop and maintain bioinformatics pipelines for processing NGS data.
• Implement workflow automation using WDL (Workflow Description Language).
• Manage and support workflow execution through the Cromwell workflow engine.
• Optimize pipeline performance and ensure efficient processing of large sequencing datasets.
Database & Data Management
• Develop and execute SQL and PL/SQL queries to support data retrieval, transformation, and system integration.
• Support Oracle database systems, including writing queries, performing updates, and maintaining database integrity.
• Ensure reliable data storage, accessibility, and traceability across sequencing workflows.
Data Processing & Automation
• Develop and maintain Python and Bash scripts to automate data processing, analysis workflows, and system integration tasks.
• Implement data validation and quality checks to ensure accuracy and consistency of sequencing data.
Systems Integration
• Integrate laboratory information systems such as LIMS with sequencing platforms and downstream analysis pipelines.
• Support data flow between laboratory instruments, analysis systems, and databases.
Infrastructure & Deployment
• Support deployment and execution of pipelines in High Performance Computing (HPC) environments.
• Assist with cloud-based workflow deployment using AWS services.
• Build and maintain containerized software environments (e.g., Docker) to ensure reproducibility of computational workflows.
Troubleshooting & Support
• Monitor and troubleshoot pipeline execution issues and system performance problems.
• Identify and resolve bottlenecks in data processing pipelines.
• Provide technical support to scientists and lab personnel using the software systems.
Collaboration
• Work closely with bioinformaticians, data scientists, software engineers, and laboratory staff to understand workflow requirements.
• Translate scientific and operational needs into scalable software solutions.
• Participate in cross-functional discussions to improve sequencing workflow efficiency.
Documentation & Best Practices
• Maintain clear documentation of pipelines, applications, and database structures.
• Use Git repositories and version control best practices to manage software development.
• Support testing, validation, and continuous improvement of systems.
Qualifications & Experience:
• Bachelor’s or Master’s degree in Computer Science, Bioinformatics, Computational Biology, Software Engineering, or a related field, with a minimum of 3 years of experience.
• Strong experience with Oracle databases.
• Proficiency in SQL and PL/SQL.
• Experience with Java and Java EE web application development.
• Strong programming skills in Python.
• Experience with Bash scripting.
• Experience developing or supporting workflow pipelines using WDL.
• Familiarity with the Cromwell execution framework.
• Experience working with data processing pipelines or scientific data systems.
Preferred Qualifications
• Experience working in NGS or genomics environments.
• Familiarity with bioinformatics data processing workflows.
• Experience working in HPC environments.
• Experience with AWS cloud services.
• Experience with container technologies (Docker, Singularity, etc.).
• Experience using Git version control.
• Experience supporting LIMS or other laboratory data platforms.
Skills & Competencies
• Strong problem-solving and troubleshooting skills.
• Ability to work independently in a fast-paced research environment.
• Strong communication skills and ability to collaborate with scientific teams.
• Ability to manage multiple priorities and support complex data workflows.
Additional Information
Interview process:
1. Phone (15-20 minutes)
2. In-person (1 hour)
Hybrid, 2-3 days onsite
Function: gRED
At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position’s starting pay is $55.00/hr.