Engineer, AI/ML & Analytics Platform Engineering
Spectraforce
Plainsboro, New Jersey
7 hours ago
Job Description
Position Title: Engineer, AI/ML & Analytics Platform Engineering
Work Location: Plainsboro Township, NJ (Hybrid)
Assignment Duration: Fulltime/Direct Hire
Work Arrangement: Hybrid
Position Summary: At our organization, AI & analytics technology is more than just a solution, it powers the R&D, Commercial, and functional business units that work tirelessly to save lives and bring valuable treatments to patients around the world. The new AI & Analytics Platform Engineering & Standards team, is building quickly to meet the business demand of immediate, real-time, in-house built technology innovation.
Key Responsibilities:
• Contribute to the build out of the AI/ML & Analytics platform, services, and tools (across dev, test, and prod) that accelerate model training, inference, and deployment within our spoke teams
• Build platform capabilities to support both batch and real-time workflows at scale with flexible deployment strategies to accommodate varying use cases (e.g. low-latency predictions, offline model inference)
• Improve platform performance, reduce manual intervention, scale compute, and increase deployment efficiency.
• Work with the foundational cloud teams to ensure platform operational effectiveness, reliability, security and efficiency.
• Work with team members to provide technical guidance and implementations for monitoring systems (e.g. registry, alerting, etc.) and governance frameworks (e.g. regulatory compliance).
• Collaborate with our spoke teams for AI/ML & Analytics system architecture design, deployment pipelines, and solution scaling.
Qualification & Experience:
• Bachelors or Masters in a quantitative subject (e.g. Computer Science, Engineering, Data Science, Mathematics, Statistics, Operations Research) or a related field with 5+ years of experience.
• Experience in building AI/ML & Analytics or related platforms for ML Researchers, ML Engineers, Data Scientists, and Data Analysts
• Experience building scalable self-service systems or platforms using microservices and/or event-based services
• Strong knowledge of commonly used AI/ML & Analytics programming languages such as Python, Spark, SQL or similar, with experience in machine learning frameworks like PyTorch or TensorFlow.
• Experience with the AWS cloud-service ecosystem including AI/ML & Analytics related services (e.g. Sagemaker, etc.)
• Experience implementing IaC (Terraform, OpenTofu, CDK, Pulumi, etc.) + CI/CD for deploying cloud-based platform infrastructure at scale
• Knowledge of basic software development tools including VCS (GitHub, GitLab, etc.), CI/CD (GitHub/Lab Actions, Jenkins, etc.), JIRA
• Knowledge of containerization (e.g. Docker, Podman, etc.) and orchestration tools (e.g. Kubernetes, Rancher, etc.)
• Experience with large scale CPU, GPU and/or multi-GPU infrastructure (bonus for CUDA fundamentals)
• Knowledge of fundamental ops capabilities such as registries, tracking, observability, and monitoring.
• Experience analyzing and improving system performance and reducing costs.
• Strong communication skills and ability to engage with stakeholders effectively.
• Prior experience working within the pharma/biotech domain
• Proficiency in at least one or more strongly typed programming language such as C/C++, Java, Go, Rust, or similar with associated OO or functional design principals.
• Experience with large-scale distributed systems (e.g. Ray, Dask, Spark, etc.) and high-performance computing environments (e.g. Slurm, etc.)
• In-depth knowledge of data platforms (e.g. Databricks, Snowflake, or Lake Formation) and tools (e.g. dbt) and their underlying technologies (e.g. Delta, Iceberg, Hudi, Spark).
• Prior work building and using real-time/streaming infrastructure (e.g., Kafka, Spark Streaming).
• Experience with GitOps style tools for building and enabling developer platforms (e.g. ArgoCD, Crossplane, etc.)
• Experience with multi-cloud platform development (e.g. some combination of AWS, GCP, Azure)
• Knowledge of high-performance frameworks for inference and training/fine-tuning (e.g. onnxRT, tensorRT, Triton, etc.) or resource intensive GenAI.
At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position’s starting pay is: $ 110000.00/Yearly.
Work Location: Plainsboro Township, NJ (Hybrid)
Assignment Duration: Fulltime/Direct Hire
Work Arrangement: Hybrid
Position Summary: At our organization, AI & analytics technology is more than just a solution, it powers the R&D, Commercial, and functional business units that work tirelessly to save lives and bring valuable treatments to patients around the world. The new AI & Analytics Platform Engineering & Standards team, is building quickly to meet the business demand of immediate, real-time, in-house built technology innovation.
Key Responsibilities:
• Contribute to the build out of the AI/ML & Analytics platform, services, and tools (across dev, test, and prod) that accelerate model training, inference, and deployment within our spoke teams
• Build platform capabilities to support both batch and real-time workflows at scale with flexible deployment strategies to accommodate varying use cases (e.g. low-latency predictions, offline model inference)
• Improve platform performance, reduce manual intervention, scale compute, and increase deployment efficiency.
• Work with the foundational cloud teams to ensure platform operational effectiveness, reliability, security and efficiency.
• Work with team members to provide technical guidance and implementations for monitoring systems (e.g. registry, alerting, etc.) and governance frameworks (e.g. regulatory compliance).
• Collaborate with our spoke teams for AI/ML & Analytics system architecture design, deployment pipelines, and solution scaling.
Qualification & Experience:
• Bachelors or Masters in a quantitative subject (e.g. Computer Science, Engineering, Data Science, Mathematics, Statistics, Operations Research) or a related field with 5+ years of experience.
• Experience in building AI/ML & Analytics or related platforms for ML Researchers, ML Engineers, Data Scientists, and Data Analysts
• Experience building scalable self-service systems or platforms using microservices and/or event-based services
• Strong knowledge of commonly used AI/ML & Analytics programming languages such as Python, Spark, SQL or similar, with experience in machine learning frameworks like PyTorch or TensorFlow.
• Experience with the AWS cloud-service ecosystem including AI/ML & Analytics related services (e.g. Sagemaker, etc.)
• Experience implementing IaC (Terraform, OpenTofu, CDK, Pulumi, etc.) + CI/CD for deploying cloud-based platform infrastructure at scale
• Knowledge of basic software development tools including VCS (GitHub, GitLab, etc.), CI/CD (GitHub/Lab Actions, Jenkins, etc.), JIRA
• Knowledge of containerization (e.g. Docker, Podman, etc.) and orchestration tools (e.g. Kubernetes, Rancher, etc.)
• Experience with large scale CPU, GPU and/or multi-GPU infrastructure (bonus for CUDA fundamentals)
• Knowledge of fundamental ops capabilities such as registries, tracking, observability, and monitoring.
• Experience analyzing and improving system performance and reducing costs.
• Strong communication skills and ability to engage with stakeholders effectively.
• Prior experience working within the pharma/biotech domain
• Proficiency in at least one or more strongly typed programming language such as C/C++, Java, Go, Rust, or similar with associated OO or functional design principals.
• Experience with large-scale distributed systems (e.g. Ray, Dask, Spark, etc.) and high-performance computing environments (e.g. Slurm, etc.)
• In-depth knowledge of data platforms (e.g. Databricks, Snowflake, or Lake Formation) and tools (e.g. dbt) and their underlying technologies (e.g. Delta, Iceberg, Hudi, Spark).
• Prior work building and using real-time/streaming infrastructure (e.g., Kafka, Spark Streaming).
• Experience with GitOps style tools for building and enabling developer platforms (e.g. ArgoCD, Crossplane, etc.)
• Experience with multi-cloud platform development (e.g. some combination of AWS, GCP, Azure)
• Knowledge of high-performance frameworks for inference and training/fine-tuning (e.g. onnxRT, tensorRT, Triton, etc.) or resource intensive GenAI.
Applicant Notices & Disclaimers
- For information on benefits, equal opportunity employment, and location-specific applicant notices, click here
At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position’s starting pay is: $ 110000.00/Yearly.