Software Engineer IV (AI and Systems Co-Design)
Spectraforce
Menlo Park, California
Remote
Job Description
Job Title: Software Engineer IV (AI and Systems Co-Design)
Duration: 6 months - possible extension
Location: Remote
Job description:
At SPECTRAFORCE, we are committed to maintaining a workplace that ensures fair compensation and wage transparency in adherence with all applicable state and local laws. This position’s starting pay is $85.00/hr.
- The AI and Systems Co-Design team's mission is to explore, develop, and help productize high-performance software and hardware technologies for AI. Our team defines and drives the AI software and hardware roadmap. We are seeking a candidate who will work on a foundational tool for our internal workloads on current and next-generation AI platforms. Specifically, this position focuses on collecting, processing, storing, and analyzing data on various operators and workloads.
- Hands-on experience with product-level Python programming
- Essential for implementing, improving, and maintaining programmatic and web interfaces, as well as processing and analyzing data.
- Proficiency in PyTorch, Kineto trace, dispatcher, and CUDA/Triton kernels
- Critical for extracting operators from AI/ML models, running them on devices, and collecting performance data.
- Hands-on experience in database management and SQL
- Necessary for processing collected data, storing it in databases, and maintaining data integrity.
- Machine learning experience
- Experience in Large Language Models (LLM), especially Llama
- Valuable for working with advanced AI models and potentially improving performance analysis.
- Knowledge of CI-based testing and automation
- Helpful for ensuring code quality and automating testing processes.
- Proficiency in Linux and Bash
- Important for working in the development environment and managing scripts and tools efficiently.
- Extract operators (e.g., ATen, Triton) from AI/ML models.
- Run operators on multiple devices and collect performance data.
- Process collected data and store it in a database while maintaining data integrity.
- Implement, improve, and maintain programmatic and web interfaces to query and analyze performance data stored in the database.
- Collaborate as part of a project team to coordinate development and determine project scope and limitations.
- Review project requests to estimate time and cost required to complete the project.
- At least three years of experience with the above-mentioned skills is required for this role.
- 8+
- Maintain the database by ensuring data is properly saved and can be retrieved efficiently.
- Have a solid understanding of how to write data to the database and read data from it.
- After completing these tasks, integrate the database into the automated testing workflow (continuous integration, CI).
- The infrastructure team focuses on code design and development for their proprietary chip. This chip is used to run machine learning models, functioning similarly to a TPU (AI model accelerator).
- The team is responsible for evaluating the performance of each operator on this chip.
- Interview process: two rounds, each lasting 35 minutes. Both rounds are coding-focused and will be conducted using CoderPad.
Applicant Notices & Disclaimers
- For information on benefits, equal opportunity employment, and location-specific applicant notices, click here