Fireworks AI is seeking a Member of Technical Staff, Cloud Infrastructure to join their team in San Mateo, CA. The role involves architecting and building foundational systems for their generative AI platform.
About the Role
As a Software Engineer on the Cloud Infrastructure team, you will architect and build scalable, resilient, and high-performance backend infrastructure to support distributed training, inference, and data processing pipelines. You will lead technical design discussions, mentor other engineers, and establish best practices for large-scale ML infrastructure. Your responsibilities include designing core backend services, driving infrastructure optimization initiatives, and collaborating with cross-functional teams to translate research and product needs into robust infrastructure solutions.
About You
Required:
Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent practical experience).
5+ years of experience designing and building backend infrastructure in cloud environments (e.g., AWS, GCP, Azure).
Proven experience in ML infrastructure and tooling (e.g., PyTorch, TensorFlow, Vertex AI, SageMaker, Kubernetes, etc.).
Strong software development skills in languages like Python or C++.
Deep understanding of distributed systems fundamentals: scheduling, orchestration, storage, networking, and compute optimization.
Preferred:
Master’s or PhD in Computer Science or related field.
Experience leading infrastructure projects supporting large-scale ML/AI workloads or high-throughput systems.
Familiarity with infrastructure-as-code and CI/CD tooling (e.g., Terraform, ArgoCD, GitOps).
Track record of driving system performance, reliability, and cost-efficiency improvements.
Contributions to open-source cloud or ML infrastructure projects a plus.
Benefits
Meaningful equity in a fast-growing startup.
Competitive salary and comprehensive benefits package.
Fireworks AI
Fireworks.ai offers generative AI platform as a service. We optimize for rapid product iteration building on top of gen AI as well as minimizing cost to serve. https://fireworks.ai/careers
Company Size: 51-200 employeesSoftware Development