Zoox is seeking an Engineering Manager, HPC Storage to lead its High Performance Computing Storage infrastructure team in Foster City, CA. The role covers petabyte-scale data movement and management for critical use cases such as ML model training.
About the Role
As an Engineering Manager, HPC Storage, you will oversee the design and optimization of storage systems, manage a team of engineers, and develop a strategic vision for storage solutions at Zoox. Your responsibilities will include collaborating with AI teams, addressing system pain points, and mentoring your team to ensure their professional growth.
About You
Required:
Experience managing teams of 5-10 engineers.
Demonstrated ability to prioritize development work and build cross-functional consensus across ML stakeholders.
Experience with high-performance storage systems deployed on cloud providers, such as FSx for Lustre on AWS.
Strong operational background with highly available systems.
Bachelor's degree in computer science or a related field.
Preferred:
Experience with ML-specific data formats such as Mosaic Streaming Datasets (MDS).
Experience with end-to-end hosted ML services such as AWS SageMaker HyperPod.
Proficiency with Python, Java, or other managed languages.
Benefits
Comprehensive benefits package including health insurance, long-term care insurance, and life insurance.
Paid time off, including sick leave, vacation, and bereavement leave.
Zoox Stock Appreciation Rights and Amazon Restricted Stock Units (RSUs).
Sign-on bonus may be offered as part of the compensation package.
Zoox
Zoox is transforming mobility-as-a-service by developing a fully autonomous, purpose-built fleet designed for AI to drive and humans to enjoy.