CoreWeave is seeking a Principal Engineer - Observability to lead the architecture, development, and operations of its Observability product in New York, NY.
About the Role
As a Principal Engineer, you will shape the vision for how customers monitor and troubleshoot their AI workloads. Your responsibilities include leading the Observability strategy, designing advanced solutions, building insights for rapid troubleshooting, improving the reliability of metrics, analyzing telemetry for performance improvements, and mentoring engineering teams.
About You
Required:
Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
10+ years of experience in distributed systems, focusing on reliability and scale.
Proven experience leading storage product engineering projects.
Proficiency in programming languages such as Go, C, or Rust.
Strong understanding of cloud computing infrastructure using Kubernetes.
Preferred:
Experience with distributed observability systems like ClickHouse.
Prior experience with building Observability solutions.
Benefits
100% paid medical, dental, and vision insurance.
Company-paid life insurance.
Voluntary supplemental life insurance.
Short- and long-term disability insurance.
Flexible Spending Account.
Health Savings Account.
Tuition reimbursement.
Employee Stock Purchase Program (ESPP).
Mental wellness benefits through Spring Health.
Paid parental leave.
Flexible childcare support.
401(k) with generous employer match.
Flexible PTO.
Catered lunch each day.
Casual work environment.
CoreWeave
CoreWeave is the AI Hyperscaler™
Company Size: 501-1000 employees
Technology, Information and Internet