Highlight
Onsite, Full-time, Entry
Summary
Cartesia is seeking a Research Intern to join their team in San Francisco, CA. The role involves working on pioneering multimodal models built on new model architectures.
About the Role
As a Research Intern, you will be responsible for pushing the quality, efficiency, and capabilities of pretrained models. Your tasks will include implementing new model backbones, rapidly running experiments, building training infrastructure for massive datasets, and keeping up with new research ideas.
About You
Required:
- Deep machine learning background with a strong grasp of sequence modeling, generative models, and common model architecture families (RNNs, CNNs, Transformers).
- Proficient in Python, PyTorch (or a similar framework), and tensor programming.
Preferred:
- Experience in writing and pretraining large-scale models.
- Familiarity with efficiency tradeoffs in designing model architectures for GPUs.
- Currently pursuing an advanced degree (MS/PhD) in machine learning.
- Prior research experience in advancing state space models.
- Experience in optimizing model inference with CUDA, Triton, or other frameworks.
Benefits
- Lunch, dinner, and snacks at the office.
- Relocation assistance.
- A personal Yoshi.