X, The Moonshot Factory is seeking a 2026 PhD Residency, Computer Vision to join their team in Mountain View, CA. The role focuses on applying Vision-Language Models to enhance the understanding of electrical infrastructure and defect detection.
About the Role
As a PhD resident, you will lead the curation and labeling of datasets, design evaluation frameworks for Vision-Language Model performance, collaborate with research scientists to improve model reasoning, analyze performance bottlenecks, and integrate insights into Tapestry's digital twin.
About You
Required:
Currently pursuing a PhD or MS in Computer Science, Electrical Engineering, or a related technical field with a focus on machine learning.
Strong foundational knowledge in deep learning, particularly in computer vision and natural language processing.
Proficiency in Python and experience with deep learning frameworks such as PyTorch, JAX, or TensorFlow.
Practical experience managing, preprocessing, and analyzing large-scale image datasets.
Preferred:
Direct experience working with Vision-Language Models or multi-modal architectures.
Exposure to large-scale distributed training environments and data pipelines.
A portfolio of peer-reviewed publications or significant open-source contributions.
A creative approach to problem-solving with real-world data.
Benefits
Competitive salary
Medical, dental, and vision coverage
Competitive residency stipend and housing relocation support
Direct mentorship from industry-leading research scientists and engineers
Opportunity to work on 'moonshot' problems with access to Alphabet-scale compute and resources
X, The Moonshot Factory
We create breakthrough technologies to help solve some of the world’s biggest problems. Born at Google, we got our start creating self-driving cars and smart glasses. Since then, we’ve continued to bring sci-fi ideas into reality.