Location: California
Category: Engineering
Domain: Generative AI
Experience Level: Mid Level
Compensation: $166,000—$225,000
Posted 8 months ago
Job Description
Founded in late 2020 by a small group of machine learning engineers and researchers, Mosaic AI enables companies to securely fine-tune, train and deploy custom AI models on their own data, for maximum security and control. Compatible with all major cloud providers, the Mosaic AI platform provides maximum flexibility for AI development. Introduced in 2023, Mosaic AI’s pretrained transformer models have established a new standard for open source, commercially usable LLMs and have been downloaded over 3 million times. Mosaic AI is committed to the belief that a company’s AI models are just as valuable as any other core IP, and that high-quality AI models should be available to all.
Now part of Databricks since July 2023, we are passionate about enabling our customers to solve the world's toughest problems — from making the next mode of transportation a reality to accelerating the development of medical breakthroughs. We do this by building and running the world's best data and AI platform so our customers can use deep data insights to improve their business. We leap at every opportunity to solve technical challenges, striving to empower our customers with the best data and AI capabilities.
Summary:
Mosaic AI is hiring experienced machine learning platform engineers to build out our generative AI platform for the ML model development lifecycle including pre-training of large language models (LLMs), fine-tuning, evaluation, and serving. You will thrive in this role if you have a strong sense of end-to-end ownership and enjoy translating user requirements into product interfaces and building the backend distributed systems to power those interfaces. In this role, you will have the opportunity to contribute to all areas of our stack such as our CLI and SDK interfaces, backend APIs, distributed orchestration systems built on Kubernetes, and core platform infrastructure to support our product.
You will:
Play a key role in the end-to-end design and implementation of our product which is a platform for powering use cases across training and serving of generative AI models
Work closely with both ML researchers in the company and customers to identify key areas of development for our generative AI platform
Have strong end-to-end product ownership, translating product requirements into user interfaces and backend distributed system design as well as own the implementation of these designs
Design and build the core platform infrastructure that supports our customer-facing product features
Ensure the reliability, security, and scalability of the backend distributed systems that power all aspects of our product
We Expect You To
5+ years of full time industry experience
Strong software engineering skills
Experience building large-scale distributed systems
Experience building ML platform systems for applications in the ML model development lifecycle such as model training, data preparation, model evaluation, and model serving
Direct experience developing ML models is a plus but not required
Strong sense of end-to-end product ownership as well as intuition for both robust system design and product usability
Effective communication skills and the ability to articulate complex technical ideas to cross-disciplinary internal and external stakeholders.