Senior Software Engineer, ML Data
About this role
SENIOR SOFTWARE ENGINEER, ML DATA
San Francisco, CA · Hybrid · Reports to Head of Vision & AI
WHO WE ARE
Voxel is building the future of Computer Vision and Machine Learning for operations, risk, and safety. We use computer vision and AI to enable existing security cameras to automatically detect hazards and high-risk activities, keep people safe and drive operational efficiencies. Our technology addresses the key cost drivers for workers’ compensation, general liability, and property damage, which cost US employers over $500 billion annually. Our customers include Fortune 500 companies across grocery, retail, manufacturing, food and beverage, logistics, and pharmaceutical distribution. We’ve passed $10M ARR with strong expansion revenue. Based in SF, backed by industry-leading VCs.
ABOUT THE ROLE
Voxel’s perception system is the technical core of everything we ship. Our models detect human activity, equipment interactions, environmental hazards, and operational state in real time across thousands of cameras in manufacturing, logistics, retail, and pharmaceutical environments. Safety was our wedge; it proved our platform works. Now customers are pulling us into operations: equipment utilization, workflow compliance, process efficiency. Every new use case runs through the perception team.
We're hiring a strong software engineer to own the data infrastructure required to train and evaluate our ML models. You’ll work with petabytes of data streaming from thousands of live customer cameras, and build pipelines to ingest, search, label, version and store the highest quality of data required to train state-of-the-art computer vision models. Your work directly shapes how our applied ML team measures and improves model quality. You'll set technical direction, write code, make architecture calls, and partner closely with applied CV, ML infra and Platform engineers.
WHAT YOU'LL DO
- Drive the roadmap for the data infrastructure that powers Voxel's vision and ML capabilities.
- Build petabyte-scale pipelines for ingestion, search, labeling, versioning and storage of camera data.
- Develop methods for data mining and automated data collection.
- Partner with applied ML engineers on dataset quality for training and evaluation.
- Collaborate with the Data Ops team (HITL) to design processes to measure and improve data quality.
- Understand the data needs of Vision and AI engineers and design scalable infra solutions that support model improvement and vision capabilities.
- Collaborate with the Platform team to store and retrieve data efficiently on the cloud.
WHAT WE'RE LOOKING FOR
- 4+ years of experience building and shipping large scale software solutions.
- Working knowledge of ML training and evaluation. Understand what makes a good dataset, how to measure model quality, and how data quality affects model performance.
- Strong Python. Comfortable across the stack: data mining, data labeling, storage and retrieval.
- Track record of owning something end to end: building data products valuable to internal customers.
- Bias toward shipping. You'd rather ship something good this week than something perfect next quarter.
- Strong communication skills.
NICE TO HAVE
- Experience with implementing data compliance & data governance solutions.
- Deep understanding of data for computer vision - object detection, tracking, video understanding.
- Familiarity with human in the loop data annotation, auto labeling.
COMPENSATION & BENEFITS
- Equity through Voxel’s Equity Incentive Plan
- Total compensation includes base salary, annual bonus, and equity
- Comprehensive health, dental, and vision insurance
- Competitive paid parental leave
- Unlimited PTO and flexible work arrangements
- Daily meals in-office, team events, annual company onsite