
Senior Data Engineer, ML Infrastructure at Serve Robotics. Remote Location: USA (remote). At Serve Robotics, we’re reimagining how things move in cities. Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, make deliveries available to more people, and benefit local businesses. . The Serve fleet has been delighting merchants, customers, and pedestrians along the way in Los Angeles while doing commercial deliveries. We’re looking for talented individuals who will grow robotic deliveries from surprising novelty to efficient ubiquity.. Who We Are. We are tech industry veterans in software, hardware, and design who are pooling our skills to build the future we want to live in. We are solving real-world problems leveraging robotics, machine learning and computer vision, among other disciplines, with a mindful eye towards the end-to-end user experience. Our team is agile, diverse, and driven. We believe that the best way to solve complicated dynamic problems is collaboratively and respectfully.. As a Senior Data Engineer in the Machine Learning (ML) Infrastructure team you will be helping us build out our petabyte scale data platform supporting data partnerships, ML and autonomy engineers. Your work will directly impact a new revenue stream through commercialization of our robot data. You will be focusing on building highly scalable data pipelines and improving data discoverability features. You will collaborate with ML engineers in the creation of diverse large scale datasets used to train cutting edge ML models that are deployed to our fleet of thousands of robots.. Responsibilities. Architect and implement robust, scalable data pipelines to process, synchronize, and package robotics data (e.g., LiDAR, camera, IMU, proprietary maps) for third-party consumption.. Build a data processing and egress platform, ensuring the timely and accurate delivery of datasets according to strict partner SLAs.. Create data lifecycle policies to control cloud data costs. Build and maintain a universal data catalogue of all raw robot data. Create cost monitoring, attribution and alerting systems.. Build data discoverability platform features, use ml models to generate new attributes and maintain efficient, highly scalable search indexes.. Setup data access audit trails and strong security controls managed through IaC. Create lineage maps and expose data traceability capabilities to internal consumers.. Qualifications . 5+ years of professional experience in software or data engineering.. Strong programming proficiency in Python, SQL. Hands-on experience building and maintaining large-scale data processing pipelines using cloud technologies. Proficiency with data warehousing and ETL/ELT concepts. Solid understanding of system design, along with data privacy and security best practices. What Makes You Stand Out . Hands on experience setting up IaC to orchestrate cloud resources and security policies. Experience with GCP and solid understanding of fully managed cloud infrastructure. Familiarity with robotics data such as lidar, multi-modal camera, mapping, etc. Experience working in a fast paced startup environment. Experience building and optimizing terabyte scale data pipelines