Senior Data Engineer
May Mobility
May Mobility is transforming cities through autonomous technology to create a safer, greener, more accessible world. Based in Ann Arbor, Michigan, May develops and deploys autonomous vehicles (AVs) powered by our innovative Multi-Policy Decision Making (MPDM) technology that literally reimagines the way AVs think.
Our vehicles do more than just drive themselves - they provide value to communities, bridge public transit gaps and move people where they need to go safely, easily and with a lot more fun. We’re building the world’s best autonomy system to reimagine transit by minimizing congestion, expanding access and encouraging better land use in order to foster more green, vibrant and livable spaces. Since our founding in 2017, we’ve given more than 300,000 autonomy-enabled rides to real people around the globe. And we’re just getting started. We’re hiring people who share our passion for building the future, today, solving real-world problems and seeing the impact of their work. Join us.
Senior Data Engineer
May Mobility is experiencing a period of significant growth as we expand our autonomous shuttle and mobility services nationwide. We are seeking a talented data engineer specializing in building highly scalable, reliable, and performant data pipelines to support automated tagging and analysis systems. Working closely with the data scientists, ML engineers and data platform teams, you will be responsible to optimize the data pipelines to ensure the rapid and reliable generation of the high-quality, searchable metadata and tags that powers our core autonomy improvements.
Responsibilities
- Design, build, and optimize high-throughput ETL pipelines using PySpark and cloud services to manage the flow of multimodal AV sensor logs
- Collaborate directly with ML Engineers to productionize, scale, and performance-tune the model inference pipelines, focusing on maximizing data throughput and minimizing operational costs
- Implement robust data quality checks, schema validation, and monitoring on all raw input data and on the structured, searchable metadata
- Identify bottlenecks in data movement and processing, improve the speed and efficiency of data preparation, and downstream data retrieval for dashboards and data search functionalities.
- Serve as the liaison between the Data Science teams and the Data Platform team, advocating for and implementing infrastructure improvements necessary for long-term scalability and reliability.
Skills
Success in this role typically requires the following competencies:
- Ability to work with high-volume, multimodal sensor data, understanding the complexities of temporal alignment
- Implement strategies to reliably synchronize generated metadata and tags e.g., tags created during real-time processing and post-processing analysis.
- Experience collaborating cross-functionally to define service level objectives for pipeline uptime, latency, and data freshness.
- Clear written communication and the ability to align folks on a plan before executing
- Excellent attention to detail and rigorous testing methodology.
- Identify complex problems and devise optimal and innovative solutions that often cross organizational boundaries.
Qualifications and Experience
Required
- 5+ years of professional experience in a Data Engineering role, specifically focused on supporting complex analytics initiatives and machine learning.
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- Expert proficiency in PySpark and distributed computing frameworks for processing petabyte-scale datasets.
- Deep working knowledge of cloud ecosystems (AWS, GCP, or Azure) and modern data lake/warehouse technologies.
- Demonstrated experience in migrating prototype data pipelines/scripts into fully managed, production-ready, fault-tolerant ETL systems.
Desired
- Experience in autonomous, robotics or heavily data-driven industry
- Experience working with data formats common in the AV/Robotics space (e.g., ROS bags, Protobuf, custom logging formats).
Benefits and Perks
- Comprehensive healthcare suite including medical, dental, vision, life, and disability plans. Domestic partners who have been residing together at least one year are also eligible to participate.
- Health Savings and Flexible Spending Healthcare and Dependent Care Accounts available.
- Rich retirement benefits, including an immediately vested employer safe harbor match.
- Generous paid parental leave as well as a phased return to work.
- Flexible vacation policy in addition to paid company holidays.
- Total Wellness Program providing numerous resources for overall wellbeing
Want to learn more about our culture & benefits? Check out our website!
May Mobility is an equal opportunity employer. All applicants for employment will be considered without regard to race, color, religion, sex, national origin, age, disability, sexual orientation, gender identity or expression, veteran status, genetics or any other legally protected basis. Below, you have the opportunity to share your preferred gender pronouns, gender, ethnicity, and veteran status with May Mobility to help us identify areas of improvement in our hiring and recruitment processes. Completion of these questions is entirely voluntary. Any information you choose to provide will be kept confidential, and will not impact the hiring decision in any way. If you believe that you will need any type of accommodation, please let us know.
Note to Recruitment Agencies: May Mobility does not accept unsolicited agency resumes. Furthermore, May Mobility does not pay placement fees for candidates submitted by any agency other than its approved partners.