Lead Data Platform Engineer - AI Foundation at MLabs

Senior Data Engineer - AI-First Fintech Platform
Location: Remote - Fully Remote - Across Europe
Compensation: $120K - $140K

We are a rapidly growing fintech and crypto platform undergoing an AI-first transformation. We are seeking a strategic, high-impact, and high-ownership Senior Data Engineer to build the foundational data infrastructure that powers our cutting-edge AI features.

You will report directly to the CTO and work closely with our Machine Learning (ML) team to consolidate and govern our data ecosystem (including PostgreSQL, Kafka, BigQuery, and GCS) into a clean, governed, and ML-ready platform. Your contributions will directly enable critical AI functionality across our product suite.

This role is responsible for the end-to-end design, implementation, and governance of our core data platform, ensuring high performance and readiness for ML applications.

Data Platform Architecture: Build and consolidate the foundational data platform, ensuring data from various sources (PostgreSQL, Kafka) is accurately captured and processed into our cloud data warehouse.
Pipeline Development: Design and implement robust, resilient batch and real-time data pipelines using streaming platforms and Change Data Capture (CDC) tools.
Data Modeling & Transformation: Apply expert-level SQL and tools like dbt to build and maintain dimensional models (fact/dimension tables) optimized for analytics, reporting, and Machine Learning feature creation.
Data Governance & Quality: Implement and enforce strict data governance policies, including PII tagging, column-level security, and access controls. Implement automated data quality monitoring with checks and alerting.
Performance & Optimization: Optimize the performance of our data warehouse (e.g., BigQuery) through techniques like partitioning, clustering, and advanced query optimization.
Observability: Implement and maintain full observability of the data platform, focusing on data freshness monitoring, schema change detection, and pipeline health dashboards.
AI/ML Collaboration: Work closely with the ML team and CTO to structure data specifically to enable and accelerate the development of new AI-driven features.

Cloud Expertise: Strong hands-on experience with Google Cloud Platform (GCP) data services (e.g., BigQuery, Dataflow, Cloud Storage, DataStream, AlloyDB/CloudSQL).
SQL & Modeling: Expert-level SQL proficiency and significant experience with data modeling tools like dbt for transformation and testing.
Streaming & CDC: Hands-on experience with streaming platforms (Kafka, Kafka Connect) and an understanding of Change Data Capture (CDC) tools (Debezium or similar).
Dimensional Modeling: Proven experience building dimensional models (fact/dimension tables) for analytics and ML features.
Data Governance: Practical experience implementing data governance measures (PII tagging, security, access controls).
Data Quality & Observability: Experience with implementing automated data quality monitoring and setting up data observability (freshness, schema changes, health dashboards).
Performance Tuning: Experience optimizing data warehouse performance (e.g., BigQuery partitioning, clustering, query tuning).

Nice to Have
Experience with Feature Store architecture and understanding ML feature serving patterns (real-time vs. batch).
Prior work within financial services or regulated data environments.
Familiarity with the Vertex AI ecosystem or Apache Beam/Dataflow transformations.
Knowledge of vector databases or semantic search concepts.
Background collaborating directly with ML/data science teams.

Company Location: Germany