
Senior Data Engineer (Remote - Spain) at Jobgether

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Senior Data Engineer in Spain.

As a Senior Data Engineer, you will be responsible for designing, building, and maintaining scalable data platforms that enable high-quality, reliable, and accessible data for analytics, modeling, and reporting. You will work closely with cross-functional teams to transform raw datasets into clean, usable formats and ensure that data governance practices are enforced at scale. This role offers the opportunity to influence architectural decisions, mentor other engineers, and shape the data strategy across the organization. You will tackle complex challenges involving distributed processing, cloud infrastructure, and data modeling while contributing to continuous improvement of pipelines and frameworks. The position emphasizes ownership, technical expertise, and collaboration in a dynamic and innovative environment.

Accountabilities

Architect, develop, and evolve scalable data infrastructure to ingest, process, and serve large volumes of data efficiently.
Improve existing data pipelines and frameworks to ensure performance, reliability, and cost efficiency.
Establish and enforce data governance practices, including lineage, quality checks, and access controls.
Transform raw datasets into structured, clean, and usable formats for analytics, modeling, and reporting.
Investigate and resolve complex data issues, ensuring accuracy, consistency, and system resilience.
Maintain high standards for code quality, automated testing, CI/CD, and observability.
Stay up-to-date with industry trends and emerging technologies to enhance engineering practices.

Bachelor’s degree in Computer Science, Engineering, or a related field.
5+ years of experience designing, building, and operating scalable data ingestion, processing, and serving layers in production.
5+ years of SQL experience, including query design, performance tuning, and optimization.
5+ years of Python experience for data manipulation, pipeline development, and integration (e.g., PySpark, pandas).
Experience with data modeling for Data Warehouses, Lakehouses, and efficient ELT/ETL pipeline design.
3+ years of experience with distributed data processing (Apache Spark) for batch and/or streaming workloads.
3+ years of experience with cloud platforms (AWS and/or GCP) for data engineering workloads.
Experience implementing data governance at scale, including policies, lineage, quality checks, and access controls.
Experience improving existing pipelines and frameworks for performance, reliability, and cost efficiency.
Knowledge of automated testing, CI/CD, and observability for data pipelines.
Experience with API-based integrations (JDBC/ODBC, REST, SOAP) and diverse data types/formats (JSON, Avro, ORC, Parquet).
Strong shell scripting skills (Bash or PowerShell) and familiarity with Git workflows.
Expertise in modern data architectures such as Data Lake, Data Warehouse, Lakehouse, Data Mesh, or Data Fabric.
Advanced English proficiency for collaboration with international teams and clients.

Company Location: Spain.