Data Engineer 2 (Remote - India) at Jobgether

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Data Engineer 2 in India.

We are seeking a skilled Data Engineer 2 to design, build, and maintain scalable data pipelines that support AI and analytics initiatives. This role focuses on transforming complex structured and semi-structured data into production-ready datasets for machine learning models and business intelligence applications. You will collaborate closely with data scientists, analysts, and engineering teams to ensure data quality, reliability, and compliance. This position offers the opportunity to work in a fast-growing, innovative environment, tackling challenging problems in healthcare and financial data systems while contributing to impactful AI solutions.

Accountabilities

- Design, develop, and maintain ETL/ELT pipelines to support AI/ML workflows and analytics.
- Build and optimize data ingestion and transformation logic using Python, PySpark, and SQL.
- Implement and manage DataOps pipelines with data quality checks, logging, monitoring, and automated testing.
- Schedule and orchestrate pipelines using tools like Airflow, Databricks Workflows, or Azure Data Factory.
- Optimize SQL queries and Spark jobs to ensure performance and cost efficiency.
- Collaborate with data scientists to provide clean, well-structured data for model development and deployment.
- Ensure adherence to data governance, security, and compliance policies, particularly in healthcare environments.

Requirements

- Bachelor’s degree in Computer Science, Information Systems, Data Engineering, or a related field.
- 3+ years of experience in data engineering or backend data development.
- Strong proficiency in Python and PySpark for data processing.
- Advanced SQL skills, including query optimization, complex joins, and handling large datasets.
- Hands-on experience with data pipeline tools (Airflow, Databricks, Azure Data Factory, Glue).
- Knowledge of data modeling, validation, and error-handling techniques.
- Familiarity with cloud data platforms (AWS Redshift, Azure Synapse, Snowflake) is a plus.
- Experience with CI/CD pipelines for data workflows and data versioning/cataloging tools is a bonus.
- Understanding of healthcare or revenue cycle management (RCM) data is advantageous.

Company Location: India.