
Staff Data Engineer (Remote - India) at Jobgether

This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Staff Data Engineer in India.

We are seeking a seasoned Staff Data Engineer to lead the design, development, and optimization of large-scale data platforms that power advanced analytics and business intelligence. In this role, you will work on transforming complex, unstructured, and semi-structured data into actionable insights that directly influence business decisions. You will collaborate with product and engineering teams to architect scalable data pipelines, mentor junior engineers, and drive innovations in data processing, storage, and retrieval. The ideal candidate thrives in solving optimization and integration challenges, enjoys technical leadership, and contributes to creating a robust and reliable data ecosystem.

Accountabilities:

- Lead the design, development, and extension of data pipeline services and architectures to support business and customer needs.
- Transform raw unstructured and semi-structured data into structured, high-quality datasets for analytics and reporting.
- Implement and optimize ETL/ELT processes and workflows, ensuring scalability, reliability, and efficiency.
- Build and maintain data warehouses, data lakes, and streaming data solutions to enable advanced analytics and AI pipelines.
- Collaborate with cross-functional teams to define data solutions that align with business requirements and drive value.
- Provide technical mentorship to junior engineers and evangelize best practices across teams.
- Maintain high-quality standards for data management, documentation, and operational procedures.
- Contribute new ideas and innovations to enhance data architecture, processing, and analytics capabilities.

- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related technical field.
- 10+ years of professional experience in data engineering or software engineering with a focus on data platforms.
- Strong proficiency in Python and SQL, with hands-on experience in data processing and orchestration frameworks like Apache Spark, Flink, and Airflow.
- Experience with RDBMS, NoSQL, and Big Data solutions (Postgres, MongoDB, Snowflake, etc.).
- Solid understanding of streaming solutions (Kafka, Pulsar, Kinesis/Firehose) and ETL/ELT processes.
- Hands-on experience with Docker, Kubernetes, Terraform, and infrastructure-as-code practices.
- Knowledge of columnar and row-oriented data structures (Parquet, ORC, Avro) and open table formats like Apache Iceberg.
- Familiarity with data warehousing, data lakes, BI tools (e.g., Tableau), and ML/AI pipelines.
- Strong problem-solving skills and the ability to learn quickly and adapt to changing business requirements.
- Experience leading teams, mentoring engineers, and facilitating agile ceremonies.
- Excellent communication skills and ability to collaborate with cross-functional teams.

Bonus Points:

- Experience with vector databases, retail/ecommerce environments, or additional programming languages (Scala, Java, Golang).
- Experience with in-memory columnar data technologies (Apache Arrow).

Company Location: India.