
Data Scientist (Remote - US) at Jobgether. This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Data Scientist in the United States.. This role offers a unique opportunity to develop advanced AI and NLP systems focused on healthcare data. You will design, implement, and maintain machine learning models that extract meaningful insights from clinical text, ICD-10 codes, and healthcare claims data. Working with MLOps frameworks and LLM governance protocols, you will enable scalable, production-ready AI solutions while mentoring junior team members. The position emphasizes both technical expertise and collaboration, as you will work closely with engineers, analysts, and other stakeholders to optimize data workflows and drive innovation. Ideal candidates have a strong foundation in NLP, deep learning, and predictive modeling and are motivated by creating impactful solutions in the healthcare sector. This is a remote position with a focus on enterprise-level AI innovation.. . Accountabilities:. Lead the development of NLP-based classification systems for ICD-10 code identification from clinical charts and encounters.. Design and implement deep learning models using PyTorch and transformer architectures for medical text analysis.. Build and maintain MLOps pipelines, including CI/CD workflows, model deployment orchestration, and production monitoring.. Develop and optimize machine learning models for risk adjustment coding and HCC classification.. Create feature engineering pipelines, model evaluation strategies, and decision support systems for automated coding validation.. Collaborate with engineering teams to integrate AI/ML solutions into production environments.. Mentor junior scientists and analysts on AI, NLP, and machine learning best practices.. . Bachelor’s degree in Computer Science, Statistics, Mathematics, Data Science, or a related field; Master’s preferred.. 3+ years of experience in machine learning, AI, and NLP for classification and predictive systems.. Expertise in PyTorch and transformer architectures (BERT, RoBERTa, etc.) for text classification.. Experience implementing Model Context Protocols (MCP) for enterprise LLM governance.. Familiarity with MLOps frameworks such as MLflow, Kubeflow, and Weights & Biases.. Advanced Python programming skills and experience with NLP libraries.. Knowledge of healthcare claims data, clinical text processing, ICD-10 coding, and risk adjustment methodologies preferred.. Strong analytical and problem-solving skills, with excellent communication abilities.. Experience with SQL and collaborative development/version control systems.. Reliable high-speed internet and a secure, dedicated remote work setup.. . Company Location: United States.