
Patent Informatics Data Scientist at Drug Hunter. About Us:. Drug Hunter distills the science behind emerging drugs and technologies into an easy-to-read and simple-to-search reference work. The company was founded in 2018 out of a scientist’s frustration with existing ways to learn about new drugs and the science of pharma. Now, thousands of R&D leaders from every major pharma and biotech company in the world choose to invest their time in Drug Hunter over crawling through the literature alone.. We are seeking a highly motivated Patent Informatics Data Scientist to assemble, analyze novel chemical and biological datasets, and contribute towards innovative solutions in drug design.. As. Patent informatics Data Scientist. , you will apply a wide variety of approaches to building, curating and integrating drug discovery patent data, with an initial focus on small molecule patents. The job is a unique blend of organization, and curation of patent data, alongside data/software engineering to build high quality data views and content for our platform and users. Deep domain knowledge of the patent process for pharmaceuticals, is required, alongside hands-on experience of public domain/commercial patent data systems in the drug discovery space.. You will work closely with both the internal product team and external partners who range from medicinal chemists, pharmacologist, strategists, and software engineers to enhance our product and support of the community.. Key Responsibilities:. . Develop and apply patent data-gathering and mining approaches to build high quality foundation data sets.. . Process and analyze small- and large-chemical and biological intellectual property datasets, including therapeutic use, molecular target and chemical structure data.. . Integrate chemical and biological databases to patent data.. . Develop novel data mining approaches to unearth cryptic data that enables decision making in partner organisations.. . Optimize data pipelines for processing and storing patent data, using text-mining, cheminformatics and bioinformatics approaches.. . Collaborate with cross-functional teams to integrate computational approaches into curation and analysis workflows.. . Contribute to thought leader articles on drug intellectual property informatics and data mining.. . Maintain best practices in data integrity, reproducibility, and documentation of data sources and derived content.. . Required Qualifications:. . Ph.D. or Master’s degree in Cheminformatics, Computational Chemistry, Bioinformatics, Data Science, or a related field.. . 5-10+ years of experience in cheminformatics, computational drug discovery, or machine learning applications in chemistry.. . Proficiency in Python/R, with experience in cheminformatics libraries and topics.. . Strong knowledge of molecular descriptors, drug targets, and chemical/biological informatics techniques.. . An innate sense of how to query and derive value from patent data.. . Familiarity with Open Source and academic/commercial competitive intelligence/patent systems.. . Experience working in a structured collaborative data and software development environment (git, SQL/Postgres, python notebooks).. . Exceptional communication skills in written and verbal communication of science, a natural story-teller to make sense and provide insights from complex data.. . Preferred Qualifications:. . Understanding of regulatory and patent landscapes for chemical and pharmaceutical data.. . Text mining experience, NER/NLP. Existing expertise in Python and relational database systems. API development and systems architecture.. . We understand that we are looking for a broad range of skills, so are committed to on the job coaching from experienced team members.. . Company Location: United States.