Job Description
The Principal Data Scientist – Machine Learning/Deep Learning is a primary driver in the design and development of state-of-the-art Artificial Intelligence solutions for medical applications.
You'll enjoy the flexibility to work remotely from anywhere within the U.S.
• Enhance existing company NLP technologies and extend those systems into new cloud-based applications
• Develop novel machine/deep learning techniques for information extraction and synthesis
• Create research code for clinical NLP solutions deployed at scale in production environments, including statistical methods, deep learning, and large language model technologies
• Take part in all aspects of methods development, from initial proof-of-concept (PoC) implementation through performance characterization to production launch
• Strong history of publication in Machine/Deep Learning with an emphasis on Natural Language Processing, Information Retrieval and/or Information Extraction
• Proven success in taking machine/deep learning solutions to production environments
Requirements
• 5+ years of professional work experience using machine/deep learning technologies;
• Publication history demonstrating machine/deep learning and NLP subject-area experience;
• 5+ years of experience with Python;
• 3+ years of experience with LLM approaches such as RAG, agentic approaches, or fine-tuning of large language models;
• 2+ years of experience working with transformer architectures (such as BERT or GPT) or similar architectures (such as state space models);
• PhD in a computational domain;
• Demonstrated publication record in the AI domain, especially relating to text extraction and summarization;
• Experience with Azure ML and/or AWS;
• Experience with hybrid NLP solutions that combine symbolic and machine learning approaches;
• Experience within the medical domain;
• Experience with SQL;