Data Scientist | Clinical AI

San Francisco • 29 days ago • Full-time • External
Negotiable
Job Description

Machinify, Palo Alto, CA • Remote
Full-time
Posted 16 hours ago

Machinify is a leading healthcare intelligence company focused on healthcare payment solutions for health plan clients. They use an AI-powered platform to maximize financial outcomes and drive down healthcare costs. They are hiring a Data Scientist to help advance their AI system for clinical criteria evaluation. This role combines medical policy interpretation, data science, and applied ML/LLM evaluation. The Data Scientist will translate policy requirements into code (SQL and Python) and build the measurement and quality systems that continuously improve an AI pipeline that extracts structured clinical signals from medical records.

What You'll Do
• Translate medical policy into executable logic by reading and interpreting medical policies and clinical criteria.
• Convert requirements into correct, maintainable SQL and Python implementations.
• Design rule representations that are composable and auditable.
• Engineer prompts and tune system parameters to configure the AI that extracts clinical information from medical records.
• Build robust clinical feature pipelines that handle missing timestamps, multiple measurement sources, unit normalization, deduplication, conflicting values, and provenance tracking.
• Own measurement, evaluation, and continuous quality improvement.
• Define and instrument accuracy metrics for the AI system that extracts data from medical records.
• Build gold datasets, sampling strategies, and review workflows with clinical/operations partners.
• Establish engineering frameworks and tooling.
• Create reusable tooling for policy-to-code translation: templates, test harnesses, validation suites, regression checks, and monitoring dashboards.
• Partner deeply with domain experts.
• Work with clinicians, policy specialists, and operations to clarify ambiguous requirements and ensure implementations reflect real-world intent.
• Produce clear documentation that explains what the code is doing and why, with examples and edge-case handling.

What You'll Bring
• Strong SQL and Python engineering skills - Ability to translate nuanced requirements into correct SQL and production-quality Python.
• Experience operationalizing rules and models - Track record of implementing complex business/clinical logic and deploying it reliably.
• Evaluation/metric mindset - Experience designing metrics, building ground truth, running experiments, and improving system quality through structured iteration.
• Systems thinking and rigor - You build frameworks that make other engineers/scientists faster.
• Healthcare curiosity (and willingness to learn fast) - Interest in medical records, clinical data, and how policies translate into decision criteria.

Nice to Have
• Experience with clinical data standards or lab data normalization (LOINC familiarity, unit conversion, reference ranges).
• Experience evaluating LLM/IE (information extraction) systems or building human-in-the-loop QA workflows.
• Familiarity with distributed data systems (Spark, BigQuery/Snowflake, Databricks) and workflow orchestrators.

Why This Role
This role is designed for individuals who enjoy turning ambiguous real-world policies into precise code, building quality systems that make AI reliable, and operating at the boundary of clinical domain nuance and production engineering.