Mediabistro logo
job logo

Senior Director, AI and Data Science (Drug Discovery and R&D Enablement)

Scorpion Therapeutics, New York, NY, United States


Key Responsibilities AI/ML Strategy + Delivery

Define and execute Lexeo’s applied AI/ML roadmap across discovery and development, prioritizing use cases that improve speed, quality, and decision confidence.

Deliver solutions that are internal-only (e.g., scientific decision support, operational forecasting) and those that are generated internally but external-facing (e.g., partner‑ready analyses (regulatory dossiers, briefing books, protocols etc.), validated dashboards, and decision materials).

Establish best practices for model lifecycle management (validation, documentation, monitoring, retraining), especially where outputs influence scientific decisions or regulated workflows.

Advanced Analytics + Predictive Modeling

Lead development and selection of appropriate ML approaches (e.g., XGBoost, Random Forest, SVMs, and other advanced models) based on problem framing, data constraints, interpretability needs, and deployment context.

Build and oversee predictive analytics using real‑world data, including robust evaluation design, bias/variance trade‑offs, and performance monitoring.

Small Data Excellence + Synthetic Controls

Apply techniques to amplify signal‑to‑noise in smaller datasets (e.g., regularization, Bayesian methods, hierarchical modeling, augmentation, multimodal integration, careful feature engineering, uncertainty quantification).

Guide strategy for synthetic control arms and comparable approaches (as appropriate), ensuring methodological rigor, transparency, and fit‑for‑purpose use in decision‑making.

Drug Discovery / Translational Partnership

Translate drug discovery and translational questions into testable analytical hypotheses; partner with bench scientists to design data capture that enables strong modeling.

Serve as a bridge between scientific teams and data/engineering, ensuring solutions are scientifically credible and operationally adoptable.

Cross‑functional Enablement + Platform Integration

Partner with stakeholders across R&D, CMC, Clinical, Safety, and IT/Security to implement scalable data pipelines and AI‑enabled workflows.

Contribute leadership to current and emerging initiatives such as AI workflow automation/database buildouts and analytics agents that leverage enterprise platforms (examples already in motion include CMC AI automation, MaxisAI clinical database/AI efforts, and AI work to ingest historical data into Dataverse/Fabric for agent‑based analysis; integration work such as a Benchling AI API initiative may also be in scope depending on priorities).

External Partner/Vendor Leadership

Liaise with external partners to evaluate tools, define statements of work, and deliver solutions—while ensuring knowledge transfer and sustainable internal ownership.

Operational Excellence

Improve internal processes through automation and analytics, focusing on measurable impact (cycle time, error reduction, throughput, decision latency).

Establish practical governance for data quality, documentation, and fit‑for‑use standards aligned with the realities of biopharma environments (including where regulated practices apply).

What Success Looks like (First 6‑12 Months)

A prioritized AI/analytics roadmap tied to measurable R&D outcomes; clear ownership and delivery cadence.

2‑4 production‑grade analytics solutions adopted by teams (internal and/or external‑facing outputs as needed).

A repeatable approach for small datasets and high‑noise signals; documented modeling standards and review practices.

Strong partner engagement model: vendors/partners used strategically, with internal capability building and durable outcomes.

Required Skills and Qualifications

Advanced degree in a quantitative or scientific discipline (PhD strongly preferred; MS with exceptional experience considered).

10+ years of relevant experience across applied data science/ML in life sciences/biopharma (or adjacent domain with direct drug discovery translation), including 5+ years leading teams and influencing senior stakeholders.

Deep familiarity with advanced ML methods (including XGBoost, Random Forest, SVMs) and the judgment to select and justify the right tool for the job.

Demonstrated experience building predictive models with real‑world, imperfect datasets and delivering them into production or decision workflows.

Proven ability to improve processes and operationalize analytics—moving beyond prototypes to adoption.

Strong cross‑functional communication: can partner with scientists, engineers, and executives; can explain model performance and limitations clearly.

Preferred Skills and Qualifications

Direct experience in drug discovery, translational research, and/or R&D decision support (target ID/validation, MoA, biomarker strategy, preclinical data integration).

Experience with small data strategies, causality‑aware thinking, and synthetic control arms or closely related methodologies.

Experience operating in regulated/quality‑sensitive environments and building documentation practices that scale (particularly relevant where validation and traceability are required).

Familiarity with enterprise data platforms and modern analytics stacks (lakehouse/warehouse patterns, feature stores, MLOps, model monitoring).

#J-18808-Ljbffr