[Hiring] Bioinformatics Machine Learning Intern @RefinedScience

Remote, USA Full-time Posted 2026-05-31
Apply Now

Role Description

We are seeking a highly motivated Bioinformatics Machine Learning Intern to join our team. This internship is designed for Ph.D. candidates with experience applying machine learning, deep learning, or generative AI methods to single-cell omics data. You will contribute to active projects spanning single-cell biology, multiomics integration, and computational approaches to precision medicine and drug development.

Our Bioinformatics team plays a crucial role in integrating computational biology, large-scale data analysis, and machine learning to drive discoveries in precision medicine and drug development.

    Key Activities
  • Analyze single-cell and multiomics datasets to extract biological insights supporting precision medicine and drug development programs
  • Apply and evaluate machine learning and deep learning approaches to single-cell data for tasks such as cell type classification, biomarker discovery, and patient stratification
  • Explore and prototype generative AI and LLM-based approaches to accelerate biological data interpretation and scientific workflows
  • Collaborate with scientists, clinicians, and data scientists to design and execute data-driven research projects
  • Document and optimize computational workflows following reproducible research best practices
  • Present findings through technical reports, visualizations, and presentations to cross-functional teams
    Qualifications
  • Current Ph.D. candidate in Bioinformatics, Computational Biology, Computer Science, Biostatistics, or a related quantitative field
  • Single-cell omics experience: Demonstrated ability to process, analyze, and interpret single-cell data (scRNA-seq, scATAC-seq, CITE-seq, or spatial transcriptomics) using frameworks such as Scanpy/scverse, Seurat, or Bioconductor
  • Machine learning expertise: Applied experience developing and evaluating ML/deep learning models on biological data, including neural network architectures (GNNs, transformers, autoencoders), model selection and benchmarking, and integration of ML approaches into analytical workflows
  • Programming proficiency: Python and/or R for data analysis, statistical modeling, and visualization
  • Statistical foundation: Understanding of statistical methods for biological data (hypothesis testing, differential expression, multiple testing correction, clustering)
  • Strong problem-solving skills and ability to communicate complex insights effectively
    Requirements
  • Experience with deep learning frameworks (PyTorch, TensorFlow, JAX)
  • Familiarity with graph neural networks, attention mechanisms, or transformer architectures applied to biological data
  • Experience with ML experiment tracking and reproducibility (MLflow, Weights & Biases)
  • Exposure to representation learning, variational autoencoders, or contrastive learning methods
  • Familiarity with scikit-learn, XGBoost, or similar ML libraries
  • Interest in or experience with LLMs, RAG systems, or agentic AI tooling
  • Experience with multimodal single-cell integration (Seurat WNN, scvi-tools/MultiVI/totalVI, Muon)
  • Familiarity with spatial transcriptomics analysis (Squidpy, cell2location, nf-core/spatialvi)
  • Experience with cell-cell communication inference (CellChat, NicheNet, LIANA)
  • Knowledge of drug-gene interaction resources (CMap/LINCS, OpenTargets, ChEMBL)
  • Familiarity with Linux/Unix CLI and version control (Git/GitHub)
  • Experience with containerization (Docker, Singularity) and environment management (conda, venv)
  • Exposure to cloud computing platforms (GCP preferred)
  • Familiarity with workflow managers (Nextflow, Snakemake)
  • Adherence to best-practices for conduct reproducible computational research
    Duration
  • 8–10 weeks
    Benefits
  • Compensation: $34-$38 per hour

Company Description

At RefinedScience, our mission is to advance care by bringing together the best science, data and minds – disease by disease, patient by patient, cell by cell to discover pathways to life beyond disease.

    Our Values
  • Act with Purpose – We believe in rigor through deliberate and thoughtful actions
  • Be Curious – Curiosity is the spark that ignites innovation and growth
  • Take Ownership – True ownership leads to pride and commitment in the work we do
  • Invest in Relationships – Building strong connections is the foundation for effective collaboration and trust for long term success
  • Embrace Agility – We celebrate agile thinking, resilience, and adaptability

Apply tot his job

Apply To this Job

Similar Jobs