[GSoC 2025] Supporting Patient-Level Pipelines within JuliaHealth

Hello Julia Community! :waving_hand:

I’m thrilled to share a major milestone: I’ve officially completed my Google Summer Of Code 2025 project with JuliaHealth! This summer, I had the incredible opportunity to improve tooling for patient-level prediction pipelines in the healthcare space. My project was structured in two main phases:

Phase 1: HealthBase.jl

  • Added schema-aware support for OMOP CDM tables.
  • Developed a new HealthTable interface for working with OMOP CDM data.
  • Introduced preprocessing utilities to make healthcare data easier to use for downstream modeling.
  • :open_book: Blog post: Phase 1

Phase 2: OMOPCDMFeasibility.jl

  • Built a package to perform pre and post-cohort feasibility analysis.
  • This tool helps ensure reliable cohort studies and trustworthy research outcomes.
  • :open_book: Blog post: Phase 2

You can find my complete final work report here: GSoC 2025 Final Work Report

Throughout this journey, I not only learned a lot about Julia and healthcare data science, but also about contributing to open-source and working in a collaborative research ecosystem.

A huge thanks to my mentor, Jacob S Zelko (@TheCedarPrince) for his constant guidance and encouragement throughout the project, this wouldn’t have been possible without his support. :folded_hands:

I am truly excited to continue contributing to JuliaHealth and the open-source community and I welcome any thoughts, suggestions and feedback you may have. :seedling:

~ Kosuri Lakshmi Indu (LinkedIn, GitHub)

8 Likes