Collecting and Processing Genomic Data to Help Cure Rare Diseases

September 21, 2020 54:59 52.79 MB Downloads: 0

In this episode of Running in Production, Dan Kolbman goes over using Django to build an internal tool that helps make sense of ~5 Petabytes of Genomic data that is then made available to clinicians. It’s running across many different AWS resources using ECS Fargate.

Dan walks us through what their app does, dealing with loads of data, using GraphQL, getting away from using Serverless and going mostly all-in with AWS. Their apps are open source too. The ones we’ll be talking about are on GitHub here and here.

Topics Include

  • 3:55 – Motivation for using Django and Python
  • 6:11 – Using GraphQL and having a few separate apps (micro-service’ish)
  • 11:16 – Querying ~5 Petabytes of Genomic data stored on S3
  • 17:21 – Using both Graphene (GraphQL) and Django REST Framework
  • 22:44 – Docker is being used in dev (Docker Compose) and in production (ECS Fargate)
  • 25:20 – PostgreSQL and Redis are being used too with lots of background tasks
  • 27:29 – Breaking down which AWS resources they use, along with using Terraform
  • 37:02 – Netlify is being used for deploy previews and CloudFront for production
  • 39:34 – Breaking down the work flow for deploying something from dev to prod
  • 46:55 – Planning for disasters and handling backing up data
  • 51:02 – Automated metrics around CPU and memory, along with alerting
  • 52:29 – Best tips? Use tools that a lot of people have thought long and hard about
  • 54:21 – You can find Dan on GitHub and his personal website

Links

📄 References ⚙️ Tech Stack 🛠 Libraries Used

Support the Show

This episode does not have a sponsor and this podcast is a labor of love. If you want to support the show, the best way to do it is to purchase one of my courses or suggest one to a friend.

  • Dive into Docker is a video course that takes you from not knowing what Docker is to being able to confidently use Docker and Docker Compose for your own apps. Long gone are the days of "but it works on my machine!". A bunch of follow along labs are included.
  • Build a SAAS App with Flask is a video course where we build a real world SAAS app that accepts payments, has a custom admin, includes high test coverage and goes over how to implement and apply 50+ common web app features. There's over 20+ hours of video.