One of my passions in the broad field of biology and medicine has always been genomics. I strongly believe the best treatment of today and tomorrow will be personalized medication based on one’s genetic makeup. I am currently taking John Hopkins Genomic Data Science Specialization course which starts with instruction from Dr. Steven Salzberg, one of the scientists who worked on the Human Genome Project, and current Director of the Center for Computational Biology at John Hopkins. One of the methods he gives a glimpse of early on is Principal Component Analysis (PCA) to recognize differences in genetics between populations…

My thoughts going into my public health and infrastructure project (which I did a write up on here and can be found on my GitHub here) were to say the least, naive. We’ve made tons of technological advances over the course of the last 2 decades and I believed gathering and handling data on infrastructure and disease would be very simple. Additionally my assumptions of why my hypothesis might prove to be true, ignored a lot of caveats only realized after performing my analysis. Nonetheless the process of going through the project provided me with a lot of lessons on…

Using Infrastructure to Forecast the Spread of Disease (project found here)

As someone who has formerly worked in healthcare, I was naturally inclined to pursue a project involving public health. Having never done such a project, I learned a lot about the challenges of undertaking a public health data science project and gained a lot of insight on how to further improve my model as well as insight on other areas of public health that I can explore more. …

(Project can be found here)

The problem

Using social media to gauge public sentiment is not something that is particularly new. You can see what is trending on Twitter in each location, see which places are being tagged on Instagram, what’s trending on Reddit, the list goes on. The goal for this project was to use Pushshift’s API to collect posts from two subreddits then use Natural Language Processing (NLP) to train a classifier to predict which subreddit a given post came from. …

The “why are you doing this?” question for pursuing any field, hobby, or endeavor is something I usually scoff at because to me it seems like a loaded question. I find it hard to believe there’s any ONE reason someone does something, but rather a culmination of enjoying the little things about something or learning through doing other things, what you hate, what you enjoy, which in turn create that one encompassing reason as to “why” you do something. At least that’s what my journey into pursuing data science has felt like. At a young age you generally have some…

Dimitri Kisten

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store