Hello, Merck

Last week, I started a new position as a Machine Learning Engineer at Merck through Insights Global. Words are not enough to express my excitement, so I won’t even try. I wanted to hit the ground running, and run I did. Thank you, Kathleen, Regina, and CJ, for this opportunity. I will endeavor to make you …

Using MapQuest API to Get Geo Data

A friendly tutorial on getting zip codes and other geographic data from street addresses. Knowing how to deal with geographic data is a must-have for a data scientist. In this post, we will play around with the MapQuest Search API to get zip codes from street addresses along with their corresponding latitude and longitude to …

Get Your Feet Wet in Power BI

A hands-on introduction to Microsoft’s analytics tool. As a data scientist, you’ll need to get comfortable with analytics tools sooner or later. In today’s post, we will dive in headfirst and learn the very basics of Power BI. Be sure to click on the images to see the details better. The Data: The dataset …

Forecasting in Power BI

A visual step-by-step guide to forecasting using Power BI. In this post, we’ll go through the process of creating a forecast in Power BI. Get the Data: You can download the dataset that I used here. It contains daily female births in California in 1959¹. For a list of other time-series datasets, check out Jason Brownlee’s article …

Democratize Data Science

Every once in a while, I come across an article that decries online data science courses and boot camps as pathways to a data science job. Most of these articles aim not to discourage but to remind us to take a hard look in the mirror first and realize what we’re up …

Exploring the Trump Twitter Archive with PyCaret

For adventurous beginners in NLP. For this project, we’ll be using PyCaret: “PyCaret is an open source, low-code machine learning library in Python that allows you to go from preparing your data to deploying your model within seconds in your choice of notebook environment.”¹ (PyCaret) PyCaret does a lot more than NLP. It also does a …