I’ve posted this on LinkedIn, but I figured it would be appropriate to share on this page. This graduate school project was a high-level exercise in data wrangling, joining, manipulating, exploring, and analyzing.
To give you an idea of how nothing came easy on this project, I chose to use a Levenshtein distance function across thousands of rows as a soft join. This was one of a few choices I made over the course of the project, given time restrictions and minimum requirements. If you are interested to read more about the methodologies I used, check out the Github repo.
Finally, the link you’ve all been so desperately waiting for. Behold.
If the website is down, email me at whitsonbuck@gmail.com so I can restart the server. I’m cheap.