Fun with Bar Plots: Examining the Holdings
Posted on Sun 26 June 2016 in posts • Tagged with data science, python, projects, new york city, historical data, visualization, bar plots
Using BeautifulSoup, I wrote the following script to identify and gather the holdings information for the New York City directory collection in Hathitrust. The script collected useful information about each directory, such as its title, publisher, publication year, and the total number of pages (really, files) in the digital version of the directory. With that information stored in my catalog data frame, I'm going to produce some tables to get a better sense of what I have. Specifically, I'd like to know how many directories I have for each year from 1800 to 1899, and how that collection breaks down by publisher and repository.
Continue reading