Digging in to Hilary Clinton's emails- USA Presidential Candidate




So few months ago, Hillary Clinton released her email communications  happened (sent and received) during her tenure as Secretary of State in response to a FOIA request. You can download the extracted and normalized version of these raw email data here. Now, Let's see what interesting information we can dig from this. 

To start off, what are the most common words in her emails. Using PyEnChant to ignore the language specific words provides us with some interesting names and incidents. Here is the script and the result.





How about the countries that she communicated mostly about. Here's the result of that attempt with the assistance of PyCountry.  There is some noise in the final data (such as 'tv' & 'fm' most probably referring to the respective medias, not to the Tuvalu Islands and Federated States of Micronesia ). But as a whole, we can see which countries have received most of her attention. 





Addition to these basic information, although it's not huge amount of data, this could be the source for much more sophisticated sentiment analysis such as finding out  how happy, sad or frustrated Hilary was during this period.

Comments

Popular Posts