I’ve decided how to group the data points based on the actors present at each protest, and now I need to split the datasets accordingly. Normally I’d do this with fancy indexing, but the actor columns I’m grouping on contain string data, which doesn’t interact nicely with fancy indexing. Because of this, I’ll loop through the datasets manually, checking each actor entry for substrings. This will take some time, but it should ultimately produce several curated data frames for analysis.
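Here is a rough sketch of the kind of loop I have in mind. It is only an illustration: the rows are plain dictionaries standing in for data frame rows, and the column name `actor1`, the sample entries, the group keywords, and the helper `split_by_actor` are all made up for this example rather than taken from the real datasets.

```python
# Hypothetical sample rows; the real datasets have different columns and values.
rows = [
    {"event_id": 1, "actor1": "Protesters (Students)"},
    {"event_id": 2, "actor1": "Police Forces"},
    {"event_id": 3, "actor1": "Labour Group; Students"},
    {"event_id": 4, "actor1": None},  # missing actor entry
]

# Assumed mapping of group name -> substring to look for (lowercase).
group_keywords = {"students": "student", "police": "police", "labour": "labour"}


def split_by_actor(rows, column, keywords):
    """Sort rows into groups by checking the actor entry for each substring."""
    groups = {name: [] for name in keywords}
    for row in rows:
        actor = row.get(column)
        # Skip missing or non-string entries instead of failing on them.
        if not isinstance(actor, str):
            continue
        actor_lower = actor.lower()  # case-insensitive matching
        for name, keyword in keywords.items():
            if keyword in actor_lower:
                # A row can land in more than one group if several actors match.
                groups[name].append(row)
    return groups


groups = split_by_actor(rows, "actor1", group_keywords)
```

Each list in `groups` could then be turned into its own data frame, for example by passing it to `pandas.DataFrame(...)`. With a real data frame, the outer loop would likely run over `DataFrame.iterrows()` instead of a list.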