I'm replacing my Oscar winner collections for categories that aren't included in the PMM default Oscar collection, e.g. actor, supporting actress, adapted screenplay, animated feature, etc.
I'm using the IMDB_award attribute of the IMDB builder. The problem is that in order to do this with IMDB_award, you have to use category_filter which are the section headings for each award on each page and what amounts to the same awards aren't consistently named across years and the names that IMDB uses don't necessarily match the award names from Oscar Wiki or the Academy's website.
For example, based on spot checking ~20 years I found 8 different section headings for the adapted screenplay category:
- Best Writing, Adaptation
- Best Writing, Screenplay
- Best Writing, Screenplay
- Best Writing, Screenplay Based on Material from Another Medium
- Best Writing, Screenplay Adapted From Other Material
- Best Writing, Screenplay Based on Material Previously Produced or Published
- Best Writing, Adapted Screenplay
- Best Adapted Screenplay
The Animated Feature category has only existed about 20 years and I found 3 different category names for it by spot checking 5 of those 20 years:
- Best Animated Feature
- Best Animated Feature Film of the Year
- Best Animated Feature Film
So, I can only think of 3 ways of being pretty confident that I'm capturing all of the category name variations.
- Go through each of the 90+ years and manually build a list and key them together to make my collection category filters. I really don't want to do this.
- Build a script to loop through & scrape the categories from each year's page based on a shared tag. I'm not a programmer, but I think I can do this, but it's going to be a significant time investment.
- Hope someone else has already built the keyed list and someone can point me to it.
Any thoughts or direction would be appreciated.
Thanks.