Our Algorithm
We calculate Diversity Scores to assess the gender and racial diversity of films, based on the directors, producers, writers, and top five billed actors.
Let’s look at the cast, crew, and algorithmic breakdown of The Sun is Also a Star (2019).
Diversity Score Algorithm
Our scoring algorithm considers each film’s producer(s), director(s), writer(s), and top 5 billed cast members and outputs a Gender Score and a Race/Ethnicity Score, each ranging from 0 to 10.
A score of 10 means that the film’s % of known diverse people matches or exceeds the US population’s % of diverse people.
- Each intersectional person (a woman or non-binary person of color) adds a bonus of 0.25. Intersectional people are significantly underrepresented compared to all other groups.
- People with multiple positions (e.g. producer and director) are counted for each credit. Holding multiple positions would afford them more power and control over the production.
- Every position is weighted equally.
Gender Score
- 51% of the US population describe themselves as women or non-binary (US Census 2020). Hollywood films underrepresent these genders (UCLA Hollywood Diversity Report 2020).
- We do not count people whose gender we do not know towards the Gender Score.
Race/Ethnicity Score
- 40.1% of the US population describe themselves as Asian, Black, Latinx, Middle Eastern/North African, Mixed/Multiple Race, or Native (global indigenous peoples) (US Census 2020). Hollywood films underrepresent these racial and ethnic groups (UCLA Hollywood Diversity Report 2020).
- We do not count people whose race or ethnicity we do not know towards the Race/Ethnicity Score.
Philosophy
While we chose to focus on gender and race, we understand that diversity comes in many forms — we could have accounted for LGBTQ+, age, ability, religion, body type, and more.
Gender and race have the most information readily available, making them easiest to focus on for our project. There is extensive research linking the lack of representation in these categories to stereotyping of diverse people (Sociology Compass 2015).
Using US Census data, we compare a film’s race and gender to America’s population to emphasize the lack of diversity in Hollywood. America is known for having a diverse population, and its film industry should reflect that.
In terms of who we consider, we prioritize the producers, directors, and writers because their positions are the most influential in a film’s production. They are also the three most mentioned positions in other algorithms/rating systems (e.g., GradeMyMovie, Reframe Stamp, F-Rating), created with filmmaker/industry input.
We include the top five cast members as well. Although our primary focus is behind-the-camera diversity, on-screen diversity is essential too! We want to ensure that diverse talent is recognized and celebrated in all aspects.
Data Collection
Our platform uses film, crew, and cast data from TMDb.
Our team manually inputs missing gender and race data through online research. We mark data as unknown if we do not find explicit information from IMDb or a valid news source. We do not assume a person’s gender or race/ethnicity solely on their name or how they look.