
Data is created, generated, collected, captured, and extended by humans. Even “raw” data is never truly untainted. The fact that any data set exists means that a human has already decided what signal to capture and how to go about capturing it. That’s why data is always incomplete and messy, and needs to be interpreted and analyzed. It can be biased through what is included or excluded, how it is framed, and how it is presented.