Crowdsourcing in the 21st Century Library, Museum and Archive

Early this month, the New York Public Library unveiled a new project called Emigrant City, built around the premise that important currents of New York City history are buried in a trove of bond and mortgage records from The Emigrant Savings Bank during the years 1841–1933.

There’s a unique twist to this project, however. To make sense of these newly digitized collections, the Library needs help from the public. It is using a microsite to recruit citizen volunteers to identify, transcribe and tag the records in this vast trove of data.

Inviting members of the public to participate in a significant project like this is an innovative move for this storied institution, and it is a prime example of how libraries, museums and archives can use “crowdsourcing” to improve the quality of their collections.

“Crowdsourcing” is defined by Merriam-Webster as “obtaining needed services, ideas, or content by soliciting contributions from a large group of people, and especially from an online community, rather than from traditional employees or suppliers.”

As the social web knits diverse populations together, public institutions are increasingly turning to the public to solicit their ideas and contributions on any number of different projects.

Crowdsourcing in museums, libraries and archives therefore represents a unique opportunity for these venerable institutions to carry their missions into dynamic digital form and, in doing so, build new relationships with the public.

Picture by Senior Airman Joshua Strang, via Flickr

How Do Libraries, Museums and Archives use Crowdsourcing?

In libraries, museums and archives, crowdsourcing can take a number of diverse forms including:





The Value of Open Data

Many institutions are also finding that there is tremendous value in opening the data they compile back up to the public.

Trevor Muñoz, Associate Director of MITH and Assistant Dean for Digital Humanities Research at the University of Maryland Libraries, has used data compiled from the “What’s on the Menu?” project to teach students about digital humanities data curation, and has written extensively about his process.

He says: “I think what the What’s on the Menu project showed was that that’s all great if you can do something fancy down the line, but maybe you can just dump out the database into a series of spreadsheets and let people download them from your website and then people will go off and do things with the data.”

Over at the Cooper Hewitt, Micah Walter has explained the value of opening data to the public and to interested developers.

Fostering a Full Range of Voices

Alice Backer started Afrocrowd to address the lack of diverse voices on Wikipedia, and she has spoken about why diverse representation is so important in crowdsourcing initiatives.

A recent edit-a-thon was held in partnership with the Museum of Modern Art, where participants were tasked with boosting the amount and quality of content dedicated to black artists such as Jean-Michel Basquiat.

Afrocrowd further underscores the need to document languages spoken by small and geographically isolated communities, as well as to expand the definition of what is “notable” within the boundaries of projects like Wikipedia.

Making Crowdsourcing a Simple Proposition

In 2008, Mary Flanagan was given a grant to develop a technology that can support crowdsourcing in museums. The result was Metadata Games, an open source platform that turns the task of tagging photographs and other collections data into a game for users. Currently, Metadata Games is being used by the British Library, Boston Public Library, The Open Parks Network, Digital Public Library of America, and the American Antiquarian Society, among others.

For many institutions, the desire to crowdsource is entirely consistent with a dedication to serving the public. Technology is simply making the possibilities increasingly accessible.

The Crowd Consortium for Libraries and Archives is dedicated to uniting leading experts in a conversation about crowdsourcing best practices. For more on the project, including case studies, visit our website at

