Knowledge for Development, A Machine Learning Approach

3 min readMar 22, 2022

The wealth of development data has not been mined or analyzed at scale limiting our understanding of whether or not investments are achieving competitive results on the ground. However, recent advances in Machine Learning enabled DevelopMetrics, a data science company in the field of international development, to aggregate the of findings from millions of research documents, project evaluations, and other key sources thereby unlocking incredible insights into what works in international development.

One foundational insight enabled by machine learning is a better understanding of project design. By ‘reading’ every project evaluation available, semi-supervised machine learning enables the aggregation of an incredible amount of information. One result of such an effort is a map of the trajectory of interventions across time and space revealing the details of portfolio composition. When analyzed at scale, portfolio composition is revealed as a sort of Rubik’s cube of interventions which combine in different patterns across time to form the design of different projects and the scaffolding of the portfolio itself. Such clarity around portfolio composition is a prerequisite for establishing what impact, if any, development projects have.

Taking USAID as an example, Figure One below maps portfolio trends over one hundred and twenty-three countries and sixty years within USAID’s agriculture portfolio. If Organizational Knowledge Theory, which postulates that organizations learn from past experience, is to be believed, then a fall in the popularity of an intervention would indicate that USAID learned from the failure of that intervention at achieving the desired result and implemented the necessary course correction.

Figure One: Popularity of USAID Agriculture Interventions Across Time (displayed as a proportion of the total agriculture portfolio)

Explore this visualization of USAID Agriculture Portfolio Trends in detail by accessing this Tableau Visualization.

For example, the orange line in Figure Two below tracks the popularity of the ‘Research and Development’ intervention across sixty years of agriculture projects implemented by USAID. Research and Development was a popular intervention up until the 1990s but then fell dramatically in popularity around 1995. Could this rise in popularity be related to the success of agricultural research and development during the green revolution? If so, why then did it go out of fashion? An alternative explanation could be that, contrary to Organizational Learning Theory, USAID is not learning from past experience but rather intervention implementation is more related to congressional earmarks or other external factors than to learning from past experience. Further research into the causal mechanisms behind the trends is required to better understand the full picture.

Figure Two: Popularity of the ‘Research and Development’ Intervention within Agriculture Projects Across Time

As stated by Lord Kevin “When you can measure what you are speaking about and express it in numbers, you know something about it. (Otherwise) your knowledge is meager and unsatisfactory kin; it may be the beginning of knowledge but you have scarcely advanced to the stage of science.” Machine Learning gives us this power to speak in numbers by quantifying a vast amount of qualitative information that hitherto could not have been analyzed by manpower alone. Thus, we are now able to better understand trends and forward our understanding, and ultimately our impact, within international development programing.

Knowledge for Development, A Machine Learning Approach

Written by Lindsey Moore