The Machine Learning Manifesto

After months of insightful discussion, feedback and research I venture and propose a list of 42 principles for more holistic machine learning initiatives

Published in

Unmanage

4 min readFeb 10, 2022

Data Engineers, Agile Practitioners, Machine Learning Operators and Engineers, Data Scientists, Privacy Lawyers and all people of the world affected by the radical economic and work place transformation that is revolutionising our thinking resist the rise of data myopia. Hence we declare:

Design and implement models in accordance to as well as stay informed on the latest thinking regarding the ethical use of artificial intelligence
Encourage and foster direct discussions between model designer and model user
Reduce intermediaries and hierarchies so that information can flow freely and with minimal distortions to the modeling teams
Explore rigorously and communicate the model evaluation metrics transparently and not rely on beneficial metrics to advertise success
Champion to avoid bias in data, opinion, and design through balanced staffing and data collection strategies
Exercise great care in the preparation and collection of both label and feature data
Communicate openly, transparently, and repeatedly the model mechanics so that anyone who wishes to understand the logic will be able to do so
For every model deployed, establish a model operations plan to monitor performance and data consistency, quality, completeness, and structural changes
Use documentation effectively to facilitate communication of model assumptions, input, key algorithm design aspects, and mechanics
Strive to collect sufficiently accurate, consistent, clean, and definitive labels and maintain a label update process
Establish a positive error culture in order to fail early and often and lead by example by admitting to mistakes and staying open to critique
Discourage building extensive central ivory tower data science teams
Recognize that excellent data management and especially documentation and quality of data are essential for trusted data inputs driving models that people feel confident of using
Encourage and help to establish AI acceptance before deployment rather than relying on passive tolerance or rejection, especially for models that rely on feedback loops
Write data input quality checks and integrate this step in the model design process and as far as possible into model evaluation
Work towards reduction of bias against AI results and include AI as a dimension of diversity and inclusion for organizations
Establish cross-functional teams that involve as many people as possible using the model results in the engineering process to reduce fear and rejection
Provide tangible perspectives of model operations, maintenance, and parameter improvement to people whose present work will change because of AI deployments
Listen to advice from model and data engineers that have been employed irrespective of their social, ethical, professional, or corporate position
Always stay informed and attuned to technical aspects of the work
Support the industry in the creation of appropriate design, operations, control, and monitoring procedures and frameworks
Recognize that artificial intelligence is affecting the core of what defines human nature and thus the special responsibility for care to affected parties
Strive to avoid cross-functional team specialization and teach the essence of data and model engineering so that every team member can contribute meaningfully
Prevent that unrealistic expectations for AI/ML model speed and accuracy are established
Do not try to break the laws of physics or mathematics through the use of machine learning
Complement AI/ML with appropriate statistical analysis both pre- and post-modeling; it’s not a competition but a symbiotic relationship
Allow models to learn and improve over time and recognize that the learning process may require potentially extensive human activity observation before models perform in the desired fashion, especially for such models with a feedback loop/human-in-the-loop/collaborative filtering
Evaluate models against data issues before deployment
Celebrate the engineers as the heroes of data, understanding their critical role and heed their recommendations and advice
Celebrate the non-technical professions in the data field as the heroes in their domains
Never play engineers and non-technical professions against each other but recognize that good models need collaboration between several disciplines
Document and keep updated the details of data elements such as attributes, code meaning, time and source as well as legal permissions always as close to the data record so that it is evident during data exploration
Create nimble machine learning structures with many happy AI-enabled teams across the organization
Establish recurring procedures to keep data, models, and the environment they operate in well-aligned
Design AI/ML models in such a way that they can be asked to unlearn and forget if this should become necessary
Keep track of model, data, and configuration versions used for training and deployment
Establish the legally permissible use of each data element at the point of data collection and, in the event of secondary data, get a legal opinion on how the input data can be used
Stay curious, learn and accept the fact that experiences are biased by the events that people have witnessed by chance during the course of their life
Leverage existing role definitions from agile and technical domains rather than creating firms specific niche roles that make it harder for everyone to navigate the many responsibilities in the field of AI
Help your leadership team to establish serving leader incentives
Build a business case and revenue/cost model for your initiative
Do not solve mathematical problems using machine learning models

Written in recognition of the Clue Train Manifesto with full references published in the free book Unmanage. The story does not end here. Please share your views or show support for the manifesto in the comments.

The Machine Learning Manifesto

After months of insightful discussion, feedback and research I venture and propose a list of 42 principles for more holistic machine learning initiatives

Written by Jean Voigt