Fact Check & Document All Assumptions | ML Tips #2

All assumptions, especially those underlying the project creation, should be documented and fact checked!

Fact Check & Document All Assumptions | ML Tips #2
Photo by John Schnobrich / Unsplash

There are two main type of assumptions in a data science project:

  1. Your own assumption that you build up during your project.
  2. Other assumption that are already there when you began your project.

The first kind isn't too problematic. Most data scientist will naturally document why they did this or that in their project. Whether in their analysis or in a report, it's just natural.

However, the second type of assumption are often completely overlooked or difficult to distinguish from basic facts. This is one of the reason why it is so crucial to ask a heck load of question before even starting any project.

Even statement about the usefulness of the project or the expect impact should be documented and fact checked.

Having this proper mapping in place will unlock two things:

  1. Allow you to control the project better and to steer it in the right direction.
  2. Allow you to raise flags fast if an assumption is proven incorrect throughout an analysis.

This will instantaneously make you a much more effective data scientist! ✅

Subscribe to Yacine's Machine Learning Help Desk

Don’t miss out on the latest issues. Sign up now to get access to the library of members-only issues.