Dagshub Is The Way To Go For Data Scientists
Hey, are you wondering what Dagshub is? and why I am blabbering that “it is the go-to for Data Scientists”?🤔 Hmm? If you are then buckle yourself because this is going to be fun🤑.
Now, What if I say that there is a place where you can not only store your Data Science projects and are also given the chance to experiment(This is going to be talked about a lot. Heads up!!) on them right there? If I was asked this question before getting to know dagshub I would probably laugh and ridicule the hell out! lol😂😅. But it is real! It is possible to do that, seriously!!
Let me cut to the chase. You can say that DagsHub is the GitHub for Data Scientists. The only change is that DagsHub can do a lot more things than GitHub and Gitlab.
One of the things is that we can cover the entire ML life cycle and we don’t even need any Dev ops.
The layout and the design of the website are quite literally the same as GitHub, making it easy for us to get into it. It also works the same way along with the inclusion of other methods as well. Today I am not going to show you how it is done, maybe in another article? Anyway, I am writing this article so that fellow data scientists or data science enthusiasts can get to know that this exists and they too can reap the benefits of this website.
We can perform experiments on our models and see the insights we get out of those experiments. We can make use of tools like MLflow, DVC, New Relic, Jenkins etc., which can be integrated into dagshub.
They made it possible to organize things in an orderly fashion. Like you have your notebooks in one place, your data in one place etc.,
As you can see above it is quite similar to GitHub. If you observe it has different tabs in the middle — All, Data, Models, Notebooks, DVC, Git. If you think about it, the organization of things are very effective.
The next thing I want to talk about is Collaboration. It is made effective. I mean it is somewhat similar to GitHub but here we can do much more like:
- Commenting on work
- Share with team
- Reproduce models with desired results
- Comparing different experiments
Take a look at the image below.
As I’ve said earlier that we can perform experiments. By that, I mean that we can try out different hypotheses on our models to see how they are working.
IT’S OPEN-SOURCE
The main reason for starting this platform was to tackle the problem of collaborating. As the existing tools available are more inclined towards software development rather than Data Science.
It is kind of similar to GitLab in this aspect. By being open-source it makes the development process transparent, where everyone can contribute.
I will say that DagsHub has similarities with GitHub and GitLab. The open-source thing from GitLab and the efficient organization of the data from GitHub was put together to make DagsHub.
DagsHub uses open-source protocols, so it’s fully portable and extensible.
SO WHY DAGSHUB?
DagsHub allows you to quickly build, share and reuse machine learning and data science projects eliminating the hassle for teams to start every time from scratch. Following are the features of DagsHub that makes it stand out from other traditional platforms:
Inbuilt tools like Git for source code tracking, DVC for data version tracking, and MLflow for experiment tracking, which allows you to connect everything in one place with zero configuration.
It also supports data science tools & frameworks you already use. Isn’t that great?
We can track the experiments using the dashboard provided. We can even compare experiments and visualize. You can go from experiment to source with a single click.
CONCLUSION
Even though there is Github and Gitlab, I think, as a Data Scientist it is more beneficial to use dagshub. Not only does it store the data but also helps in making the project better by giving access to experimentation on the platform along with collaboration with the team.
I hope that you found this article helpful and interesting. Let me know your thoughts in the comment section.
References
Also, check out my other works: