Hao Cai in Towards Data Science: Train and Deploy Fine-Tuned GPT-2 Model Using PyTorch on Amazon SageMaker to Classify News Articles. A tutorial for text classification using GPT-2 on Amazon SageMaker. 8 min read · Feb 3, 2022
Hao Cai in Towards AI: How to set up your environment for Spark. Spark is a very popular open-source big data framework. This article shows you how to set up Spark on your machine. 8 min read · Jan 15, 2022
Hao Cai in Towards Data Science: Build recommendation system using Scala, Spark and Hadoop. A movie recommender built using Scala and Spark on distributed Hadoop cluster. 6 min read · Jan 13, 2022
Hao Cai in Towards Data Science: Visualizing AI startups in drug discovery. As a machine learning researcher in the biology field, I have been keeping an eye on the recently emerging field of AI in drug discovery… 5 min read · Jul 2, 2020