Review on HIT summer school

I took part in the summer school hosted by Harbin Institute of Technology. The Topic is “Big Data and Business Analysis”. Though it’s only the fifth day of the summer school when I wrote this review, I already got a lot of good experience from it. I want to share with you what I learned, and hope you can join the fantastic program too in the near future.


Data Science is popular. It is so popular that when you write some articles on this topic, it will draw a lot of attentions from the audience.

I myself wrote several blog entries on WeChat public account. I talked about how to use Python to do some easy data science projects, such as word cloud drawing, sentiment analysis, decision trees, topic extraction and time series analysis, etc. They were popular on mass social media and websites for scholars. My articles on LDA was selected to the front page by one of the famous scholarly blogging websites .

These are the topics I taught my students in class. My MLiS students were excited when they first learned these new methods and tools. I want to maintain their curiosity and drill into the next level of research method learning, so I made the instructions easy to follow. However, they got so confident that they showed me their thesis just one day before the mid-term check.

I was dismayed while reading their thesis. They were totally tools-driven. I don’t know who brought this famous expression up, but I remember Warren Buffett quoted it several times.

To the one with a hammer, everything looks like a nail.

Dr. Arun Rai, the Editor-in-Chief of MIS Quarterly showed me the problem vividly with a picture talking about “street light effect”.

Good research should be problem-driven instead of tools-driven or method-driven. If your research is not problem-driven, how to deal with the problem?

Alok Gupta, the Editor-in-Chief of ISR gave us another advice on this issue — attend every class talking about research method in graduate school. In this way, you will have a toolbox instead of just a hammer. Define your research question first, talk with your colleagues or visitors before you dive into it.

I totally agree with him.


I learned how to use STATA and R to do different kinds of regression models in the summer school from Dr. Anindya Ghose, the world famous scholar and data scientist.

Dr. Ghose gave us 5 different lectures (16 hours in total). Not only did he show us how the algorithmic advertisement ecosystem works, but used concrete example to teach us how to use various kinds of regression models to analyze different business scenarios as well.

I learned statistics and R systematically from Coursera courses produced by Duke University and did tons of exercise. However, it’s the first time I realized when and why shall we use log() on continuous variables in OLR. I found STATA very easy to use and Dr. Ghose corrected several mistakes in my R models.


The best experience is to get instant feedback from the top scholars in the world.

In the Q&A session, I asked different speakers a lot of questions.

For instance, I used LDA in my researches, but I was not so sure about the result, for the algorithm asks the researcher to set the parameters arbitrarily.

Dr. Rai solved my problem with a very smart solution: show the result to field experts, and tell them it’s my colleague’s work.

I used simulation methods in my series studies on fake information diffusion control on social media. A reviewer once criticized on my agent settings. To him, the rule based agent was too simple. I asked Dr. Gupta if I could use deep learning to replace the simple rules of agents. He gave me a positive answer under certain conditions. I found his answer very valuable.

My current research focuses on privacy issues on social media platforms. So I was interested when Dr. Ghose talked about the companies which can trace customer behavior on individual level. Dr. Ghose’s answer (also in his new book, TAP: unlocking the mobile economy) opened my eyes and give me new thoughts on the following research works.

Not only the instructors answered my question face to face. Some of them emailed me and gave me resources to follow up.

To tell the truth, I never expected the top scholars in the world were so warm-hearted and eager to help. I was moved deeply.


In HIT summer school, I am no longer a college professor, but only an elder student. In my class, there are more than 20 selected professors and managers from nation wide.

We had a busy but good time in Harbin. The ice-breaking activities went smoothly on Central Street and Saint Sophia Cathedral.

We got together and discussed a lot of different topics in a lovely local bar in the evenings.

The work load was heavy, but we succeeded in helping each other to get through the problems in learning and got the concepts and practical methods from the lecturers.

As we came from different regions and had various kinds of backgrounds, soon we found a lot of opportunities to cooperate, both in the academic world and the industries as well.

We organized as a strong study group and made a lot of (good) peer pressure on each other. Without my classmates’ encouragement and help, I never expected I will finish my summer school report in so many words.


I really appreciate the efforts made by teachers and students from Harbin Institute of Technology. Thanks to them, I had this amazing opportunity to meet so many fantastic scholars and colleagues.

When I go back to Tianjin, I will help HIT to promote this program. Hope more and more professors and students can join the summer school in the future and have a marvelous experience.

Big Data is a Big issue. Hope the HIT summer school on Big Data and Business Analytics become more and more successful!

Thank you, HIT!

Shuyi Wang

Tianjin Normal University