A Complete 20 Week Curriculum to Become a Data Analyst in 2022

Nishesh Gogia
7 min readSep 1, 2022

--

Invest 20 weeks to crack one of the most exciting jobs in IT domain…

To get what you want, you have to deserve what you want.” — Tai Lopez.

If you want to read more about data science related stuff like-:

1-Data science Interview Questions

2. Machine Learning Case Studies for domains like Digital Marketing

3. Recommendation Systems

4. Probability and Statistics Questions

5. Visualization Plots

You can follow me here for more…

Introduction

Hi Guys, Today let’s talk about how to start your journey to get a entry in IT domain as a DATA ANALYST. It does not matter which field you belong to, which degree you opt, if you follow this 20 week Curriculum, there is a high probability that you will crack a Data Analyst job comfortably.

To crack any job interview you need to understand the business requirement, you need to be persistent and consistent with your work, you should have a decent portfolio and you should stick with your one plan.

I have seen people changing their paths on the daily basis, by doing that you won’t reach to any destination, you will just move in circles. So make your plan and stick to it.

Before we Begin…

Before we begin, A couple of notes before we dive into it:

I am assuming that you can spend at least 2–3 hours daily or 14–16 hours weekly to prepare for data analyst role

This will not cover everything that you need to know to be a fully equipped data analyst. This will cover what I believe are the most fundamental skills of a data analyst.

This curriculum will not include anything related to deep learning . Deep learning deserves its own 20 weeks on its own — it would be a disservice if I tried to squeeze it in!

Let’s get started…

COURSE STRUCTURE

WEEK-1 TO WEEK-4 (Python Coding Language)

WEEK-5 TO WEEK-7 (Probability and Statistics)

WEEK 8 TO WEEK 10 (SQL/PowerBI)

WEEK 11 TO WEEK 12 (Basic Mathematics)

WEEK 13 TO WEEK 17 (Exploratory Data Analysis)

WEEK 18 TO WEEK 19 (Dimension Reduction Techniques)

WEEK 20 (Basic Machine Learning Algorithms)

WEEK-1 TO WEEK-4(Python Coding Language)

From Week-1 to Week-4, We work on the basics of coding, it’s very important to have the basics of python programming when you start this journey, knowing these topics would help…

  1. Keywords/Variables
  2. Control Flow statements
  3. Simple For Loop/While Loop
  4. Recursion
  5. OOP(Object Oriented Programming)
  6. Multithreading
  7. File Handling(Exception Handling)
  8. Numpy and Pandas Library

And lots and lots of practise…

If you have time and energy to learn all this online for free, let me give you some good references…

Python Basic

Python VideoPlaylist Hindi

Python VideoPlaylist English

Python Practice Coding Questions

If you have tried youtube but you did not understand much, you can also join

“CODE WITH ME” Session on DataScienceWala.com

“Code With Me” is an initiative started by me to built a habit of coding, After training more than 2000 students, i have realised most of them lack in coding and that is the reason of rejections at most of the companies.

Also the fee for this is less than your weekend party bill!!!

WEEK-5 TO WEEK-7 (Probability and Statistics)

Now if we are done with the basic coding, it’s important to understand the basic probability and Statistics which is the favourite topic of lot of interviewers.

See as a Data Analyst, you need to deal with lot of data so Statistics can give you tools and resources which can enhance the analysis part.

knowing these topics would help…

(Check my blog on this…)

  1. Basics of Probability theory
  2. Conditional Probability
  3. Bayes Theorem
  4. Basics of permutation and combination
  5. Expectations and Variance
  6. Descriptive Statistics
  7. Central Tendency and Deviations
  8. Plots like Q-Q Plot, Histogram, PDF,CDF,Box Plot, Count Plot, Violin Plot,KS Plot
  9. Normal Distribution
  10. Uniform Distribution
  11. Log Transformation
  12. Kurtosis
  13. Box-Cox Transform
  14. Inferential Statistics
  15. Hypothesis Testing(Check my blog on this…)
  16. Confidence Interval
  17. A/B Testing

Knowing these topics will help you in the interview, I have also written lots of blogs on statistics, if time permits, please check my account.

You can also checkout these amazing YouTube videos,

Statistics Playlist

Probability Playlist

Amazing Visualisations

WEEK 8 TO WEEK 10 (SQL)

Sql is again a very important topic with respect to interviews perspective, Lot of times you will be asked SQL based questions.

Specially for data analyst job roles, SQL is again a favourite topic.

knowing these topics would help…

  1. Difference Between SQL AND MYSQL
  2. DataBase Management Systems
  3. Order of Keywords
  4. JOINS
  5. Wildcard Operators
  6. Basic Queries involving GroupBy, Having.
  7. Inner Queriers
  8. Practise One or More PowerBI dashboards

let me give you some good references…

SQL FULL COURSE

SQL PRACTISE

POWER BI DASHBOARDS

WEEK 11 TO WEEK 12 (Basic Mathematics)

Maths is topic which will give you an edge in the interviews, let me share the topics you can prepare.

  1. Vectors( Topics like Angle between 2 vectors, Projection of one vector onto another vector, Dot Product)
  2. Matrices(Addition, Multiplication of matrix)
  3. Linear Algebra(Topics like Euclidian distance, distance of a point to a line, equation of plane, equation of hyperplane, distance of a point to a hyperplane)
  4. Differentiation(Basics of differentiation, chain rule, Addition of derivative, Division of derivative)

The best source to learn basic maths is KHAN ACADEMY.

WEEK 13 TO WEEK 17 (Exploratory Data Analysis)

In EDA, you can download any data from Kaggle, and start doing data analysis. You can keep few topics in mind while doing data analysis.

  1. Scaling of features/Columns, Types of Scaling.(Check my blog on this…)
  2. Converting Categorical Features to Numerical Features(ONE HOT ENCODING)
  3. Also see libraries like Numpy and Pandas, see their functionalities.
  4. UNIVARIATE ANALYSIS(Histogram, Pdf, Cdf, Line plot, Bar plot, Count plot,Box Plot, Violin Plot)
  5. Bivariate Analysis(Scatter Plot, Pair plot)

Some frequently asked questions in this area…

  1. How to handle missing values in data?
  2. How to handle imbalance data?
  3. How to handle lot of dimensions/features/columns?
  4. What is the difference between classification and regression?
  5. What is supervised and unsupervised learning?

EDA Resources

WEEK 18 TO WEEK 19 (Dimension Reduction Techniques)

This is a bit advance topic but will help you in understanding the data world in much more depth.

Check my blog on this…

Generally asked questions are…

  1. What is curse of dimensionality?
  2. Why do we have to reduce the dimension in the data?
  3. What are the techniques to reduce the dimensions in data?
  4. What is PCA?
  5. What is T-SNE?
  6. How time complexity is related to dimensionality?

Dimension Reduction Resources

WEEK 20 (Basic Machine Learning Algorithms)

Again Learning the basic algorithms wont be a bad option, let me give you some questions which you can prepare.

Check my blog on this…

  1. What is KNN?
  2. Difference between KNN and K-Means?(a very famous questions, don’t know the reason though)
  3. How Naive Bayes works?
  4. What is the assumption of Naive Bayes algorithm?
  5. What is Logistic Regression?
  6. What is sigmoid function?
  7. What are the failure cases of KNN?
  8. Can logistic regression handle outliers?

Machine Learning algorithm resources

So this was WEEK BY WEEK schedule for 20 weeks, This blog obviously can’t claim that it will cover all the data analyst interviews but this blog covers almost 80–90% of the topics which can be asked in Data Analyst Interviews.

Portfolio Building

It is very important to build a good portfolio, some of the suggestions are…

  1. Have a good and updated LinkedIn profile
  2. Have an updated Resume(You can use ZETY to make a good resume)
  3. You can use portals like Indeed.com or Naukri.com to search jobs.
  4. Start writing technical blogs on medium, This shows your passion and consistency for the subject.
  5. Have an updated Github profile.

Cracking an interview is a skill, I wish you all the best for your interviews.

Thank you for reading….

Nishesh Gogia

If you enjoyed this, follow me here for more

Interested in collaborating? Let’s connect on Instagram

It would be great if we can connect on LinkedIn.

--

--