Online Learning Open Datasets

Josh Johnson
Feb 23 · 3 min read
Photo by JESHOOTS.COM on Unsplash

The opportunities for humans to contribute to the work of the world are changing rapidly. Businesses growing to take advantage of these opportunities need workers with new skills. Employers are hiring programmers, data scientists, web developers, and leaders, but there are not enough folks with the right skills to fill the need. This is true of many industries.

Education is expensive. Traditional teachers have to be multi-talented, highly educated, passionate, and hard-working. Teaching and assessing are done manually and at considerable expense of time and money. However, we live in a magical age where data-driven, accessible, personalized, and effective virtual education is possible. But how does an enterprise achieve those lofty goals? Data and data science are the answers.

I jumped into learning analytics with a project to predict course outcomes from online study patterns in user data, and I had a hard time finding data on learning and learning-platform interaction patterns. Fortunately, I found this dataset from the Open University, which has been a well-used source for study.

Traditional educational research is hard. Kids are squirrelly and inconsistent. Classrooms are unique and external variables abound, making controlled experimentation difficult. Group sizes are small and variables differ greatly between classrooms. The data is labor-intensive to collect and often subjective.

Photo by NeONBRAND on Unsplash

Educational technology and distance learning providers have opportunities for rigorous research with hard numbers on large samples of students. Every interaction with the computer can be tracked and anonymized to find correlations between product features, user behaviors, and learning outcomes, and those findings can be used to iteratively redevelop course standards and interactive tools. The data is, in principle, available, and with many interactive cloud-based learning platforms being adopted by schools, it may begin to span many years. Researchers can then model correlations between product features from years ago and later outcomes, making longitudinal learning research cheaper to conduct.
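
As a rough illustration of that idea, here is a minimal Python sketch, not any platform's actual pipeline. It replaces student identifiers with keyed hashes, totals each student's clicks, and computes a Pearson correlation between activity and final score. The event log, the scores, the secret key, and every function name are made up for the example. Keyed hashing is pseudonymization rather than full anonymization, so a real release would need more care than this.

```python
import hashlib
import hmac
from collections import defaultdict
from math import sqrt

# Hypothetical interaction log: (student identifier, clicks in one session).
RAW_EVENTS = [
    ("alice@example.edu", 12),
    ("bob@example.edu", 3),
    ("alice@example.edu", 20),
    ("carol@example.edu", 9),
    ("bob@example.edu", 2),
    ("dave@example.edu", 15),
]

# Hypothetical final course scores for the same students.
RAW_SCORES = {
    "alice@example.edu": 88,
    "bob@example.edu": 52,
    "carol@example.edu": 61,
    "dave@example.edu": 70,
}

# Assumption: the key is stored separately and never published with the data.
SECRET_KEY = b"replace-with-a-real-secret"


def pseudonymize(student_id, key=SECRET_KEY):
    """Map an identifier to a stable keyed hash so records can still be joined."""
    digest = hmac.new(key, student_id.encode("utf-8"), hashlib.sha256)
    return digest.hexdigest()[:16]


def total_clicks(events):
    """Sum clicks per pseudonymized student."""
    totals = defaultdict(int)
    for student_id, clicks in events:
        totals[pseudonymize(student_id)] += clicks
    return dict(totals)


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


clicks = total_clicks(RAW_EVENTS)
scores = {pseudonymize(sid): score for sid, score in RAW_SCORES.items()}

# Align the two tables on the pseudonymized key before correlating.
students = sorted(clicks)
r = pearson([clicks[s] for s in students], [scores[s] for s in students])
print(f"Correlation between total clicks and final score: {r:.2f}")
```

With only four invented students the number itself means nothing. The point is the shape of the workflow: the shared table carries hashed keys instead of names, and the behavior-to-outcome comparison still works.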

If data is collected ethically and anonymously, according to FERPA guidelines, the community of online educational institutions has the opportunity to share research, data, and data-driven practices for learning analytics and modeling.

Data gathered by online learning providers should be, as a norm, anonymized and provided as public datasets. These datasets can be a source of important and generalizable learning research by academic, professional, and amateur data scientists. Modern data science tools make data and analysis much more transparent than they used to be.

Photo by Dose Media on Unsplash

If marketable skills can be efficiently built in a diverse range of learners, as is the mission of companies like Coursera and Udemy, we can re-employ millions of people in higher-paying, much-needed jobs or business ventures.

Open source online learning datasets can be a source of important new research and advance the science of learning.

Written by Josh Johnson

I'm a data scientist with a background in education. I empower learners to become the folks they want to be.

Age of Awareness

Stories providing creative, innovative, and sustainable changes to the ways we learn
