10 Csvkit Commands You Should Know As A Data Engineer
Learn how to use the csvkit and psql command-line tools to analyze, transform, and move data across systems
12 min read · Feb 15, 2021
Introduction To Csvkit
csvkit is a command-line toolkit, distributed as a Python package, designed to explore, transform, and move comma-separated datasets across systems.
Although csvkit is often presented as a quick alternative to programming languages for data science tasks, it really unleashes its true potential…