TSV, better than CSV

Lately I’ve been getting a few questions on why one should use .tsv files since CSVs are “stardard” and virtually everyone knows how they work — so I wanted to clarify this once and for all.

CSVs ARE A PAIN IN THE BACK. USE TSVs SINCE THEY LIKELY WON’T SCREW AROUND WITH YOUR SOURCE DATA.

To put it differently:

TSV is an alternative to the common comma-separated values (CSV) format, which often causes difficulties because of the need to escape commas — literal commas are very common in text data, but literal tab stops are infrequent in running text. The IANA standard for TSV achieves simplicity by simply disallowing tabs within fields.

Wikipedia Tab-separated Values

Do yourself a favor and stop using CSVs, transition over to TSVs since they’re insanely less error-prone — say goodbye to sleepless nights trying to figure out why funny data has been uploaded to the DB.

If you’re worried about converting old CSV documents, python has the answer:

It’s then a matter of pipes: cat file.csv | csv2tsv.sh and you’re done.

Originally published at odino.org (23rd March 2018).

--

--

CTO at @NamshiDotCom, I like distributed systems, Golang, NodeJS, scalability, software design and µseconds. Writing https://leanpub.com/wasec :wq

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Alex Nadalin

CTO at @NamshiDotCom, I like distributed systems, Golang, NodeJS, scalability, software design and µseconds. Writing https://leanpub.com/wasec :wq