Automunge
Published in

Automunge

Hashed Categoric Encodings with Automunge

High cardinality, low overhead

=> '0f44cb01d838c981156d9f0c030159fb'
['unique', 'entries', 'in', 'feature', 'set']=> {'entries' : 0, 'feature' : 1, 'in' : 2, 'set' : 3, 'unique' : 4}
"Hashing is a form of cryptography in which a message is transformed into an encoded representation."=> 20295613598449223719803640046900304379
"Hashing is a form of cryptography in which a message is transformed into an encoded representation."=> 9
['unique', 'entries', 'in', 'feature', 'set']=> [6, 2, 8, 7, 7]
“Unique entries (in feature set).”=> [6, 2, 8, 7, 7]

Acknowledgments

References

Machine Learning Design Patterns — Valliappa Lakshmanan, Sara Robinson, Michael Munn

Machine Learning Design Patterns

The Soft Bulletin — The Flaming Lips

The Soft Bulletin
The Soft Bulletin — The Flaming Lips

--

--

Automunge —Prepare your data for Machine Learning

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Nicholas Teague

Writing for fun and because it helps me organize my thoughts. I also write software to prepare data for machine learning at automunge.com. Consistently unique.