Jenny LeeinTowards Data ScienceBenchmarking Language Detection for NLPFour Python tools for identifying the language of your text and a speed and accuracy testNov 16, 20204Nov 16, 20204
Jenny LeeinTowards Data ScienceImprove Your EC2 SSH Workflow Using argparsePort forwarding, remote jupyter notebook access, and tmux auto-startMay 23, 2020May 23, 2020
Jenny LeeinThe StartupThere’s More Than Active and Passive Voice in EnglishCool transitivity alternations you didn’t know existed in EnglishJan 30, 20204Jan 30, 20204
Jenny LeeinThe StartupFive Most Useful Pathlib OperationsAre you still using os.path?Jan 4, 2020Jan 4, 2020
Jenny LeeinTowards Data ScienceCan You Become a Data Scientist Without a Quantitative Degree?A story and some insightsDec 29, 20193Dec 29, 20193
Jenny LeeinTowards Data ScienceText Feature Extraction With Scikit-Learn PipelineUsing 2010 primary debate transcriptsDec 13, 20194Dec 13, 20194
Jenny LeeinThe StartupA Super Quick Guide to Randomized (or Grid) Search with PipelineFive steps using scikit-learnDec 13, 20191Dec 13, 20191
Jenny LeeinTowards Data ScienceBeyond Speaking Time: An Analysis of Democratic Presidential DebatesData preparation and feature engineering for predictive modeling using real-world dataDec 8, 20191Dec 8, 20191
Jenny LeeinTowards Data ScienceWriting Linguistic Rules for Natural Language ProcessingWith a guide to question type extraction with spaCyNov 28, 20196Nov 28, 20196
Jenny LeeinTowards Data ScienceHow to Build a Reusable Custom NLP Pipeline with Scikit-LearnWith an emphasis on feature engineering and trainingNov 22, 20193Nov 22, 20193