Jenny LeeinTowards Data ScienceBenchmarking Language Detection for NLPFour Python tools for identifying the language of your text and a speed and accuracy test·5 min read·Nov 16, 2020--4--4
Jenny LeeinTowards Data ScienceImprove Your EC2 SSH Workflow Using argparsePort forwarding, remote jupyter notebook access, and tmux auto-start·5 min read·May 23, 2020----
Jenny LeeinThe StartupThere’s More Than Active and Passive Voice in EnglishCool transitivity alternations you didn’t know existed in English·7 min read·Jan 30, 2020--4--4
Jenny LeeinThe StartupFive Most Useful Pathlib OperationsAre you still using os.path?·3 min read·Jan 4, 2020----
Jenny LeeinTowards Data ScienceCan You Become a Data Scientist Without a Quantitative Degree?A story and some insights·16 min read·Dec 29, 2019--3--3
Jenny LeeinTowards Data ScienceText Feature Extraction With Scikit-Learn PipelineUsing 2010 primary debate transcripts·10 min read·Dec 13, 2019--4--4
Jenny LeeinThe StartupA Super Quick Guide to Randomized (or Grid) Search with PipelineFive steps using scikit-learn·3 min read·Dec 13, 2019--1--1
Jenny LeeinTowards Data ScienceBeyond Speaking Time: An Analysis of Democratic Presidential DebatesData preparation and feature engineering for predictive modeling using real-world data·12 min read·Dec 8, 2019--1--1
Jenny LeeinTowards Data ScienceWriting Linguistic Rules for Natural Language ProcessingWith a guide to question type extraction with spaCy·15 min read·Nov 28, 2019--6--6
Jenny LeeinTowards Data ScienceHow to Build a Reusable Custom NLP Pipeline with Scikit-LearnWith an emphasis on feature engineering and training·7 min read·Nov 22, 2019--3--3