Allison Stafford in Towards Data Science. Binning Records on a Continuous Variable with Pandas Cut and QCut. When, why, and how to transform a numeric feature into a categorical feature. Sep 29, 2021
Allison Stafford in Towards Data Science. Mapping Census Data. How to access and map population data in Python. Sep 19, 2020
Allison Stafford in Towards Data Science. Searching or Sorting a list of Objects Based on an Attribute in Python. When you want to find which (dog) student works well with the highest number of other (dog) students. Apr 29, 2020
Allison Stafford in Towards Data Science. Changing the Frequency (Precision) of Time Data in Pandas. Using Pandas’ resample() method to adjust time features. Mar 6, 2020
Allison Stafford in Towards Data Science. Extracting data from semi-structured tweets using Pandas and regex. Using Series string functions and regex to extract numeric data from text. Feb 29, 2020
Allison Stafford in Towards Data Science. Using ColumnTransformer to combine data processing steps. Create cohesive pipelines for processing data where different columns require different techniques. Feb 22, 2020
Allison Stafford in Towards Data Science. Creating Beautiful Sankey Diagrams with floWeaver. Stepping up your Sankey game in Python. Feb 15, 2020
Allison Stafford in Towards Data Science. How to Split Shapefiles. Cutting the western Aleutian Islands off Alaska (they’re mostly uninhabited). Feb 7, 2020
Allison Stafford in Towards Data Science. Natural Language Processing with PySpark and Spark-NLP. Diving in to the text of Financial Services Consumer Complaints. Feb 5, 2020
Allison Stafford in Towards Data Science. Data Prep with Spark DataFrames. Using PySpark to continue investigating the Financial Services Consumer Complaint Database. Jan 25, 2020