In-Depth Analysis.

Excel approximate match-fuzzy match-up

The powerful excel tool for matching names or similar text.

Bena Brin
Analytics Vidhya

--

Click here to check on fuzzy lookup for excel

Have you ever attempted to use VLOOKUP in Excel but been frustrated

when it does not return any matches? Developed by Microsoft and available for free, Fuzzy Lookup is an Excel add-on that takes an input, searches for the best match it can find, and returns that best match along with a similarity rating.

Fuzzy Lookup utilizes advanced mathematics to calculate the probability that what it finds matches up with your search entry, which means the tool works even when characters (numbers, letters, punctuation) do not match up exactly. Think of it as a beefier version of VLOOKUP that is more flexible and even easier to use.

Comparing Similarity of texts on two columns using Fuzzy Match Array formula

Fuzzy Match Array formula allows to quickly compare texts in two columns

Fuzzy matching array has given me the ability to quickly make sense of unorganized client data and draw conclusions that otherwise would have taken hours to discover. To illustrate the main functionality of Fuzzy Lookup, here are a few examples that this tool identified as similar (similarity scores range from 0 to 1, with 1 being the highest similarity possible):

You can see how each entry on the left is technically different than the corresponding entry to the right, but Fuzzy Lookup recognized that there is a chance they really mean the same thing. Fuzzy Lookup returns a probability score for each pair, which means you can quickly sort out, edit, and compare lists like these.

This tool is useful if you have a big list of names that were not entered in a consistent manner, or if some entries are abbreviated and others are not.

Lookup for similar Texts using Fuzzy lookup

Note: This is a not a ‘deep dive’ into Fuzzy Lookup tool settings. This is a quick-start guide for using this tool to make a simple comparison between two lists.

  1. Install the latest version of Fuzzy Lookup by accessing the link here. Or you can search it by clicking excel Developer tab then addins then search Fuzzy lookup on office add-ins

2. Confirm you have Fuzzy lookup add-in on the task bar and click on it

3. Follow the steps below to use the add-in

6. Select the Similarity Threshold you want Fuzzy Lookup to use (I find 0.75 is a good starting place):

7. Select a cell to serve as the insertion point for the Fuzzy Lookup table that is about to be created, then select ‘Go’ on the Fuzzy Lookup tool to finish the comparison and examine the results.

Other Tips and Uses

  1. Always use the excel worksheet you want you data to be appended as table.
  2. If you are working with a large list that produces duplicate results (this happens if the best match is the same for multiple entities you search for), sort similarity (low to high) and apply conditional formatting to the column with the duplicates. When you encounter duplicates you can decide if you want to keep those results or delete them and search manually for a better match.
  3. Fuzzy Lookup is great for large lists and is a perfect option for entries that are difficult to read such as long strings of random text or numbers.

--

--

Bena Brin
Analytics Vidhya

I am Risk Consultant working for a Swiss Fintech. I help Banks fight Fraud using big data technologies.Data Science/Machine Learning/