100 Scripts in 30 Days challenge: Script 5 & 6 — Fast Document Search & Indexing using Whoosh

DailyPriyab
gopy
Published in
1 min readApr 13, 2017

Many a times I wanted a ready to use search option in Python but apart from some crude examples I never found a salable solution. And thats when I found Whoosh.

In this post I share 2 scripts. In my first “index_folder.py” script I show how we can index documents in a folder and its sub-folders.

In my second script “search_index.py” I share a simple way of searching the index and showing the results in an Html format with highlights of sections where you get a match.

More details about Whoosh can be found in the below link:

The code is given below.

For any further queries or feedback please contact me on twitter https://twitter.com/twitmyreview.

--

--

DailyPriyab
gopy
Editor for

Data Engineering | Data Governance | Azure | Spark | Python | Manager