100 Scripts in 30 Days challenge: Script 5 & 6 — Fast Document Search & Indexing using Whoosh
Many times I have wanted a ready-to-use search option in Python, but apart from some crude examples I never found a scalable solution. That's when I found Whoosh.
In this post I share two scripts. The first, “index_folder.py”, shows how to index the documents in a folder and its sub-folders.
The second, “search_index.py”, shows a simple way to search the index and display the results as HTML, with the matching sections highlighted.
More details about Whoosh can be found at the link below:
Whoosh was created by Matt Chaput. You can view outstanding issues on the Whoosh Bitbucket page and get help on the… (whoosh.readthedocs.io)
The code is given below.
For any further queries or feedback, please contact me on Twitter: https://twitter.com/twitmyreview.