Published in CASL Project

Building a Question Answering System Part 2: Document Retrieval

Authors: The CASL Team

CASL Forte
Example: Open Research Dataset

Environment Setup

Option 1: Using Conda

conda activate forte_qa

Option 2: Using Python venv

source env/bin/activate

Clone and cd into our repo

Set PYTHONPATH

export PYTHONPATH=$(pwd):$PYTHONPATH

Build the Elasticsearch index

In another terminal, start an Elasticsearch backend

./bin/elasticsearch
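
Before switching terminals, it can help to confirm that the backend is actually accepting connections. The following is a minimal sketch, not part of the Forte repo: the helper name `elasticsearch_is_up` is hypothetical, and it assumes the server listens on Elasticsearch's default HTTP port 9200 on localhost with no authentication enabled. If your setup differs, adjust `base_url`.

```python
import json
import urllib.error
import urllib.request


def elasticsearch_is_up(base_url="http://localhost:9200", timeout=2.0):
    """Return True if a GET on base_url yields a JSON object with a "version" key.

    Hypothetical helper for illustration; the URL is an assumption
    (Elasticsearch's default port, no authentication).
    """
    try:
        with urllib.request.urlopen(base_url, timeout=timeout) as response:
            info = json.loads(response.read().decode("utf-8"))
    except (urllib.error.URLError, OSError, ValueError):
        # Connection refused, timeout, HTTP error, or a non-JSON body.
        return False
    return isinstance(info, dict) and "version" in info
```

Calling `elasticsearch_is_up()` from a Python shell should return `True` once the server has finished starting. A `False` result usually means the server is still booting, is bound to a different port, or requires credentials.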

Switch back to the first terminal and build the index

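# Index the sample data
python examples/pipeline/indexer/cordindexer.py --data-dir sample_data/cord_paper
# Or index the full CORD-19 dataset (replace the placeholder with your local path)
python examples/pipeline/indexer/cordindexer.py --data-dir <path-to-cord19-dataset>/document_parses/pdf_json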
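
If the indexer seems to do nothing, a quick look at the data directory can rule out a wrong path. The sketch below is an illustration only and is independent of the indexer script: the function name `summarize_parses` is hypothetical, and it assumes each file in the directory is a CORD-19 style JSON parse whose title lives under `metadata` then `title`.

```python
import json
from pathlib import Path


def summarize_parses(data_dir, limit=3):
    """Return (number of *.json files, titles of the first `limit` files)."""
    paths = sorted(Path(data_dir).glob("*.json"))
    titles = []
    for path in paths[:limit]:
        with path.open(encoding="utf-8") as handle:
            paper = json.load(handle)
        # Assumed layout: {"paper_id": ..., "metadata": {"title": ...}, ...}
        titles.append(paper.get("metadata", {}).get("title", ""))
    return len(paths), titles
```

For example, `summarize_parses("sample_data/cord_paper")` returns the file count and a few titles. A count of zero suggests the `--data-dir` argument points at the wrong folder.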

Recap: Question Understanding as a Forte pipeline

Extending the Forte pipeline with Document Retrieval

Looking forward

Why use Forte?

About CASL
