Skip to content

Getting Started

This project provides several search engines to explore Randomized Clinical Trial (RCT) protocols from URC Saint Louis

Project overview


Quick Start

You have two options to start using the RCT search engines:

Use the Hosted Version

No setup is required. Simply visit the web application: RCT_Search_Engine

Run Locally via Git and Docker

  • Clone the repository:
git clone https://github.com/ohassanaly/rct_rag.git
cd rct_rag
  • Build and run the Docker container:
docker build -t rct_search .
docker run -p 8080:8080 rct_search
  • Open your browser at http://localhost:8080 to access the application.

Demos

You have access to three search engines:

  • Similarity - demo
    Retrieves the most relevant protocols based on semantic keyword-based retrieval (TF-IDF embedding under the hood).

  • Key-Words - demo
    Finds protocols containing the exact keywords of your query, supporting wildcards (*) adn operators(AND/OR) for flexible matching.

  • Database - demo
    Allows direct browsing and exploration of database protocols.


Project Structure

  • builder – Handles data preparation and construction of the search engines
  • search_engine – Contains the application logic and user interface
  • sparse_search_engine – Details on the Similarity Search Engine