spopkin.github.io

A multi-PDF search utility


2018-03-18: Please note that this site is presently undergoing construction, as I have decided to switch over to Jekyll.

PDF RuleFinder

The PDF RuleFinder is a simple Node.js-based utility that finds the page in a set of PDF files that best matches a set of keywords. Essentially, it's used for searching D&D PDFs to find specific rules based on the keywords passed in.

This project is actually the first of a broader set of tabletop gaming utilities that I intend to work on.

GitHub: https://github.com/spopkin/Dungeon-Master-Toolkit

List of Technologies and Techniques Used:

How it Works

When started with the --refresh-db flag, the utility uses pdf-text-extract to extract the text from every PDF file in a preconfigured rulebook directory, then inserts the extracted text into a MongoDB entry for each book. Later, when a search is invoked, it searches through those extracted text entries and ranks the pages by how likely each one is to correspond to the searched-for keywords.
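The ranking step can be illustrated with a minimal sketch. The project's actual scoring rule is not described here, so this example assumes the simplest possible one: count how many times each keyword occurs on a page and sort by the total. The function names (countOccurrences, rankPages) and the entry fields (book, page, text) are hypothetical and chosen only for illustration.

```javascript
// Hypothetical sketch: rank extracted pages by how often the keywords appear.
// The field names (book, page, text) and the scoring rule are assumptions,
// not taken from the project's source.

// Count non-overlapping, case-insensitive occurrences of `word` in `text`.
function countOccurrences(text, word) {
  const haystack = text.toLowerCase();
  const needle = word.toLowerCase();
  if (needle.length === 0) return 0;
  let count = 0;
  let index = haystack.indexOf(needle);
  while (index !== -1) {
    count += 1;
    index = haystack.indexOf(needle, index + needle.length);
  }
  return count;
}

// Score every page, drop pages with no hits, and sort best match first.
function rankPages(pages, keywords) {
  return pages
    .map((entry) => ({
      book: entry.book,
      page: entry.page,
      score: keywords.reduce(
        (sum, kw) => sum + countOccurrences(entry.text, kw),
        0
      ),
    }))
    .filter((result) => result.score > 0)
    .sort((a, b) => b.score - a.score);
}

// Example input shaped like one entry per extracted page (made-up data).
const samplePages = [
  { book: "Rules A", page: 1, text: "Grapple checks and grapple rules." },
  { book: "Rules A", page: 2, text: "Spell slots and casting." },
  { book: "Rules B", page: 7, text: "A grapple ends when..." },
];

console.log(rankPages(samplePages, ["grapple", "rules"]));
```

A raw occurrence count favors long pages that repeat a term, so a real implementation would likely normalize by page length or weight rarer keywords more heavily.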

The frontend is an HTML, CSS, and JavaScript page that uses jQuery to retrieve information from the Node.js REST API.
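As a rough sketch of what such a frontend call might look like, the example below uses jQuery's getJSON to query a search endpoint. The "/search" path, the "keywords" query parameter, and the { book, page } response shape are all assumptions made for illustration. The jQuery object is passed in as an argument rather than read from the global scope, which keeps the function easy to exercise with a stub.

```javascript
// Hypothetical sketch of the frontend call. The "/search" path, the
// "keywords" query parameter, and the response shape are assumptions.
// `$` is the jQuery object, passed in so the function is easy to stub.
function searchRules($, keywords, onResults, onError) {
  return $.getJSON("/search", { keywords: keywords.join(" ") })
    .done((results) => onResults(results))
    .fail((xhr, status) => onError(status));
}

// Turn results into plain display strings; assumes { book, page } entries.
function formatResults(results) {
  return results.map((r) => `${r.book}, page ${r.page}`);
}
```

On a page that has loaded jQuery, this could be invoked as searchRules(jQuery, ["grapple"], (r) => console.log(formatResults(r)), (status) => console.error(status)).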

Design Decisions





