Skip to content

Latest commit

 

History

History
20 lines (15 loc) · 786 Bytes

readme.md

File metadata and controls

20 lines (15 loc) · 786 Bytes

AnyStyle workflow

This repo contains a workflow for extracting bibliographic references from PDF documents using the AnyStyle extraction engine.

Setup

The workflow also relies on some Python scripts which are called from within ruby using the pycall gem. For this to work, you need a Python executable that has been compiled with the "--enable-shared" option. When installed, install the python dependencies with pip install -r pylib/requirements.txt

Here is a gist that installs the python and ruby versions used: https://gist.github.com/cboulanger/f358273bda7ca330aa77d22f656b0750

Todo:

  • use proper CLI gem
  • move Format::CSL to Model::CSL
  • autoloading classes
  • refactor Datasource classes