A tool to download and process the articles from the articles database (at the moment www.paperity.com, soon https://www.ncbi.nlm.nih.gov/pubmed/ and others).
SysReviewer downloads available .pdf from the search results.
The .pdfs are processed to .txt.
RAKE algorithm is used for .txt file to find most frequent phrases. The table of phrases and their freqencies for papers is returned.
All outputs are saved in a directory:
../Output.
The .pdf files are saved in:
../Output/Output_<searched_phrase>_pdf
The .txt files are saved in:
../Output/Output_<searched_phrase>_txt
The key phrases are saved as:
../Output/kw_<searched_phrase>.csv
Getting started:
Download the SysReviewer repository from the github. Extract in your home folder. In the directory ./python run an example from the'Usage' section.