Jump to content

Extract Keywords


Recommended Posts

image.png.91c95121d1036ced58b29a1e34b755cb.png

Extract Keywords

Extract keywords and keyphrases from articles, books or any other document with YAKE!

 

Download On Github

 

 

Usage

  • Send PDF, docx, doc, rtf or txt documents to the workflow’s File Actions
  • Pass the text from your selection in macOS on to the workflow’s Universal Action
  • Use the keyword and paste your text (default: kw

image.thumb.png.00e07e17d2ebfad19f538db5b91d0779.png

The extracted keywords are presented in a dialog.
image.thumb.png.5597a76a198f7555ff792cb37a32ffa4.png


Dependencies
The workflow relies on Python3 to install the YAKE standalone.
YAKE!


pdftotext


Stopwords

 


Yake has internal stopword handling that cannot be influenced from the command line. However, you can still define a list of words that will be flat out purged from the input text. To set up a ‘purge word’-list, create a text file named as the language identifier for a corresponding language in the workflow root folder: assets/stopwords/de.txt.


The workflow checks if the file exists and if it does, the words are removed.


The purge-word files can be quickly accessed through Alfred by prefixing the keyword with a colon (default: :kw).


image.thumb.png.5cc62e8da5e73d628aaa0cc67861ebe9.png



YAKE! is a light-weight unsupervised automatic keyword extraction method which rests on text statistical features extracted from single documents to select the most important keywords of a text.

Edited by zeitlings
Link to comment

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
×
×
  • Create New...