Skip to content

Latest commit

 

History

History
46 lines (28 loc) · 1.25 KB

README.md

File metadata and controls

46 lines (28 loc) · 1.25 KB

Spotlight

Annotating Movie Reviews with UIMA

Using UIMA to annotate names of cinematographers, actors and characters in movie reviews

Data Set

Movie review data set by Bo Pang and Lillian Lee. Can be downloaded here

Download the data files, and place the positive review files under the folder:

resources/data/text/pos/

Annotation

To execute the annotators with UIMA tools, one needs to configure a few variables regarding the location of descriptor files and other resources.

  • modelMEFilePath: location of the OpenNLP POS tag pack for English (EN)

  • sentenceMEFilePath: location of the OpenNLP sentence identification pack for English (EN)

You can edit these variables in the PersonDescriptor.xml file.

Running

You need to have UIMA properly set up in your environment.

Use use the documentAnalyser tool to annotate documents:

$UIMA_HOME/bin/documentAnalyzer.sh

And the annotationViewer to view the annotations:

$UIMA_HOME/bin/annotationViewer.sh

Building

mvn clean compile assembly:single