Commands for test run:
- cd into the directory "peg"
- run command "spark-submit --class com.test.pegasus_prediction target/scala-2.11/pegasus_prediction_2.11-1.0.jar"
- view accuracy by running the command "hdfs dfs -cat /user/ak8257/Pegasus/accuracy/*.csv"
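The three steps above can be wrapped in one shell function, sketched below. The function names `step` and `pegasus_test_run` and the `DRY_RUN` switch are hypothetical helpers introduced here for illustration and are not part of the project. The commands themselves are copied from the list above.

```shell
#!/bin/sh
# Sketch only: step, pegasus_test_run and DRY_RUN are hypothetical names,
# not part of the project.

# Print the command when DRY_RUN=1, otherwise execute it.
step() {
  if [ "${DRY_RUN:-0}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# The body runs in a subshell, so the cd does not change the caller's directory.
pegasus_test_run() (
  step cd peg || exit 1
  step spark-submit --class com.test.pegasus_prediction \
    target/scala-2.11/pegasus_prediction_2.11-1.0.jar || exit 1
  # The glob is quoted so that hdfs, not the local shell, expands it.
  step hdfs dfs -cat '/user/ak8257/Pegasus/accuracy/*.csv'
)
```

After sourcing this file from the directory that contains "peg", calling `pegasus_test_run` runs the job and prints the accuracy. Setting `DRY_RUN=1` first only lists the commands.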
Commands for generating a small dataset (WARNING: this overwrites the data previously generated for our test run. Run it only after you have completed a sample test run and reproduced the results):
- cd into directory "data-generator"
- run "sh generate_clean_data"
- run the test run commands again.
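Because the generator overwrites existing data, it may help to put an explicit confirmation in front of it. The sketch below is one possible guard. The function name `regenerate_small_dataset` and the `CONFIRM_OVERWRITE` variable are assumptions made for this example and do not exist in the project.

```shell
#!/bin/sh
# Sketch only: regenerate_small_dataset and CONFIRM_OVERWRITE are
# hypothetical names, not part of the project.
regenerate_small_dataset() {
  # Refuse to run unless the caller has explicitly accepted the overwrite.
  if [ "${CONFIRM_OVERWRITE:-no}" != "yes" ]; then
    echo "Refusing to overwrite the test-run data; set CONFIRM_OVERWRITE=yes to proceed." >&2
    return 1
  fi
  # The subshell keeps the caller's working directory unchanged.
  ( cd data-generator && sh generate_clean_data )
}
```

With this file sourced, `CONFIRM_OVERWRITE=yes regenerate_small_dataset` runs the generator. Without the variable, the function exits with an error and leaves the data untouched. The test run commands still need to be run again afterwards.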