Spark notebook

A few Spark examples in Scala, packaged as notebooks (Jupyter-style):

word count using countByValue
word count using map-reduce
compute pi
Spark SQL
ML decision tree
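
The notebooks in the repository are the reference for each example. As a rough idea of how the two word-count approaches differ, here is a minimal sketch. The input sentences and all variable names are made up for illustration, and the session setup assumes that `getOrCreate()` reuses the session the notebook already provides (outside a notebook it starts a local one).

```scala
import org.apache.spark.sql.SparkSession

// Assumption: inside the notebook a session already exists and getOrCreate()
// returns it; run as a standalone script, this starts a local session instead.
val spark = SparkSession.builder()
  .appName("word-count-sketch")
  .master("local[*]")
  .getOrCreate()
val sc = spark.sparkContext

// Hypothetical input, not taken from the notebooks.
val lines = sc.parallelize(Seq("to be or not to be", "to see or not to see"))
val words = lines.flatMap(_.split("\\s+")).filter(_.nonEmpty)

// Approach 1: countByValue is an action; it returns the counts
// to the driver as a local Map.
val countsByValue: scala.collection.Map[String, Long] = words.countByValue()

// Approach 2: classic map-reduce; pair each word with 1, then sum per key.
// The result stays distributed until collect() is called.
val countsMapReduce: Map[String, Int] =
  words.map(word => (word, 1)).reduceByKey(_ + _).collect().toMap

countsByValue.toSeq.sortBy(_._1).foreach { case (w, n) => println(s"$w: $n") }
```

`countByValue` is convenient when the set of distinct words fits in driver memory; the `reduceByKey` version keeps the counts as an RDD, so it can be filtered or saved without collecting everything first.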

Requirements

Install Docker
Install Git

Run

$ git clone https://github.com/dportabella/spark_notebook.git
$ cd spark_notebook
$ docker run -p 9001:9001 -v "$PWD":/tmp/notebook -e NOTEBOOKS_DIR=/tmp/notebook andypetrella/spark-notebook:0.7.0-scala-2.11.8-spark-2.1.0-hadoop-2.7.3

Open a web browser at http://localhost:9001 and run the examples.
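
To check that the environment works before opening the examples, a short cell like the one below can be pasted into a new notebook. It is a sketch of the usual Monte Carlo estimate of pi, not necessarily the code in the compute-pi notebook; the sample count and names are arbitrary, and the session setup assumes `getOrCreate()` picks up the session the notebook already provides.

```scala
import org.apache.spark.sql.SparkSession
import scala.util.Random

// Assumption: reuses the notebook's session if there is one,
// otherwise starts a local session.
val spark = SparkSession.builder()
  .appName("pi-sketch")
  .master("local[*]")
  .getOrCreate()

// Arbitrary sample size chosen for illustration.
val samples = 1000000

// Draw random points in the unit square and count those that fall
// inside the quarter circle of radius 1.
val inside = spark.sparkContext
  .parallelize(1 to samples)
  .filter { _ =>
    val x = Random.nextDouble()
    val y = Random.nextDouble()
    x * x + y * y <= 1.0
  }
  .count()

// Area ratio (quarter circle / unit square) is pi / 4.
val piEstimate = 4.0 * inside / samples
println(s"Pi is roughly $piEstimate")
```

The estimate varies from run to run because the points are random; more samples narrow the spread.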

Files in the repository:

Example1_word_count_using_countByValue.snb
Example2_word_count_using_map-reduce.snb
Example3_compute_pi.snb
Example4_Spark_SQL.snb
Example5_ML_decision_tree.snb
README.md
