-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathresources.html
30 lines (28 loc) · 1.17 KB
/
resources.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
---
layout: default
title: CS6240 Large-scale Parallel Data Processing
---
<div class="post">
<div>
<h5>Recommended Textbook and Materials</h5>
<p>
There is no required textbook because the
instructor provides textbook-like course material. To gain a deeper understanding of the
material covered in this course, we recommend the following books (most should be
available online for free for Northeastern University students from O’Reilly for Higher
Education):
</p>
<ul>
<li>Design Patterns by Donald Miner and Adam Shook</li>
<li>Hadoop: The Definitive Guide by Tom White</li>
<li>High Performance Spark by Holden Karau and Rachel Warren</li>
<li>Spark: The Definitive Guide by Bill Chambers and Matei Zaharia</li>
<li>Spark in Action by Petar Zecevic and Marko Bonaci</li>
<li>Programming Elastic MapReduce by Kevin Schmidt and Christopher Phillips</li>
</ul>
<p>
For some topics we will work with research papers or other online resources, e.g., the
Hadoop and Spark API doc.
</p>
</div>
</div>