Mining of Massive Datasets

mmds.org

The book is based on the Stanford Computer Science course CS246: Mining Massive Datasets (and CS345A: Data Mining).

The book, like the course, is designed at the undergraduate computer science level with no formal prerequisites. To support deeper exploration, most chapters are supplemented with references for further reading.

Chapters

Data Mining
Map-Reduce and the New Software Stack
Finding Similar Items
Mining Data Streams
Link Analysis
Frequent Itemsets
Clustering
Advertising on the Web
Recommendation Systems
Mining Social-Network Graphs
Dimensionality Reduction
Large-Scale Machine Learning

Sebastian Hanula
seba