The age of the data product -- An operating system for big data -- A framework for Python and Hadoop streaming -- In-memory computing with Spark -- Distributed analysis and patterns -- Data mining and warehousing -- Data ingestion -- Analytics with higher-level APIs -- Machine learning -- Summary : doing distributed data science.