Edition |
4th edition. |
Description |
xxv, 727 pages : illustrations ; 24 cm |
Note |
"4th Edition Revised & Updated"--Cover. |
|
"Storage and analysis at Internet scale"--Cover. |
|
Includes index. |
Contents |
Pt. I. Hadoop fundamentals -- Meet Hadoop -- MapReduce -- The Hadoop distributed filesystem -- YARN -- Hadoop I/O -- pt. II. MapReduce -- Developing a MapReduce application -- How MapReduce works -- MapReduce types and formats -- MapReduce features -- pt. III. Hadoop operations -- Setting up a Hadoop cluster -- Administering Hadoop -- pt. IV. Related projects -- Avro -- Parquet -- Flume -- Sqoop -- Pig -- Hive -- Crunch -- Spark -- HBase -- ZooKeeper -- pt. V. Case studies -- Composable data at Cerner -- Biological data science : saving lives with software -- Cascading. |
Summary |
Offers information on how to build and maintain reliable, scalable, distributed systems with Apache Hadoop covering such topics as MapReduce, HDFS, YARN, Avro for data serialization, Parquet for nested data, and data ingestion tools Flume and Sqoop. |
Subject |
Apache Hadoop.
|
|
File organization (Computer science)
|
|
Logiciels.
|
|
Bases de données.
|
|
Internet.
|
|
Apache Hadoop. (OCoLC)fst01911570
|
|
File organization (Computer science) (OCoLC)fst00924147
|
|
Hadoop. (DE-588)1022420135
|
|
NoSQL-Datenbanksystem. (DE-601)638468353
|
ISBN |
9781491901632 (paperback) |
|
1491901632 (paperback) |
|