image top
Giỏ hàng Giỏ hàng 0
Không có sản phẩm trong giỏ hàng.
Email cho bạn bè

Learning Spark Lightning-Fast Data Analytics

169,000₫
  • ✪ Miễn phí GIAO HÀNG đơn hàng từ 399.000đ
  • ✪ Giao hàng COD toàn quốc nhanh chóng từ 2 - 4 ngày
  • ✪ Giao hàng HOẢ TỐC trong nội thành Hà Nội
  • ✪ Hỗ trợ xuất hóa đơn VAT theo yêu cầu

Learning Spark Lightning-Fast Data Analytics

Sách đen trắng, Bìa nềm
 
Thể loại:Computers - Programming
 
Năm:2020
 
In lần thứ:2
 
Ngôn ngữ:english
 
Trang:300 / 399
 
 
Data is getting bigger, arriving faster, and coming in varied formats —
and it all needs to be processed at scale for analytics or machine
learning. How can you process such varied data workloads efficiently?
Enter Apache Spark.
 
Updated to emphasize new features in Spark 2.x.,
this second edition shows data engineers and scientists why structure
and unification in Spark matters. Specifically, this book explains how
to perform simple and complex data analytics and employ machine-learning
algorithms. Through discourse, code snippets, and notebooks, you’ll be
able to:
 
• Learn Python, SQL, Scala, or Java high-level APIs: DataFrames and Datasets
 
• Peek under the hood of the Spark SQL engine to understand Spark transformations and performance
 
• Inspect, tune, and debug your Spark operations with Spark configurations and Spark UI
 
• Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka
 
• Perform analytics on batch and streaming data using Structured Streaming
 
• Build reliable data pipelines with open source Delta Lake and Spark
 
• Develop machine learning pipelines with MLlib and productionize models using MLflow
 
• Use open source Pandas framework Koalas and Spark for data transformation and feature engineering