Parallel Python with Dask: Perform distributed computing, concurrent programming and manage large dataset

Book
  • Format
  • Book, paperback
  • English
  • 174 pages

Description

Unlock the Power of Parallel Python with Dask: A Perfect Learning Guide for Aspiring Data Scientists

Dask has revolutionized parallel computing for Python, empowering data scientists to accelerate their workflows. This comprehensive guide unravels the intricacies of Dask to help you harness its capabilities for machine learning and data analysis.



Across 10 chapters, you'll master Dask's fundamentals, architecture, and integration with Python's scientific computing ecosystem. Step-by-step tutorials demonstrate parallel mapping, task scheduling, and leveraging Dask arrays for NumPy workloads. You'll discover how Dask seamlessly scales Pandas, Scikit-Learn, PyTorch, and other libraries for large datasets.



Dedicated chapters explore scaling regression, classification, hyperparameter tuning, feature engineering, and more with clear examples. You'll also learn to tap into the power of GPUs with Dask, RAPIDS, and Google JAX for orders of magnitude speedups.



This book places special emphasis on practical use cases related to scalability and distributed computing. You'll learn Dask patterns for cluster computing, managing resources efficiently, and robust data pipelines. The advanced chapters on DaskML and deep learning showcase how to build scalable models with PyTorch and TensorFlow.



With this book, you'll gain practical skills to:

  • Accelerate Python workloads with parallel mapping and task scheduling
  • Speed up NumPy, Pandas, Scikit-Learn, PyTorch, and other libraries
  • Build scalable machine learning pipelines for large datasets
  • Leverage GPUs efficiently via Dask, RAPIDS and JAX
  • Manage Dask clusters and workflows for distributed computing
  • Streamline deep learning models with DaskML and DL frameworks

Packed with hands-on examples and expert insights, this book provides the complete toolkit to harness Dask's capabilities. It will empower Python programmers, data scientists, and machine learning engineers to achieve faster workflows and operationalize parallel computing.



Table of Contents

  • Introduction to Dask
  • Dask Fundamentals
  • Batch Data Parallel Processing with Dask
  • Distributed Systems and Dask
  • Advanced Dask: APIs and Building Blocks
  • Dask with Pandas
  • Dask with Scikit-learn
  • Dask and PyTorch
  • Dask with GPUs
  • Scaling Machine Learning Projects with Dask

Details
  • Language: English
  • Pages: 174
  • Publication date: 19-10-2023
  • ISBN-13: 9788119177653
  • Publisher: GitforGits
  • Format: Paperback
  • Edition: 1
Size and weight
  • Weight: 313 g
  • Depth: 0.9 cm
  • Dimensions: 19 cm × 23.5 cm
