Big data analysis with python : combine spark and python to unlock the powers of parallel computing and machine learning / Ivan Marin, Ankit Shukla, Sarang VK.

Por:

Marin, Ivan [autor]

Colaborador(es):

Tipo de material: Texto

TextoIdioma: Inglés Editor: Birmingham, UK : Distribuidor: Packt publishing, Fecha de copyright: ©2019Edición: 1ª ediciónDescripción: 256 páginas : ilustraciones, gráficas ; 23 x 19 cmTipo de contenido:

texto

Tipo de medio:

sin medio

Tipo de soporte:

volumen

ISBN:

9781789955286

Tema(s):

Clasificación LoC:

QA76 .73 .P98 M337 2019

Contenidos:

1. The Python Data Science Stack -- 2. Statistical Visualizations -- 3. Working with Big Data Frameworks -- 4. Diving Deeper with Spark -- 5. Handling Missing Values and Correlation Analysis -- 6. Exploratory Data Analysis -- 7. Reproducibility in Big Data Analysis -- 8. Creating a Full Analysis Report --

Resumen: Processing big data in real time is challenging due to scalability, information inconsistency, and fault tolerance. Big Data Analysis with Python teaches you how to use tools that can control this data avalanche for you. With this book, you'll learn practical techniques to aggregate data into useful dimensions for posterior analysis, extract statistical measurements, and transform datasets into features for other systems. The book begins with an introduction to data manipulation in Python using pandas. You'll then get familiar with statistical analysis and plotting techniques. With multiple hands-on activities in store, you'll be able to analyze data that is distributed on several computers by using Dask. As you progress, you'll study how to aggregate data for plots when the entire data cannot be accommodated in memory. You'll also explore Hadoop (HDFS and YARN), which will help you tackle larger datasets. The book also covers Spark and explains how it interacts with other tools. By the end of this book, you'll be able to bootstrap your own Python environment, process large files, and manipulate data to generate statistics, metrics, and graphs.Resumen: Procesar big data en tiempo real es un desafío debido a la escalabilidad, la inconsistencia de la información y la tolerancia a fallas. Big Data Analysis con Python le enseña cómo utilizar herramientas que pueden controlar esta avalancha de datos por usted. Con este libro, aprenderá técnicas prácticas para agregar datos en dimensiones útiles para análisis posteriores, extraer medidas estadísticas y transformar conjuntos de datos en funciones para otros sistemas. El libro comienza con una introducción a la manipulación de datos en Python usando pandas. Luego se familiarizará con el análisis estadístico y las técnicas de trazado. Con múltiples actividades prácticas preparadas, podrá analizar datos distribuidos en varias computadoras utilizando Dask. A medida que avance, estudiará cómo agregar datos para gráficos cuando no se puedan almacenar todos los datos en la memoria. También explorará Hadoop (HDFS y YARN), que le ayudará a abordar conjuntos de datos más grandes. El libro también cubre Spark y explica cómo interactúa con otras herramientas. Al final de este libro, podrá iniciar su propio entorno Python, procesar archivos grandes y manipular datos para generar estadísticas, métricas y gráficos.

Etiquetas de esta biblioteca: No hay etiquetas de esta biblioteca para este título. Ingresar para agregar etiquetas.

Existencias
Tipo de ítem	Biblioteca actual	Biblioteca de origen	Colección	Signatura topográfica	Copia número	Estado	Notas	Fecha de vencimiento	Código de barras	Reserva de ítems
Libros para consulta en sala	Biblioteca Antonio Enriquez Savignac	Biblioteca Antonio Enriquez Savignac	COLECCIÓN RESERVA	QA76 .73 .P98 M337 2019 (Navegar estantería(Abre debajo))	Ejem.1	No para préstamo (Préstamo interno)	Ingeniería Logística		043165

Total de reservas: 0

Navegando Biblioteca Antonio Enriquez Savignac estanterías, Colección: COLECCIÓN RESERVA Cerrar el navegador de estanterías (Oculta el navegador de estanterías)

Previo								Siguiente
Previo	QA 76 .73 .P98 D39 2010 Python programming for the absolute beginner /	QA 76 .73 .P98 F47 Python 3 al descubierto /	QA 76 .73 .P98 M15 2013 Python for data analysis /	QA76 .73 .P98 M337 2019 Big data analysis with python : combine spark and python to unlock the powers of parallel computing and machine learning /	QA 76 .73 .P98 M38 2019 Python crash course : a hands-on, project-based introduction to programming /	QA 76.73.P98 M85 2016 Introduction to machine learning with Python : a guide for data scientists /	QA 76 .73 .P98 R373 2019 Python machine learning : machine learning and deep learning with python, scikit-learn, and tensorflow 2 /	Siguiente

Incluye índice

1. The Python Data Science Stack --
2. Statistical Visualizations --
3. Working with Big Data Frameworks --
4. Diving Deeper with Spark --
5. Handling Missing Values and Correlation Analysis --
6. Exploratory Data Analysis --
7. Reproducibility in Big Data Analysis --
8. Creating a Full Analysis Report --

Processing big data in real time is challenging due to scalability, information inconsistency, and fault tolerance. Big Data Analysis with Python teaches you how to use tools that can control this data avalanche for you. With this book, you'll learn practical techniques to aggregate data into useful dimensions for posterior analysis, extract statistical measurements, and transform datasets into features for other systems. The book begins with an introduction to data manipulation in Python using pandas. You'll then get familiar with statistical analysis and plotting techniques. With multiple hands-on activities in store, you'll be able to analyze data that is distributed on several computers by using Dask. As you progress, you'll study how to aggregate data for plots when the entire data cannot be accommodated in memory. You'll also explore Hadoop (HDFS and YARN), which will help you tackle larger datasets. The book also covers Spark and explains how it interacts with other tools. By the end of this book, you'll be able to bootstrap your own Python environment, process large files, and manipulate data to generate statistics, metrics, and graphs.

Procesar big data en tiempo real es un desafío debido a la escalabilidad, la inconsistencia de la información y la tolerancia a fallas. Big Data Analysis con Python le enseña cómo utilizar herramientas que pueden controlar esta avalancha de datos por usted. Con este libro, aprenderá técnicas prácticas para agregar datos en dimensiones útiles para análisis posteriores, extraer medidas estadísticas y transformar conjuntos de datos en funciones para otros sistemas. El libro comienza con una introducción a la manipulación de datos en Python usando pandas. Luego se familiarizará con el análisis estadístico y las técnicas de trazado. Con múltiples actividades prácticas preparadas, podrá analizar datos distribuidos en varias computadoras utilizando Dask. A medida que avance, estudiará cómo agregar datos para gráficos cuando no se puedan almacenar todos los datos en la memoria. También explorará Hadoop (HDFS y YARN), que le ayudará a abordar conjuntos de datos más grandes. El libro también cubre Spark y explica cómo interactúa con otras herramientas. Al final de este libro, podrá iniciar su propio entorno Python, procesar archivos grandes y manipular datos para generar estadísticas, métricas y gráficos.

Haga clic en una imagen para verla en el visor de imágenes

PARA TODA NECESIDAD SIEMPRE HAY UN LIBRO

Big data analysis with python : combine spark and python to unlock the powers of parallel computing and machine learning / Ivan Marin, Ankit Shukla, Sarang VK.

Navegando Biblioteca Antonio Enriquez Savignac estanterías, Colección: COLECCIÓN RESERVA Cerrar el navegador de estanterías (Oculta el navegador de estanterías)