Data Analist naar Data Scientist - Deel 1 Data Analist
Vanaf € 361,79 € 299,00
Data Analist naar Data Scientist - Deel 1 Data Analist
Vanaf € 361,79 € 299,00
Productinformatie
Dit is deel 1 van het leerpad Data Analist naar Data Scientist.
In dit deel wordt gefocust op de vaardigheden en kennis die je nodig hebt als Data Analist; het verzamelen en controleren van gegevens om deze te verwerken tot informatie. Deze informatie wordt vervolgens geanalyseerd en omgezet in kennis.
In dit deel leer je hoe je Python en Microsoft R inzet om data te analyseren. Naast deze talen leer je ook meer over Hadoop en MongoDB en ga je aan de slag met Data Silo's.
Je vindt hier verschillende cursussen die je voorbereiden om aan de slag te gaan als Data Analist. Daarnaast is er een livelab beschikbaar om te oefenen. Je sluit dit deel af met een examen.
Inhoud van de training
Data Analist naar Data Scientist - Deel 1 Data Analist
Data Architecture Primer
Data Engineering Fundamentals
Data engineering is the area of data science that focuses on practical applications of data collection and analysis. In this course, you will explore distributed systems, batch vs. in-memory processing, NoSQL uses, and the various tools available for data management/big data and the ETL process.
Python for Data Science: Introduction to NumPy for Multi-dimentional Data
Python for Data Science: Advanced Operations with NumPy Arrays
Python for Data Science: Introduction to Pandas
Python for Data Science: Manipulating and Analyzing Data in Pandas DataFrames
R for Data Science: Data Structures
R for Data Science: Importing and Exporting Data
R for Data Science: Data Exploration
R for Data Science: Regression Methods
Discover how to apply regression
R for Data Science: Classification & Clustering
Data Science Statistics: Simple Descriptive Statistics
Explore the two most basic types of descriptive statistics, measures of central tendency and dispersion. Examine the most common measures of each type, as well as their strengths and weaknesses.
Data Science Statistics: Common Approaches to Sampling Data
The goal of all modeling is generalizing as well as possible from a sample to the population as a whole. Explore the first step in this process, obtaining a representative sample from which meaningful generalizable insights can be obtained.
Data Science Statistics: Inferential Statistics
Inferential statistics go beyond merely describing a dataset and seek to posit and prove or disprove the existence of relationships within the data. Explore hypothesis testing, which finds wide applications in data science.
Accessing Data with Spark: An Introduction to Spark
Getting Started with Hadoop: Fundamentals & MapReduce
Apache Hadoop is a collection of open-source software utilities that facilitates solving data science problems. In this course, you will explore the theory behind big data analysis using Hadoop and how MapReduce enables the parallel processing of large datasets distributed on a cluster of machines.
Getting Started with Hadoop: Developing a Basic MapReduce Application
Getting Started with Hadoop: Developing a Basic MapReduce Application
Hadoop HDFS: Introduction
HDFS is the file system which enables the parallel processing of big data in distributed cluster. Explore the concepts of analyzing large datasets and explore how Hadoop and HDFS make this process very efficient.
Hadoop HDFS: Introduction to the Shell
Discover how to set up a Hadoop Cluster on the cloud and explore the bundled web apps - the YARN Cluster Manager app and the HDFS NameNode UI. Then use the hadoop fs and hdfs dfs shells to browse the Hadoop file system.
Hadoop HDFS: Working with Files
Explore the Hadoop file system using the HDFS dfs shell and perform basic file and directory-level operations. Transfer files between a local file system and HDFS and explore ways to create and delete files on HDFS.
Hadoop HDFS: File Permissions
HDFS is the file system which enables the parallel processing of big data in distributed cluster. When managing a data warehouse, not all users should be given free reign over all the datasets. Explore how file permissions can be viewed and configured in HDFS. The NameNode UI is used to monitor and explore HDFS.
Data Silos, Lakes, & Streams: Introduction
Traditional data warehousing is transitioning to be more
- cloud-based and this can be a key area that must be mastered for
- data science. In this course you will examine the organizational
- implications of data silos and explore how data lakes can help make
- data secure, discoverable, and queryable. Discover how data lakes
- can work with batch and streaming data.
Data Silos, Lakes, and Streams: Data Lakes on AWS
Traditional data warehousing is transitioning to be more
- cloud-based and this can be a key area that must be mastered for
- data science. In this course, you will discover how to build a data
- lake on the AWS cloud by storing data in S3 buckets and indexing
- this data using AWS Glue. Explore how to run crawlers to
- automatically crawl data in S3 to generate metadata tables in
- Glue.
Data Silos, Lakes, & Streams: Sources, Visualizations, & ETL Operations
Traditional data warehousing is transitioning to be more
- cloud-based and this can be a key area that must be mastered for
- data science. In this course, you will discover how to configure
- Glue crawlers to work with different data stores on AWS. Examine
- how to visualize the data stored in the data lake with AWS
- QuickSight and how to perform ETL operations on the data using Glue
- scripts.
Data Analysis Application
Discover how to perform data analysis using Anaconda Python, R, and related analytical libraries and tools.
Data Science Fundamentals for Python and MongoDB
Beginning Apache Spark 2: With Resilient Distributed Datasets, Spark SQL, Structured Streaming and Spark Machine Learning Library
Big Data and Hadoop: Learn by Example
Pro Hadoop Data Analytics: Designing and Building Big Data Systems using the Hadoop Ecosystem
Practical Enterprise Data Lake Insights: Handle Data-Driven Challenges in an Enterprise Big Data Lake
Enterprise Big Data Engineering, Analytics, and Management
Final Exam: Data Analyst
Final Exam: Data Analyst will test your knowledge and application of the topics presented throughout the Data Analyst track of the Skillsoft Aspire Data Science Journey.
Kenmerken
Meer informatie
Extra product informatie | 0 |
---|---|
Doelgroep | Data-analist |
Voorkennis | Goede analytische skills en basiskennis over statistieken is handig. |
resultaat | Je hebt de handvaten om aan de slag te gaan als Data analist, naast Python en Microsoft R ben je ook in staat om te werken met Hadoop, MongoDB en Data Silo's. |