Training: Statistische analyse en modellering in R.

Data Visualisatie

15 uur

Engels (US)

Training: Statistische analyse en modellering in R.

Data Visualisatie

15 uur

Engels (US)

Statistische analyse en modellering in R.

Productinformatie

Statistieken zijn een belangrijke element bij gegevensanalyse en -modellering. In deze training leer je om verschillende dynamische statistische hulpmiddelen te gebruiken om je gegevens te verkennen en te begrijpen. Je onderzoekt populatieverdelingen om willekeurige variabelen te modelleren, werkt met discrete en continue kansverdelingen en gebruikt discrete kansverdelingstypen, zoals de uniforme, binominale en Poisson-verdelingen. Vervolgens leer je om de nul- en alternatieve hypothesen voor een tests op te stellen en om de resultaten te interpreteren. Je onderzoekt de meest gebruikte statistische toetsen, de T-toets en variantieanalyse (ANOVA).

Regressiemodellen worden gebruikt om continue waarden te voorspellen en zijn de meest gebruikte machine learning-modellen. In deze training leer je te begrijpen wat algoritmen voor machine learning (ML) precies zijn en hoe je ML-modellen kunt gebruiken om resultaten te voorspellen op basis van invoergegevens. Je leert logistische regressie en beslissingsbomen te gebruiken om classificatiemodellen te bouwen en je modellen te evalueren met behulp van nauwkeurigheid, precisie en terugroepactie. Clustering is een leeralgoritme zonder toezicht dat zelf patronen in gegevens ontdekt en helpt bij het identificeren van logische groeperingen. Je leert onderscheid te maken tussen begeleid en niet-gesuperviseerd leren én je herkent hoe regressie- en classificatie-algoritmen verschillen van clustering.

Ten slotte leer je de afweging tussen bias en variantie te begrijpen, waarmee datawetenschappers generaliseerbare modellen kunnen bouwen die goed presteren op testgegevens.

Inhoud van de training

Statistische analyse en modellering in R.

15 uur

Statistical Analysis and Modeling in R: Working with Probability Distributions

Interpreting data is a core pre-processing step in data analysis and modeling. Use this course to practice using various dynamic statistical tools to explore and understand your data.
During this course, you'll explore population distributions to model random variables, work with discrete and continuous probability distributions, and use discrete probability distribution types, such as the uniform, binomial, and Poisson distributions.
You'll also examine continuous distributions, such as the normal and the exponential distributions.
You'll round the course off by learning how to read and interpret QQ plots, which can be used to compare the distributions of two samples of data.
When you're finished, you'll be able to use probability distributions to model events and understand your data.

Statistical Analysis and Modeling in R: Understanding & Interpreting Statistical Tests

Statistical analysis involves making educated guesses known as hypotheses and testing them to see if they hold up. Use this course to learn how to apply hypothesis testing to your data.
Examine the use of descriptive statistics to summarize data and inferential statistics to draw conclusions. Learn how population parameters differ from summary statistics and how confidence intervals are used.
Discover how to perform hypothesis testing on sample data, construct null and alternative hypotheses, and interpret the results of your statistical tests.
Investigate the significance of the p-value of a statistical test and how it can be interpreted using the significance threshold or alpha level.
Additionally, examine the most commonly used statistical tests, the T-test and the analysis of variance (ANOVA).
When you're done, you'll have the confidence to set up the null and alternative hypotheses for your tests and interpret the results.

Statistical Analysis and Modeling in R: Statistical Analysis on Your Data

Hypothesis testing determines whether the educated guesses you've made about your data should be accepted or rejected. T-tests and ANOVA tests are some of the most commonly used methods in hypothesis testing. Knowing how to perform and interpret these tests are core data scientist skills.
In this course, get hands-on running statistical tests on your sample data. Test assumptions made by statistical tests, run T-tests, perform ANOVA analysis, and interpret the results.
Perform the one-sample t-test and the one-sample Z-test. Run the two-sample t-test to compare data from different groups or categories and the paired samples t-test to compare data across time.
When you're finished with this course, you'll have the know-how to run and interpret statistical tests using the R programming language.

Statistical Analysis and Modeling in R: Performing Regression Analysis

Regression models are used to predict continuous values and are some of the most commonly used machine learning models.
Use this course to grasp what exactly machine learning (ML) algorithms are and how you can use ML models to predict outcomes based on input data.
Learn how regression models work, train them, and evaluate regression results using metrics such as R2 and RMSE. Perform regression analysis in R using the ordinary least squares regression. Build models using simple and multiple regression and train a regression model using cross-validation.
Upon completing this course, you'll be able to perform regression to predict continuous values and evaluate these models using metrics such as the R2 and adjusted R2.

Statistical Analysis and Modeling in R: Performing Classification

Classification models are used to classify or categorize data points into two or more categories. Learn how these models work and how you can evaluate your classification models using the confusion matrix and metrics such as accuracy, precision, and recall.
During this course, you'll perform classification using both logistic regression and an imbalanced dataset. You'll also examine why precision or recall scores may be better metrics than accuracy to evaluate such models.
Furthermore, build a classification model using decision trees, visualize the tree structure, and explore the variable importance assigned by this tree structure to understand and interpret the model.
When you've finished this course, you'll be able to confidently use logistic regression and decision trees to build classification models and evaluate your models using accuracy, precision, and recall.

Statistical Analysis and Modeling in R: Performing Clustering

Clustering is an unsupervised learning algorithm that self-discovers patterns in data and helps identify logical groupings. Use this course to distinguish between supervised and unsupervised learning and recognize how regression and classification algorithms differ from clustering.
Examine the basic principles of clustering models and how k-means clustering finds logical groupings in your data. Learn the evaluation techniques used in clustering and find the optimal number of clusters in your data using both the elbow method and the Silhouette score.
Perform clustering on a dataset with multiple attributes and visualize clusters in your data using principal components.
When you've completed this course, you'll be able to find groupings in your data using k-means clustering and compute the optimal number of clusters for your data.

Statistical Analysis and Modeling in R: Building Regularized Models & Ensemble Models

Understanding the bias-variance trade-off allows data scientists to build generalizable models that perform well on test data. Machine learning models are considered a good fit if they can extract general patterns or dominant trends in the training data and use these to make predictions on unseen instances.
Use this course to discover what it means for your model to be a good fit for the training data. Identify underfit and overfit models and what the bias-variance trade-off represents in machine learning.
Mitigate overfitting on training data using regularized regression models, train and evaluate models built using ridge regression, lasso regression, and ElasticNet regression, and implement ensemble learning using the random forest model.
When you're done with this course, you'll have the skills and knowledge to train models that learn general patterns using regularized models and ensemble learning.

Kenmerken

Docent inbegrepen

Bereidt voor op officieel examen

Engels (US)

15 uur

Data Visualisatie

90 dagen online toegang

HBO

Meer informatie

Doelgroep	Data-analist
Voorkennis	Geen voorkennis vereist. Ervaring met statistische analyses is aan te raden.
Resultaat	Na het volgen van deze training kan je met behulp van verschillende dynamische statistische tools je data beter begrijpen.

Positieve reacties van cursisten

Training: Leidinggeven aan de AI transformatie

Nuttige training. Het bestelproces verliep vlot, ik kon direct beginnen.

- Mike van Manen

Onbeperkt Leren Abonnement

Onbeperkt Leren aangeschaft omdat je veel waar voor je geld krijgt. Ik gebruik het nog maar kort, maar eerste indruk is goed.

- Floor van Dijk

Al jaren is icttrainingen.nl onze trouwe partner op het gebied van kennisontwikkeling voor onze IT-ers. Wij zijn blij dat wij door het platform van icttrainingen.nl maatwerk en een groot aanbod aan opleidingen kunnen bieden aan ons personeel.

- Loranne, Teamlead bij Inwork

Winkelwagen

Training: Statistische analyse en modellering in R.

Productinformatie

Inhoud van de training

Statistische analyse en modellering in R.

Statistical Analysis and Modeling in R: Working with Probability Distributions

Statistical Analysis and Modeling in R: Understanding & Interpreting Statistical Tests

Statistical Analysis and Modeling in R: Statistical Analysis on Your Data

Statistical Analysis and Modeling in R: Performing Regression Analysis

Statistical Analysis and Modeling in R: Performing Classification

Statistical Analysis and Modeling in R: Performing Clustering

Statistical Analysis and Modeling in R: Building Regularized Models & Ensemble Models

Kenmerken

Meer informatie

Bekijk meer

Contact

Training: Statistische analyse en modellering in R.

Productinformatie

Inhoud van de training

Statistische analyse en modellering in R.

Statistical Analysis and Modeling in R: Working with Probability Distributions

Statistical Analysis and Modeling in R: Understanding & Interpreting Statistical Tests

Statistical Analysis and Modeling in R: Statistical Analysis on Your Data

Statistical Analysis and Modeling in R: Performing Regression Analysis

Statistical Analysis and Modeling in R: Performing Classification

Statistical Analysis and Modeling in R: Performing Clustering

Statistical Analysis and Modeling in R: Building Regularized Models & Ensemble Models

Kenmerken

Meer informatie

Inloggen