Machine learning = field of study that gives computers the ability to learn without being
explicitly programmed.
Types of machine learning systems: see the additions to the ISLR Chapter 2 notes.
Main Challenges of Machine Learning:
Generalization
Generalization problems can be caused by sampling bias or overfitting.
It is crucial to use a training set that is representative of the cases you want to generalize
to. This is often harder than it sounds: if the sample is too small, you will have sampling
noise (i.e., nonrepresentative data as a result of chance, outliers, or data errors).
However, even very large samples can be nonrepresentative if the sampling method is
flawed. This is called sampling bias.
Regularization: Constraining a model to make it simpler and reduce the risk of overfitting.
The amount of regularization to apply during learning can be controlled by a
hyperparameter. A hyperparameter is a parameter of the learning algorithm (not of the
model).
- It must be set prior to training and remains constant during training.
- If you set the regularization hyperparameter to a very large value, you will get an
almost flat model (a slope close to zero)
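The effect of the regularization hyperparameter can be sketched with scikit-learn's Ridge regressor (the data below is made up for illustration): as `alpha` grows, the learned slope is pushed toward zero, producing the almost-flat model described above.

```python
# Sketch: a very large regularization hyperparameter (alpha) flattens the model.
# Synthetic 1-D data with a true slope of 3.
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(42)
X = rng.uniform(0, 10, size=(100, 1))
y = 3.0 * X[:, 0] + rng.normal(0, 1, size=100)  # true slope is 3

slopes = {}
for alpha in (0.001, 1.0, 1e6):
    slopes[alpha] = Ridge(alpha=alpha).fit(X, y).coef_[0]
    print(f"alpha={alpha:>9}: slope={slopes[alpha]:.4f}")
# With alpha=1e6 the fitted slope is close to zero: an almost flat model.
```

Note that `alpha` is set before training and stays fixed, while the slope is a model parameter that the algorithm estimates, which is exactly the distinction drawn next.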
Hyperparameter vs Parameter
- A model parameter is
o estimated during model training.
o internally optimized
- A hyperparameter must be
o specified before model training.
o optimized externally.
Concept drift: happens when the relationship the model has estimated changes after the
model is trained, because the underlying real-world circumstances change.
Testing and Validating:
A better option than testing directly on new data is to split your data into two sets: the
training set and the test set, which lets you estimate performance before moving to
actual practice. As these names imply, you train your model using the training set,
and you test it using the test set.
- It is common to use 80% of the data for training and hold out 20% for testing.
However, this depends on the size of the dataset: with millions of instances, holding
out even 1% can be enough for testing.
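The 80/20 split can be sketched with scikit-learn's `train_test_split` (the data below is synthetic; the fixed `random_state` is an assumption added for reproducibility):

```python
# Sketch: the conventional 80/20 train/test split on synthetic data.
import numpy as np
from sklearn.model_selection import train_test_split

X = np.arange(1000).reshape(-1, 1)  # 1,000 example feature rows
y = np.arange(1000)                 # matching labels

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42  # hold out 20% for testing
)
print(len(X_train), len(X_test))  # 800 200
```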
The error rate on new cases is called the generalization error (or out-of-sample error), and
by evaluating your model on the test set, you get an estimate of this error.
Hyperparameter Tuning and Model Selection
Suppose you are hesitating between two types of models (say, a linear model and a
polynomial model): how can you decide between them?
When you want to compare just two models: train both on the same training data and
compare their generalization performance on the test data.
When you want to find the best-performing hyperparameter among, say, 100 options, you
cannot do the same.
- If you measure the generalization error multiple times on the test set, you
adapt the model and its hyperparameters to produce the best model for that
particular set, so it won't perform as well on new data.
A common solution to this problem is called holdout validation: you simply hold out part
of the training set to evaluate several candidate models and select the best one. The new
held-out set is called the validation set (or sometimes the development set, or dev set).
Process
1. You train multiple models with various hyperparameters on the reduced training
set.
2. You select the model that performs best on the validation set (holdout validation
process)
a. If the selected model performs poorly on the validation set, it has likely
overfit the training set, so you should try to simplify or regularize the
model, get more training data, or clean up the training data.
3. You train the best model on the full training set, including the validation set
4. Test the generalization error on the test set.
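The four steps above can be sketched with scikit-learn (the data and the candidate `alpha` values are made up for illustration; `Ridge` stands in for any family of candidate models):

```python
# Sketch of holdout validation: tune on a validation set, retrain the winner on
# the full training set, then measure generalization error once on the test set.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

rng = np.random.default_rng(0)
X = rng.normal(size=(500, 5))
y = X @ np.array([1.0, -2.0, 0.5, 0.0, 3.0]) + rng.normal(0, 0.5, size=500)

# Hold out a test set, then carve a validation set out of the training set.
X_train_full, X_test, y_train_full, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(
    X_train_full, y_train_full, test_size=0.25, random_state=0)

# Steps 1-2: train candidates on the reduced training set,
# select the one that performs best on the validation set.
candidates = [0.01, 0.1, 1.0, 10.0, 100.0]
best_alpha = min(candidates, key=lambda a: mean_squared_error(
    y_val, Ridge(alpha=a).fit(X_train, y_train).predict(X_val)))

# Step 3: retrain the best model on the full training set (validation set included).
final_model = Ridge(alpha=best_alpha).fit(X_train_full, y_train_full)

# Step 4: estimate the generalization error once, on the test set.
test_mse = mean_squared_error(y_test, final_model.predict(X_test))
print("best alpha:", best_alpha)
print("test MSE:", test_mse)
```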
The validation set should not be too small: model evaluations will be imprecise.
It should not be too large either: the remaining training set would be much smaller,
which distorts the comparison, since the final model is trained on the full training set.
One way to solve this problem is cross-validation, which uses many small validation
sets. Each model is evaluated once per validation set after it is trained on the rest of
the data. By averaging all the evaluations of a model, you get a much more accurate
measure of its performance.
- It also means that training time is multiplied by the number of validation sets.
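Cross-validation can be sketched with scikit-learn's `cross_val_score` (synthetic data; `cv=5` folds is an arbitrary choice for illustration):

```python
# Sketch of 5-fold cross-validation: the model is trained 5 times, each time
# evaluated on a different held-out fold, and the 5 scores are averaged.
import numpy as np
from sklearn.linear_model import Ridge
from sklearn.model_selection import cross_val_score

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 3))
y = X @ np.array([2.0, -1.0, 0.5]) + rng.normal(0, 0.3, size=200)

scores = cross_val_score(Ridge(alpha=1.0), X, y, cv=5,
                         scoring="neg_mean_squared_error")
print("per-fold MSE:", -scores)       # one evaluation per validation set
print("mean MSE:", -scores.mean())    # averaged: a more stable estimate
# The cost: training time is multiplied by the number of folds (here, 5).
```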
No Free Lunch Theorem
David Wolpert demonstrated that if you make absolutely no assumption about the data,
then there is no reason to prefer one model over any other. This is called the No Free
Lunch (NFL) theorem.