College aantekeningen

2024 Machine Learning Notes Highlights(full))

Name: 2024 Machine Learning Notes Highlights(full))
SKU: doc_4455259
Rating: 1.00 (2 reviews)
Author: thaboty

2 beoordelingen

2 keer verkocht

Vak
2103TEWDAS

Instelling
Universiteit Antwerpen (UA)

I achieved a score of 18 out of 20, the greatest distinction, in the 'Machine Learning' course in 2024. This success is attributed to the systematic study material I authored on my own. It includes chapter highlights, detailed explanations of key concepts, and, most significantly, clarifications on...

[Meer zien]

Voorbeeld 4 van de 81 pagina's

Bekijk voorbeeld

Geupload op 8 februari 2024
Aantal pagina's 81
Geschreven in 2023/2024
Type College aantekeningen
Docent(en) David martens
Bevat Alle colleges

machine learning
data mining
python
decision tree
random tree
svm
ann
adaboost
coding
metrics
generalisation
overfitting
data science principles
data science
auc

2 beoordelingen

Door: achrafledou3 • 1 maand geleden

Door: UATEWBEDRIJFSKUNDE • 4 maanden geleden

Volgen

thaboty Lid sinds 1 jaar 5 documenten verkocht

€9,99

Ook beschikbaar in voordeelbundel v.a. €11,49

In winkelwagen

Op verlanglijstje

100% tevredenheidsgarantie
Direct beschikbaar na je betaling
Lees online óf als PDF
Geen vaste maandelijkse kosten

Ook beschikbaar in voordeelbundel (1)

2023- 2024 Machine Learning& Data Ethics

€ 14,98 € 11,49

2x verkocht

2 items

1. Samenvatting - Antwerpe 2024 "data science ethics" mindmap
2. College aantekeningen - 2024 machine learning notes highlights(full))
Meer zien

Dit document is ook in delen beschikbaar:

2024 Machine Learning Notes Highlights (first part)

(0)

€6,39

0x verkocht

I achieved a score of 18 out of 20, the greatest distinction, in the 'Machine Learning' course in 2024. This success is attributed to the systematic study material I authored on my own. In the first part, it contains chapter 1 to 5, covering the CRISP framework, decision tree, overfitting, ROC/AUC&Profit curves, and Bayes, with a meticulously made navagation pane.

i x

College aantekeningen
• 42 pagina's •
door thaboty •
geupload 2024

i x

Universiteit Antwerpen • 2103TEWDAS

2024 Machine Learning Notes Highlights (second part)

(0)

€6,39

0x verkocht

I achieved a score of 18 out of 20, the greatest distinction, in the 'Machine Learning' course in 2024. This success is attributed to the systematic study material I authored on my own. In the second part, it contains chapter 6 to 9, covering the KNN, clustering, recommendation system, ANN, text mining, etc. with a meticulously made navagation pane.

i x

College aantekeningen
• 39 pagina's •
door thaboty •
geupload 2024

i x

Universiteit Antwerpen • 2103TEWDAS

10.3 Lec2 CRISP-framework
Significant point of this Lec (SP):
• The difference between explanatory modeling and predictive modeling; pg4-19
1. Different goal
2. Different evaluation
3. Different modeling paths
• [NB]Data preprocessing:
1. its motivation or reason: why we should do it; pg21
2. what should we do and how (sampling, encoding, missing values,
outliers, normalizing…) ;pg22-46
• The difference between types of variables

Highlight:
• Slides: pg4-19, pg 21, pg 22-46
• Books:

Info:
11.15 visit AXA, register first; ?? TBC project upload; 12.12 ceremony competition

1 The difference between Exp vs Pre modeling
Goals!DEFINITION"
• Explanatory modeling: Theory-based, statistical testing of causal hypothesis
• Predictive modeling: Data science methods to make predictions
Evaluation
• Explanatory modeling: Strength of relationship in statistical model

1

, • Predictive modeling: Ability to accurately predict new observations

Modeling path: (17:00--)
• Data collection\ data preparation\data partitioning (important! Next week)
1. Data collection, similar
2. Data preparation, facing data missing—explanatory modeling can throw it
away; but for predictive modeling, it’ll be a problem.
3. Data partitioning, not important for explanatory but super important for
predictive modeling. (more info:
https://www.cockroachlabs.com/blog/what-is-data-partitioning-and-
how-to-do-it-right/)
• About the choice of variables:
1. for explanatory modeling, operationalized variables serve as practical
instruments for investigating the underlying conceptual constructs and
the relationships between them. For example, a questionnaire designed
to assess a person's level of depression (the construct) by asking about
their feelings and behaviors is a practical instrument. The term is used
often in the social sciences because scientists in that field have to spend
so much time creating and validating their constructs of interest, just to
be able to measure for them.)
2. for predictive modeling, the variables can be way broad, hundreds to
thousands, of course those should be available at first.
Notable words:
collaborative filtering models

2

, Definition: Collaborative filtering filters information by using the
interactions and data collected by the system from other users. It’s
based on the idea that people who agreed in their evaluation of certain
items are likely to agree again in the future. The algorithm supports
recommended system, for example, Taobao , amazon, netflix #$%
&'

Other differences:

• Explaining does not necessarily lead to predictions: variables nor present.
• Multicollinearity is a problem in explanatory model but not usually in predictive
modeling. Multicollinearity will not affect the ability of the model to predict. (A
websites clearify this: https://hackernoon.com/multicollinearity-and-its-
importance-in-machine-learning)
• Method:
1. explanatory—interpretable statistical method;
2. predictive—accurate machine learning method.
• Validation:
1. Model fit and R2. R2 is a measure of the goodness of fit of a model.[11] In
regression, the R2 coefficient of determination is a statistical measure of how
well the regression predictions approximate the real data points. An R2 of 1
indicates that the regression predictions perfectly fit the data.
2. Generalisation and accuracy.
• Y=f(X), to explain, test a given f; to predict, find f.

3

, 2 Data preprocessing
• Motivation and reason: dirty and noisy data, inconsistent data, incomplete data…
1-Data preprocessing. Sampling:
1. Definition: Select a suitable or representative sample to determine the
parameters and characteristics of the whole population.
2. Reason: economic, time, large and partly accessible population, computation
power.
3. How to do sampling and things to avoid: stratified sampling ()*+,
timing!many data vs recent/relevant data",avoid seasonality
effects(sales during summer and winter).
2-Data preprocessing. Encoding:
1. Encoding is the process of converting categorical data into a numerical
format that machine learning algorithms can understand.
2. Encoding vs Normalizing

4

Dit zijn jouw voordelen als je samenvattingen koopt bij Stuvia:

Bewezen kwaliteit door reviews

Studenten hebben al meer dan 850.000 samenvattingen beoordeeld. Zo weet jij zeker dat je de beste keuze maakt!

In een paar klikken geregeld

Geen gedoe — betaal gewoon eenmalig met iDeal, Bancontact of creditcard en je bent klaar. Geen abonnement nodig.

Focus op de essentie

Studenten maken samenvattingen voor studenten. Dat betekent: actuele inhoud waar jij écht wat aan hebt. Geen overbodige details!

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper thaboty. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €9,99. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews)

Afgelopen 30 dagen zijn er 68175 samenvattingen verkocht

Opgericht in 2010, al 15 jaar dé plek om samenvattingen te kopen

Start met verkopen

College aantekeningen

2024 Machine Learning Notes Highlights(full))

Document informatie

Onderwerpen

Geschreven voor

2 beoordelingen

Verkoper

Ontvangen beoordelingen

Voorbeeld van de inhoud

Dit zijn jouw voordelen als je samenvattingen koopt bij Stuvia:

Bewezen kwaliteit door reviews

In een paar klikken geregeld

Focus op de essentie

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Tevredenheidsgarantie: hoe werkt dat?

Van wie koop ik deze samenvatting?

Zit ik meteen vast aan een abonnement?

Is Stuvia te vertrouwen?