Samenvatting

Computer Vision & 3D Image Processing (5LSH0) lecture slides summary

Name: Computer Vision & 3D Image Processing (5LSH0) lecture slides summary
SKU: doc_930034
Rating: 5.00 (3 reviews)
Author: jarllemmens

3 beoordelingen

139 keer bekeken 6 keer verkocht

Instelling
Technische Universiteit Eindhoven (TUE)

Extensive summary (86 pages) of the Computer Vision & 3D Image Processing (5LSH0) course lecture slides. - Module 1: Feature Extraction and Matching - Module 2: Classification – clustering - Module 3: Classification – supervised - Module 4: Introduction to Deep Learning ...

[Meer zien]

Laatste update van het document: 3 jaar geleden

Voorbeeld 4 van de 87 pagina's

Bekijk voorbeeld

Geupload op 30 december 2020
Bestand laatst geupdate op 26 januari 2021
Aantal pagina's 87
Geschreven in 2020/2021
Type Samenvatting

classification clustering
classification supervised
intro deep learning
deep learning features
feature extraction matching
classification using convolutional neural networks
object detection using dee

3 beoordelingen

Door: kw1 • 3 jaar geleden

Door: brechtvangils • 3 jaar geleden

Door: JohnSengers • 3 jaar geleden

Volgen

jarllemmens Lid sinds 4 jaar 8 documenten verkocht

€8,49

Toegevoegd

In winkelwagen Op verlanglijstje

100% tevredenheidsgarantie
Direct beschikbaar na betaling
Zowel online als in PDF
Je zit nergens aan vast

Computer Vision & 3D Image Processing
5LSH0 Lecture Summary
2020-2021

Jarl Lemmens
j.l.a.lemmens@student.tue.nl

,Module 0 – Introduction
Some cool applications of the VCA group:

- Create a 3D model of complex spaces, with multiple levels and hallways.
- Create 3D models of extremely large spaces such as a cathedral.
- Person / object re-identification with multiple cameras so the trajectory can be estimated.
- Synthesizing traffic signs from street-view imagery, by generating realistic examples and
retrain detectors.
- Accurate localization by image matching (take a picture somewhere, compare it with a
database of city pictures, and match features to an exact location).

A computer receives an image as a large matrix of (RGB) values. The goal of computer
vision is to make the computer understand what can be seen in these values. Which set of
values correspond to a certain object or activity, and from which position did the camera
capture that environment.

Some computer vision application examples:
- Optical character recognition (OCR), which converts scanned docs to text. Think of the
scanning option in Google translate.
- Object detection. Detect faces, humans, cars or any other object of interest in an image.
- Activity detection and classification. Detect when fights, burglary or other abnormal
behavior occurs in surveillance images.
- Guiding doctors in diagnosis, therapy and surgery. For example a network that ‘reads’ an
image of a melanoma and outputs whether it is malignant (bad) or benign (not so bad).
- Allow robots to see and interpret its surroundings.
- Enable autonomous driving, by detecting driving lanes, traffic signs, other vehicles etc.
- Special effects: motion capture. (use a human face to capture facial motions and translate
these to an animal’s face)
- 3D modeling of environment. For example the 3d option in Google Maps.

Overview of the topics:
- Module 1: Feature Extraction and Matching p.3
- Module 2: Classification – clustering p.13
- Module 3: Classification – supervised p.17
- Module 4: Introduction to Deep Learning p.22
- Module 5: Classification Using Convolutional Neural Networks p.28
- Module 6: Object Detection using Deep Learning p.44
- Module 7: Object tracking p.47
- Module 8: Person re-id p.52
- Module 9: Camera model, Projection matrix and 3D Geometry p.57
- Module 10: 3D Reconstruction, data fusion and SLAM p.64
- Module 11: Structure from Motion p.77
- Module 12: Segmentation using Convolutional Neural Networks p.80
- Module 13: Behavior Analysis p.84

,Module 1 – Feature Extraction and Matching
Color spaces

The most common color system is the RGB system (red-green-blue), however, other
systems exist as well.

HSV – Hue, Saturation, Value (also called intensity)
CMYK – Cyan, Magenta, Yellow, Key (which is black)
YUV – Luma (brightness), and Chrominance (color)

These different color spaces can be converted from and to each other. For example RGB to
HSV (which might be the most used conversion of them all):

𝑉 = max(𝑅, 𝐺, 𝐵)
max(𝑅, 𝐺, 𝐵) − min(𝑅, 𝐺, 𝐵)
𝑆=
max(𝑅, 𝐺, 𝐵)
(𝐺 − 𝐵)
0+ , 𝑖𝑓 max(𝑅, 𝐺, 𝐵) = 𝑅
max(𝑅, 𝐺, 𝐵) − min(𝑅, 𝐺, 𝐵)
(𝐵 − 𝑅)
𝐻 = 60 × 2 + , 𝑖𝑓 max(𝑅, 𝐺, 𝐵) = 𝐺
max(𝑅, 𝐺, 𝐵) − min(𝑅, 𝐺, 𝐵)
(𝑅 − 𝐺)
4+ , 𝑖𝑓 max(𝑅, 𝐺, 𝐵) = 𝐵
{ max(𝑅, 𝐺, 𝐵) − min(𝑅, 𝐺, 𝐵)

Hue is a value between 0 and 360, so if the equation returns greater than 360, or smaller
than 0, add or subtract 360 until it is within the correct range.

Feature points

Feature points are used for
image alignment (think of
panorama images), 3D
reconstruction, motion
tracking, object recognition,
image matching and retrieval,
robot navigation and more.

, Feature points, feature descriptors, or just features are small parts of information about an
image. This can be a mathematical operation, or a structure like edges/shapes etc. This
converts an image into an efficient vector description. A good feature is invariant to
transformations. (Invariance = The property of remaining unchanged regardless of changes
in the conditions of measurement).
Geometric invariance: translation, rotation, scale..
Photometric invariance: brightness, exposure..

Features should be/have:
- Discriminative: should be able to attenuate important nuances
- Descriptive power: allow rich descriptions
- Sufficient in quantity: hundreds or thousands in one image.
- Relatively low computation cost: real-time performance should be achievable
- Generality: exploit features in various images types.

Canny Edge Detector

The 3 main objectives for edge detection:
- Optimal edge pixel detection without false edges
(reduce noise responses).
- Good localization of the edges (minimal error distance).
- Single response per edge (one pixel (width) per edge).

A canny detector contains 4 steps.

1) Remove noise by filtering the image with a Gaussian filter.

Take a box filer (for
example a 3x3 box
with all 1 coefficients,
and a 1/9 factor) to
get the average value
per box. Slide this
box over the image,
and the center cell of
the box will get the
new averaged value.

Instead of the box filter, often a gaussian filter is used, this will result in a blurred image. This

Voordelen van het kopen van samenvattingen bij Stuvia op een rij:

Verzekerd van kwaliteit door reviews

Stuvia-klanten hebben meer dan 700.000 samenvattingen beoordeeld. Zo weet je zeker dat je de beste documenten koopt!

Snel en makkelijk kopen

Je betaalt supersnel en eenmalig met iDeal, creditcard of Stuvia-tegoed voor de samenvatting. Zonder lidmaatschap.

Focus op de essentie

Samenvattingen worden geschreven voor en door anderen. Daarom zijn de samenvattingen altijd betrouwbaar en actueel. Zo kom je snel tot de kern!

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.

Tevredenheidsgarantie: hoe werkt dat?

Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.

Van wie koop ik deze samenvatting?

Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper jarllemmens. Stuvia faciliteert de betaling aan de verkoper.

Zit ik meteen vast aan een abonnement?

Nee, je koopt alleen deze samenvatting voor €8,49. Je zit daarna nergens aan vast.

Is Stuvia te vertrouwen?

4,6 sterren op Google & Trustpilot (+1000 reviews)

Afgelopen 30 dagen zijn er 85073 samenvattingen verkocht

Opgericht in 2010, al 14 jaar dé plek om samenvattingen te kopen

Start met verkopen

Populaire Universiteiten

Populaire Hogescholen

Populaire Scholen

Populaire samengevatte studieboeken voor Communicatie en Taal

Populaire samengevatte studieboeken voor Economie en Bedrijf

Populaire samengevatte studieboeken voor Exact en Informatica

Populaire samengevatte studieboeken voor Gedrag en Maatschappij

Populaire samengevatte studieboeken voor Gezondheid en Geneeskunde

Populaire samengevatte studieboeken voor Onderwijs en Opvoeding

Populaire samengevatte studieboeken voor Recht en Bestuur

De beste samenvattingen om je Wft-diploma te behalen

De beste samenvattingen om je theorie examens te behalen

De beste samenvattingen voor je cursus in de Veiligheidsbranche

De beste samenvattingen voor Gezondheid & Hygiëne cursussen

De beste samenvattingen voor zakelijke cursussen

De beste samenvattingen voor je PABO WisCAT cursus

Populaire vakken

Populaire vakken

Populaire vakken

Boekverslagen en samenvattingen

Samenvatting

Computer Vision & 3D Image Processing (5LSH0) lecture slides summary

Document informatie

Onderwerpen

Geschreven voor

3 beoordelingen

Verkoper

Ontvangen beoordelingen

Voorbeeld van de inhoud

Voordelen van het kopen van samenvattingen bij Stuvia op een rij:

Verzekerd van kwaliteit door reviews

Snel en makkelijk kopen

Focus op de essentie

Veelgestelde vragen

Wat krijg ik als ik dit document koop?

Tevredenheidsgarantie: hoe werkt dat?

Van wie koop ik deze samenvatting?

Zit ik meteen vast aan een abonnement?

Is Stuvia te vertrouwen?