CSC1033- Summary Notes

Notes for the module CSC1033. Summary notes for examination in May/June.

Last document update: 8 months ago

  • February 3, 2024
  • March 4, 2024
  • 50
  • 2023/2024
  • Summary
  • Unknown
Big Data
What is Big Data?
. It means data that is bigger than “normal” (there is no universally agreed definition). The term is
all-encompassing: it covers the data itself, data frameworks, and the tools and techniques used to
process and analyze the data.
. There are different ways that data can be big:
o Many Data points.
o Many Variables.
o High Frequency.
o Data Complexity.
. Big Data refers to a massive amount of data that keeps growing exponentially over time.
. It is so voluminous (capacious) that it cannot be processed or analyzed using conventional data
processing techniques.
. It includes:
o data mining - Data mining involves discovering patterns, trends, correlations, or useful information
from large datasets using techniques such as machine learning, statistics, and
artificial intelligence. (E.g. identifying customer purchasing patterns from e-commerce data to
improve marketing strategies.)

o data storage - Data storage refers to the process of storing and organizing vast
amounts of data in a structured manner to ensure efficient retrieval and management.
(E.g. using distributed file systems like Hadoop Distributed File System (HDFS) to store and manage
large datasets.)

o data analysis - Data analysis involves inspecting, cleaning, transforming, and modeling
data to discover useful information, draw conclusions, and support decision-making.
(E.g. analyzing sales data to identify best-performing products and optimize inventory management.)

o data sharing - Data sharing involves the exchange of data between different entities or
systems, either within an organization or between organizations.

o data visualization - Data visualization is the representation of data in graphical or
visual formats, such as charts, graphs, and maps, to make complex datasets more
understandable. (E.g. creating a dashboard with visualizations to show key performance indicators (KPIs)
and trends in real time.)
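
The data analysis example above (finding best-performing products from sales data) can be sketched in a few lines of Python. This is only a minimal illustration: the sales records and the helper name `units_per_product` are invented for the example, and a real Big Data workload would use a distributed framework rather than an in-memory list.

```python
from collections import Counter

# Hypothetical sales records as (product, units_sold) pairs.
# The values are invented purely for illustration.
sales = [
    ("laptop", 3),
    ("mouse", 10),
    ("laptop", 2),
    ("keyboard", 4),
    ("mouse", 5),
]


def units_per_product(records):
    """Sum the units sold for each product."""
    totals = Counter()
    for product, units in records:
        totals[product] += units
    return totals


totals = units_per_product(sales)

# The two best-performing products, highest total first.
best_sellers = totals.most_common(2)
print(best_sellers)
```

The same idea (group, aggregate, rank) carries over to larger datasets; only the tooling changes.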





Types of Data
Unstructured Data:

. Unstructured data contains a significant amount of uncertain
and imprecise data; social media data, for example, is inherently
uncertain.
. It also refers to data that lacks any specific form or
structure whatsoever. This makes unstructured data very difficult
and time-consuming to process and analyze (e.g.
email).


Structured Data:

. Structured data is data that can be processed, stored, and retrieved in a fixed
format.
. It refers to highly organized information that can be readily and
seamlessly stored and accessed from a database by simple search
engine algorithms.
. E.g. the employee table in a company database is structured: the employee details, their
job positions, their salaries, etc. are all present in an organized manner.
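
As a rough sketch of the employee table example, the snippet below uses Python's built-in `sqlite3` module with an in-memory database. The column names and the sample rows are assumptions made for illustration; the notes do not specify a schema.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")

# A fixed schema: every row has the same typed columns.
# The column names are hypothetical.
conn.execute(
    """
    CREATE TABLE employee (
        id INTEGER PRIMARY KEY,
        name TEXT NOT NULL,
        position TEXT NOT NULL,
        salary REAL NOT NULL
    )
    """
)

# Invented sample rows.
conn.executemany(
    "INSERT INTO employee (name, position, salary) VALUES (?, ?, ?)",
    [
        ("Ada", "Engineer", 52000.0),
        ("Ben", "Analyst", 41000.0),
        ("Cara", "Engineer", 58000.0),
    ],
)

# Because the format is fixed, retrieval is a simple query.
rows = conn.execute(
    "SELECT name, salary FROM employee WHERE position = ? ORDER BY name",
    ("Engineer",),
).fetchall()
print(rows)

conn.close()
```

The fixed columns are what make the query short: the database already knows where each piece of information lives.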


Semi - Structured Data:

. Relates to data that contains both of the formats mentioned
above, that is, structured and unstructured data.
. It refers to data that, although not classified
under a particular repository (database), still contains vital
information or tags that separate individual elements within
the data.
. The advantage of semi-structured data is that it is
flexible and portable. However, queries are less efficient
than in a constrained structure.
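
JSON is one common semi-structured format, and it is used here only as an assumed example (the notes do not name a format). In the sketch below, the keys act as the tags that separate individual elements, but the records do not all share the same fields. All names and values are invented.

```python
import json

# Hypothetical contact records. Each record is tagged with keys,
# but the set of keys differs from record to record.
raw = """
[
  {"name": "Ada", "email": "ada@example.com"},
  {"name": "Ben", "phones": ["555-0100", "555-0101"]},
  {"name": "Cara", "email": "cara@example.com", "notes": "Prefers contact by email."}
]
"""

records = json.loads(raw)

# Flexible: a missing field is simply absent, so the query has to
# handle that case (here, .get returns None when there is no email).
emails = [record.get("email") for record in records]

# No fixed schema: the full set of fields is only known after
# scanning every record.
all_keys = sorted({key for record in records for key in record})

print(emails)
print(all_keys)
```

Having to scan every record and guard against missing fields is one reason queries over semi-structured data tend to be less efficient than queries over a fixed schema.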





Who am I buying these notes from?

Stuvia is a marketplace, so you are not buying this document from us, but from seller giovanniconstantina04. Stuvia facilitates payment to the seller.
