Big Data
What is Big Data?
. It means data that is bigger than “normal” (there is no universally agreed definition). The term is an
all-encompassing one that covers the data, the data frameworks, and the tools and techniques used to
process and analyze the data.
. There are different ways that data can be big:
o Many data points.
o Many variables.
o High frequency.
o Data complexity.
. Big Data refers to a massive amount of data that keeps on growing exponentially with time.
. It is so voluminous (capacious) that it cannot be processed or analyzed using conventional data
processing techniques.
. It includes:
o data mining - involves discovering patterns, trends, correlations, or useful information
from large datasets using various techniques such as machine learning, statistics, and
artificial intelligence. (Example: identifying customer purchasing patterns from e-commerce data to
improve marketing strategies.)
o data storage - the process of storing and organizing vast
amounts of data in a structured manner to ensure efficient retrieval and management.
(Example: using a distributed file system such as the Hadoop Distributed File System (HDFS) to store and
manage large datasets.)
o data analysis - involves inspecting, cleaning, transforming, and modeling
data to discover useful information, draw conclusions, and support decision-making.
(Example: analyzing sales data to identify the best-performing products and optimize inventory management.)
o data sharing - the exchange of data between different entities or
systems, either within an organization or between organizations.
o data visualization - the representation of data in graphical or
visual formats, such as charts, graphs, and maps, to make complex datasets more
understandable. (Example: creating a dashboard with visualizations that show key performance indicators
(KPIs) and trends in real time.)
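The data analysis example above (finding the best-performing products from sales data) can be sketched in a few lines of Python. This is only a minimal illustration: the sales records, product names, and prices are made-up assumptions, and a real Big Data workload would run on a distributed framework rather than an in-memory list.

```python
from collections import defaultdict

# Hypothetical sales records: (product, units_sold, unit_price).
sales = [
    ("keyboard", 3, 25.0),
    ("mouse", 10, 10.0),
    ("monitor", 1, 150.0),
    ("keyboard", 2, 25.0),
]

# Transform: total revenue per product.
revenue = defaultdict(float)
for product, units, price in sales:
    revenue[product] += units * price

# Analyze: rank products from highest to lowest revenue.
ranked = sorted(revenue.items(), key=lambda item: item[1], reverse=True)
best_product, best_revenue = ranked[0]

print(best_product, best_revenue)
```

The same pattern (group, aggregate, rank) is what larger tools apply at scale; only the amount of data and the machinery behind it change.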
Types of Data
Unstructured Data:
. Unstructured data contains a significant amount of uncertain
and imprecise data; social media data, for example, is inherently
uncertain.
. It also refers to data that lacks any specific form or
structure whatsoever. This makes unstructured data very difficult and
time-consuming to process and analyze (e.g.
email).
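To see why unstructured data takes extra work, consider the free text of an email body. There are no fields to query, so some structure has to be extracted first. The sketch below uses a made-up message and a very simple word count; it is an illustration under those assumptions, not a proper text-analysis method.

```python
import re
from collections import Counter

# Hypothetical email body: free text with no fixed fields or schema.
body = (
    "Hi team, the delivery was late again. Late deliveries upset customers. "
    "Please fix the delivery schedule."
)

# Impose a minimal structure: lowercase the text and split it into words.
words = re.findall(r"[a-z]+", body.lower())

# Count how often each word occurs to get a rough idea of the topic.
counts = Counter(words)

print(counts["late"], counts["delivery"])
```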
Structured Data:
. It is data that can be processed, stored, and retrieved in a fixed
format.
. It refers to highly organized information that can be readily and
seamlessly stored in and accessed from a database by simple search
engine algorithms.
. E.g. the employee table in a company database is structured: the employee details, their
job positions, their salaries, etc. are present in an organized manner.
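The employee table example can be sketched with Python's built-in sqlite3 module. The table name, column names, and rows below are assumptions chosen for illustration; the point is that a fixed schema lets a simple query retrieve exactly the rows needed.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Fixed format: every row has the same typed columns.
conn.execute(
    "CREATE TABLE employee ("
    "id INTEGER PRIMARY KEY, name TEXT NOT NULL, "
    "position TEXT NOT NULL, salary REAL NOT NULL)"
)

# Hypothetical employees.
conn.executemany(
    "INSERT INTO employee (name, position, salary) VALUES (?, ?, ?)",
    [
        ("Ada", "Engineer", 5200.0),
        ("Ben", "Analyst", 4100.0),
        ("Cleo", "Analyst", 4300.0),
    ],
)

# Because the structure is known in advance, retrieval is a simple query.
rows = conn.execute(
    "SELECT name, salary FROM employee WHERE position = ? ORDER BY salary DESC",
    ("Analyst",),
).fetchall()

print(rows)
conn.close()
```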
Semi - Structured Data:
. Relates to data containing both of the formats mentioned
above, that is, structured and unstructured data.
. It refers to data that, although it has not been classified
under a particular repository (database), still contains vital
information or tags that segregate individual elements within
the data.
. The advantage of semi-structured data is that it is
flexible and portable. However, queries are less efficient
than on a constrained structure.
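JSON is a common example of semi-structured data: the keys act as tags that separate individual elements, but records do not have to share the same fields. The records below are made up for illustration. The sketch shows both the flexibility (each record carries different fields) and the query cost (every record has to be inspected because no schema says which fields exist).

```python
import json

# Hypothetical records: tagged elements, but no fixed schema.
raw = """
[
  {"id": 1, "name": "Ada", "email": "ada@example.com"},
  {"id": 2, "name": "Ben", "phones": ["555-0100", "555-0101"]},
  {"id": 3, "name": "Cleo", "address": {"city": "Lagos"}}
]
"""

records = json.loads(raw)

# Tags shared by every record can be read directly.
names = [record["name"] for record in records]

# Optional tags must be checked record by record; missing ones become None.
emails = [record.get("email") for record in records]

# The full set of tags is only known after scanning all the data.
all_keys = sorted({key for record in records for key in record})

print(names, emails, all_keys)
```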