100% satisfaction guarantee Immediately available after payment Both online and in PDF No strings attached 4.2 TrustPilot
logo-home
Exam (elaborations)

Data Science - Big Data Assessment Exam Questions with 100% Verified Answers

Rating
-
Sold
-
Pages
30
Grade
A+
Uploaded on
30-10-2024
Written in
2024/2025

Data Science - Big Data Assessment Exam Questions with 100% Verified Answers

Institution
Data Science
Course
Data Science










Whoops! We can’t load your doc right now. Try again or contact support.

Written for

Institution
Data Science
Course
Data Science

Document information

Uploaded on
October 30, 2024
Number of pages
30
Written in
2024/2025
Type
Exam (elaborations)
Contains
Questions & answers

Subjects

Content preview

Data Science - Big Data Assessment
Exam Questions with 100% Verified
Answers

when building a standalone application, you need to create the SparkContext. To
do this in Scala, you would include which of the following within the main
method?


val conf=new SparkConf().setAppName("AuctionsApp")
val sc= new SparkContext(conf)


val sc = SparkContext().setAppName("AuctionsApp"_
val conf= SparkConf(sc)


val sc=new SparkContext()


val conf= new SparkConf().setAppName("AuctionsApp") - ✔ ✔ val
conf=new SparkConf().setAppName("AuctionsApp")
val sc= new SparkContext(conf)

,which of these are responsible to output key-value pairs from the Reducer phase
to output files?


reducer Writer


RecordReader


RecordWriter


none of these - ✔ ✔ RecordWriter


The HDFS block size configuration is 128MB in your hadoop cluster. The HDFS
directory contains 50 small files each of 200 MB in size. How many map tasks
will be created when the inputformat for your job is TextInputFormat?


100
128
50

200 - ✔ ✔ 100


which of the following application types can Spark run in addition to batch-
processing jobs?


all
graph processing

, machine learning

stream processin - ✔ ✔ graph processing


in what year was apache spark made an open-source technology? - ✔ ✔ 2010


what kind of data can be handled by spark?


semi-structured
unstructured
structured

all - ✔ ✔ all


spark is 100x faster than MapReduce due to development in scala - ✔ ✔ false


the following are characteristics shared by hadoop and spark, except ____


- both use open source api's to link between different tools
- both are data processing platforms
-both have their own file system
-both are cluster computing environments - ✔ ✔ both have their own file system


spark can store its data in ___


- all

Get to know the seller

Seller avatar
Reputation scores are based on the amount of documents a seller has sold for a fee and the reviews they have received for those documents. There are three levels: Bronze, Silver and Gold. The better the reputation, the more your can rely on the quality of the sellers work.
QUINTER New York College Of Dentistry
View profile
Follow You need to be logged in order to follow users or courses
Sold
344
Member since
2 year
Number of followers
104
Documents
38476
Last sold
1 week ago

3.5

58 reviews

5
26
4
8
3
7
2
1
1
16

Recently viewed by you

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Frequently asked questions