databricks data engineering associate questions wi
Geschreven voor
Databricks Data Engineering Associate
Databricks Data Engineering Associate
Verkoper
Volgen
Fordenken
Ontvangen beoordelingen
Voorbeeld van de inhoud
Databricks Data Engineering Associate
questions with answers
Question 1 w
Which of the following describes a benefit of a data lakehouse that is unavailable in a
w w w w w w w w w w w w w w w
traditional data warehouse?
w w w
A. A data lakehouse provides a relational system of data management.
w w w w w w w w w w
B. A data lakehouse captures snapshots of data for version control purposes.
w w w w w w w w w w w
C. A data lakehouse couples storage and compute for complete control.
w w w w w w w w w w
D. A data lakehouse utilizes proprietary storage formats for data.
w w w w w w w w w
E. A data lakehouse enables both batch and streaming analytics. - ANSWER: ➡ E. A data
w w w w w w w w w w w ww w w w
lakehouse enables both batch and streaming analytics.
w w w w w w w
Question 2 w
Which of the following locations hosts the driver and worker nodes of a Databricks-managed
w w w w w w w w w w w w w
cluster?
w
A. Data plane
w w
B. Control plane
w w
C. Databricks Filesystem
w w
D. JDBC data source
w w w
E. Databricks web application - ANSWER: ➡ A. Data plane
w w w w w ww w w w
Question 3 w
A data architect is designing a data model that works for both video-based machine learning
w w w w w w w w w w w w w w
workloads and highly audited batch ETL/ELT workloads.
w w w w w w w
Which of the following describes how using a data lakehouse can help the data architect meet
w w w w w w w w w w w w w w w
the needs of both workloads?
w w w w w
A. A data lakehouse requires very little data modeling.
w w w w w w w w
,B. A data lakehouse combines compute and storage for simple governance.
w w w w w w w w w w
C. A data lakehouse provides autoscaling for compute clusters.
w w w w w w w w
D. A data lakehouse stores unstructured data and is ACID-compliant.
w w w w w w w w w
E. A data lakehouse fully exists in the cloud. - ANSWER: ➡ D. A data lakehouse stores
w w w w w w w w w w ww w w w w w
unstructured data and is ACID-compliant.
w w w w w
Question 4 w
Which of the following describes a scenario in which a data engineer will want to use a Job
w w w w w w w w w w w w w w w w w
cluster instead of an all-purpose cluster?
w w w w w w
A. An ad-hoc analytics report needs to be developed while minimizing compute costs.
w w w w w w w w w w w w
B. A data team needs to collaborate on the development of a machine learning model.
w w w w w w w w w w w w w w
C. An automated workflow needs to be run every 30 minutes.
w w w w w w w w w w
D. A Databricks SQL query needs to be scheduled for upward reporting.
w w w w w w w w w w w
E. A data engineer needs to manually investigate a production error. - ANSWER: ➡ C. An
w w w w w w w w w w w w ww w w
automated workflow needs to be run every 30 minutes.
w w w w w w w w w
Question 5 w
A data engineer has created a Delta table as part of a data pipeline. Downstream data analysts
w w w w w w w w w w w w w w w w
now need SELECT permission on the Delta table.
w w w w w w w w
Assuming the data engineer is the Delta table owner, which part of the Databricks Lakehouse
w w w w w w w w w w w w w w
Platform can the data engineer use to grant the data analysts the appropriate access?
w w w w w w w w w w w w w w
A. Reposw
B. Jobs w
C. Data Explorer
w w
D. Databricks Filesystem
w w
E. Dashboards - ANSWER: ➡ C. Data Explorer
w w w ww w w w
Question 6 w
Two junior data engineers are authoring separate parts of a single data pipeline notebook.
w w w w w w w w w w w w w
They are working on separate Git branches so they can pair program on the same notebook
w w w w w w w w w w w w w w w w
, simultaneously. A senior data engineer experienced in Databricks suggests there is a better
w w w w w w w w w w w w w
alternative for this type of collaboration.
w w w w w w
Which of the following supports the senior data engineer's claim?
w w w w w w w w w
A. Databricks Notebooks support automatic change-tracking and versioning
w w w w w w w
B. Databricks Notebooks support real-time coauthoring on a single notebook
w w w w w w w w w
C. Databricks Notebooks support commenting and notification comments
w w w w w w w
D. Databricks Notebooks support the use of multiple languages in the same notebook
w w w w w w w w w w w w
E. Databricks Notebooks support the creation of interactive data visualizations - ANSWER:
w w w w w w w w w w w w
➡ B. Databricks Notebooks support real-time coauthoring on a single notebook
w w w w w w w w w w w
Question 7 w
Which of the following describes how Databricks Repos can help facilitate CI/CD workflows
w w w w w w w w w w w w
on the Databricks Lakehouse Platform?
w w w w w
A. Databricks Repos can facilitate the pull request, review, and approval process before
w w w w w w w w w w w w
merging branches
w w
B. Databricks Repos can merge changes from a secondary Git branch into a main Git branch
w w w w w w w w w w w w w w w
C. Databricks Repos can be used to design, develop, and trigger Git automation pipelines
w w w w w w w w w w w w w
D. Databricks Repos can store the single-source-of-truth Git repository
w w w w w w w w
E. Databricks Repos can commit or push code changes to trigger a CI/CD process - ANSWER:
w w w w w w w w w w w w w w w w
➡ E. Databricks Repos can commit or push code changes to trigger a CI/CD process
w w w w w w w w w w w w w w w
Question 8 w
Which of the following statements describes Delta Lake?
w w w w w w w
A. Delta Lake is an open source analytics engine used for big data workloads.
w w w w w w w w w w w w w
B. Delta Lake is an open format storage layer that delivers reliability, security, and
w w w w w w w w w w w w w
performance.
w
C. Delta Lake is an open source platform to help manage the complete machine learning
w w w w w w w w w w w w w w
lifecycle.
w
D. Delta Lake is an open source data storage format for distributed data.
w w w w w w w w w w w w
Voordelen van het kopen van samenvattingen bij Stuvia op een rij:
√ Verzekerd van kwaliteit door reviews
Stuvia-klanten hebben meer dan 700.000 samenvattingen beoordeeld. Zo weet je zeker dat je de beste documenten koopt!
Snel en makkelijk kopen
Je betaalt supersnel en eenmalig met iDeal, Bancontact of creditcard voor de samenvatting. Zonder lidmaatschap.
Focus op de essentie
Samenvattingen worden geschreven voor en door anderen. Daarom zijn de samenvattingen altijd betrouwbaar en actueel. Zo kom je snel tot de kern!
Veelgestelde vragen
Wat krijg ik als ik dit document koop?
Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.
Tevredenheidsgarantie: hoe werkt dat?
Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.
Van wie koop ik deze samenvatting?
Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper Fordenken. Stuvia faciliteert de betaling aan de verkoper.
Zit ik meteen vast aan een abonnement?
Nee, je koopt alleen deze samenvatting voor €12,18. Je zit daarna nergens aan vast.