Databricks-Certified-Data-Engineer-Associate無料問題集「Databricks Certified Data Engineer Associate」

質問 1

Which of the following Structured Streaming queries is performing a hop from a Silver table to a Gold table?

（A）

（B）

（C）

（D）

（E）

正解：A 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 2

A dataset has been defined using Delta Live Tables and includes an expectations clause:
CONSTRAINT valid_timestamp EXPECT (timestamp > '2020-01-01') ON VIOLATION FAIL UPDATE What is the expected behavior when a batch of data containing data that violates these constraints is processed?

（A）Records that violate the expectation cause the job to fail.

（B）Records that violate the expectation are dropped from the target dataset and recorded as invalid in the event log.

（C）Records that violate the expectation are added to the target dataset and flagged as invalid in a field added to the target dataset.

（D）Records that violate the expectation are added to the target dataset and recorded as invalid in the event log.

（E）Records that violate the expectation are dropped from the target dataset and loaded into a quarantine table.

正解：A 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 3

A data engineer is maintaining a data pipeline. Upon data ingestion, the data engineer notices that the source data is starting to have a lower level of quality. The data engineer would like to automate the process of monitoring the quality level.
Which of the following tools can the data engineer use to solve this problem?

（A）Delta Lake

（B）Delta Live Tables

（C）Auto Loader

（D）Data Explorer

（E）Unity Catalog

正解：B 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 4

A data analyst has a series of queries in a SQL program. The data analyst wants this program to run every day. They only want the final query in the program to run on Sundays. They ask for help from the data engineering team to complete this task.
Which of the following approaches could be used by the data engineering team to complete this task?

（A）They could automatically restrict access to the source table in the final query so that it is only accessible on Sundays.

（B）They could only run the entire program on Sundays.

（C）They could redesign the data model to separate the data used in the final query into a new table.

（D）They could wrap the queries using PySpark and use Python's control flow system to determine when to run the final query.

（E）They could submit a feature request with Databricks to add this functionality.

正解：D 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 5

A Delta Live Table pipeline includes two datasets defined using streaming live table. Three datasets are defined against Delta Lake table sources using live table.
The table is configured to run in Production mode using the Continuous Pipeline Mode.
What is the expected outcome after clicking Start to update the pipeline assuming previously unprocessed data exists and all definitions are valid?

（A）All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will be deployed for the update and terminated when the pipeline is stopped.

（B）All datasets will be updated once and the pipeline will shut down. The compute resources will be terminated.

（C）All datasets will be updated once and the pipeline will shut down. The compute resources will persist to allow for additional testing.

（D）All datasets will be updated at set intervals until the pipeline is shut down. The compute resources will persist to allow for additional testing.

正解：A 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 6

Which of the following can be used to simplify and unify siloed data architectures that are specialized for specific use cases?

（A）Data lake

（B）Data warehouse

（C）None of these

（D）All of these

（E）Data lakehouse

正解：E 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 7

A data analysis team has noticed that their Databricks SQL queries are running too slowly when connected to their always-on SQL endpoint. They claim that this issue is present when many members of the team are running small queries simultaneously. They ask the data engineering team for help. The data engineering team notices that each of the team's queries uses the same SQL endpoint.
Which of the following approaches can the data engineering team use to improve the latency of the team's queries?

（A）They can turn on the Serverless feature for the SQL endpoint.

（B）They can turn on the Serverless feature for the SQL endpoint and change the Spot Instance Policy to "Reliability Optimized."

（C）They can increase the cluster size of the SQL endpoint.

（D）They can turn on the Auto Stop feature for the SQL endpoint.

（E）They can increase the maximum bound of the SQL endpoint's scaling range.

正解：E 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 8

A data engineer that is new to using Python needs to create a Python function to add two integers together and return the sum?
Which of the following code blocks can the data engineer use to complete this task?

（A）

（B）

（C）

（D）

（E）

正解：A 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 9

A data engineer wants to create a new table containing the names of customers who live in France.
They have written the following command:
CREATE TABLE customersInFrance
_____ AS
SELECT id,
firstName,
lastName
FROM customerLocations
WHERE country = 'FRANCE';
A senior data engineer mentions that it is organization policy to include a table property indicating that the new table includes personally identifiable information (Pll).
Which line of code fills in the above blank to successfully complete the task?

（A）COMMENT "Contains PIT

（B）"COMMENT PII"

（C）TBLPROPERTIES PII

（D）511

正解：C 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 10

In which of the following scenarios should a data engineer use the MERGE INTO command instead of the INSERT INTO command?

（A）When the source table can be deleted

（B）When the location of the data needs to be changed

（C）When the target table is an external table

（D）When the source is not a Delta table

（E）When the target table cannot contain duplicate records

正解：E 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

Databricks-Certified-Data-Engineer-Associate 無料問題集「Databricks Certified Data Engineer Associate」

弊社を連絡する

関連リンク

トップ試験