D-DS-FN-23 無料問題集「EMC Dell Data Science Foundations」

Adata scientist is investigating a new database column that needs to be integrated into their model. The column contains 10,000 labels with 300 unique values.
Which data structure should be used when working in R?

Which word or phrase completes the statement; "A theater actor is to 'artistic and expressive' as a data scientist is to."?

In the data preparation phase of the data analytics lifecycle, what does the term "data conditioning" refer to?

解説: (JPNTest メンバーにのみ表示されます)
Which SQL OLAP extension provides all possible grouping combinations?

Refer to the exhibit.

You are using K-means clustering to classify customer behavior for a large retailer. You need to determine the optimum number of customer groups. You plot the within-sum-of- squares (wss) data as shown in the exhibit.
How many customer groups should you specify?

A study was run to identify general dietary patterns among the residents of a small town. Twelve thousand people were surveyed and the data was subject to K-means clustering.
In one of the iterations, there were six clusters formed with 38, 1560, 1799, 2560, 2893, and 3150 respondents.
What should be the next step in identifying optimal clusters?

Refer to the Exhibit.

You are working on creating an OLAP query that outputs several rows of with summary rows of subtotals and grand totals in addition to regular rows that may contain NULL as shown in the exhibit.
Which function can you use in your query to distinguish the row from a regular row to a subtotal row?

You are assigned the task of creating customer profiles for your company. In your database, you have
25 key input variables that come together to define 2,500 customers. You decide to run a K-means cluster analysis on the 25 input variables based on k=4 to build your profiles.
Your analysis resulted in four cluster populations:
Cluster A=1,000 customers
Cluster B=560 customers
Cluster C=925 customers
Cluster D=15 customers
What should be attempted first to more evenly distribute the customer population across clusters?

Refer to the exhibit.

The graph represents an ROC space with four classifiers labelled A through D.
Which point in the graph represents a perfect classification?

Refer to the exhibit.

You are using k-means clustering to discover groupings within a data set. You plot within- sum-of-squares (wss) of multiple cluster sizes.
Based on the exhibit, how many clusters should you use in your analysis?

How is dimensionality defined in a "bag of words" document representation?

You have an automotive database containing numeric characteristics such as engine size, horsepower, and top speed.
Which technique could you use to group similar cars together?

Consider these itemsets:
(hat, scarf, coat)
(hat, scarf, coat, gloves)
(hat, scarf, gloves)
(hat, gloves)
(scarf, coat, gloves)
What is the confidence of the rule (hat, scarf) => gloves?

What provides the means for matching and manipulating text strings in SQL?

Which process in text analysis can be used to reduce dimensionality?

A disk drive manufacturer has a defect rate of less than 1.0% with 98% confidence. A quality assurance team samples 1000 disk drives and finds 14 defective units.
Which action should the team recommend?

弊社を連絡する

我々は12時間以内ですべてのお問い合わせを答えます。

オンラインサポート時間:( UTC+9 ) 9:00-24:00
月曜日から土曜日まで

サポート:現在連絡