CCA-500 Free Practice Questions: "Cloudera Certified Administrator for Apache Hadoop (CCAH)"
A user comes to you, complaining that when she attempts to submit a Hadoop job, it fails. There is a directory in HDFS
named /data/input. The JAR is named j.jar, and the driver class is named DriverClass.
She runs the command:
hadoop jar j.jar DriverClass /data/input /data/output
The error message returned includes the line:
PriviledgedActionException as:training (auth:SIMPLE)
cause:org.apache.hadoop.mapreduce.lib.input.InvalidInputException:
Input path does not exist: file:/data/input
What is the cause of the error?
Answer: D
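The decisive clue is the file: scheme in the error message: the job resolved /data/input against the local filesystem instead of HDFS, which points to a client-side configuration problem rather than a missing directory. A minimal troubleshooting sketch, assuming a standard Hadoop client; the NameNode URI shown is illustrative:

# Confirm the input directory really exists in HDFS:
hdfs dfs -ls /data/input

# Check which filesystem the client resolves bare paths against.
# If this prints file:/// instead of an hdfs:// URI, core-site.xml on
# the client is not pointing at the cluster:
hdfs getconf -confKey fs.defaultFS
# expected: something like hdfs://namenode.example.com:8020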
You use the hadoop fs -put command to add a file "sales.txt" to HDFS. This file is small enough that it fits into a single
block, which is replicated to three nodes in your cluster (with a replication factor of 3). One of the nodes holding this
file (a single block) fails. How will the cluster handle the replication of the file in this situation?
Answer: A
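For background, HDFS repairs this situation automatically: when the NameNode stops receiving heartbeats from the failed DataNode (roughly ten minutes with default settings), it marks the node dead and schedules a new copy of the block on another live DataNode, restoring the replication factor to 3. A minimal way to observe this, assuming a running cluster; the HDFS target path is illustrative:

# Upload the file as in the question:
hadoop fs -put sales.txt /user/training/sales.txt

# Show the block's replica count and which DataNodes hold it:
hdfs fsck /user/training/sales.txt -files -blocks -locations

# Cluster-wide view of live/dead DataNodes and under-replicated blocks:
hdfs dfsadmin -report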
You need to analyze 60,000,000 images stored in JPEG format, each of which is approximately 25 KB. Because your
Hadoop cluster isn't optimized for storing and processing many small files, you decide to take the following actions:
1. Group the individual images into a set of larger files
2. Use the set of larger files as input for a MapReduce job that processes them directly in Python using Hadoop
Streaming.
Which data serialization system gives you the flexibility to do this?
Answer: A, C
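For reference, SequenceFiles (and Avro container files) are the usual way to pack many small binary files into large, splittable inputs, and Hadoop Streaming can consume them without any Java code. A hedged sketch of step 2, assuming the images have already been packed into SequenceFiles in HDFS; the streaming jar location, the /data/images-packed and /data/image-stats paths, and the analyze.py mapper are all illustrative:

hadoop jar /usr/lib/hadoop-mapreduce/hadoop-streaming.jar \
  -inputformat org.apache.hadoop.mapred.SequenceFileAsTextInputFormat \
  -input /data/images-packed \
  -output /data/image-stats \
  -mapper analyze.py \
  -file analyze.py

# SequenceFileAsTextInputFormat feeds each record to the Python mapper as a
# tab-separated key/value text line, so the packed files can be processed
# directly with a streaming script.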