Delve into HDFS Data Management with 30 carefully designed MCQs, covering data placement policy, load balancing with HDFS balancer, DataNode capacity and health, and HDFS file block management techniques.
hdfs balancer -starthdfs dfsadmin -balancerstart-balancerhdfs balancer -executehdfs dfsadmin -reporthdfs datanode -statushdfs nodemanager -statushdfs healthcheckhdfs balancerhdfs dfsckhdfs cleanuphdfs adminhdfs dfs -lshdfs dfs -stathdfs fsckhdfs dfs -blocks| Qno | Answer |
|---|---|
| 1 | C) First replica on the writer’s node, others on different racks |
| 2 | B) To increase fault tolerance by spreading data across racks |
| 3 | C) 3 |
| 4 | C) On a different rack from the first two replicas |
| 5 | C) Ensuring fault tolerance and data reliability |
| 6 | B) To rebalance data blocks across DataNodes for even utilization |
| 7 | C) When there is an imbalance in DataNode storage utilization |
| 8 | C) By comparing the storage utilization of DataNodes |
| 9 | B) hdfs dfsadmin -balancer |
| 10 | B) Data blocks are moved from overutilized to underutilized DataNodes |
| 11 | B) By summing up the disk space available across all disks on the node |
| 12 | C) Missing block reports |
| 13 | B) To confirm the DataNode is active and operational |
| 14 | B) It replicates the missing blocks to other nodes |
| 15 | A) hdfs dfsadmin -report |
| 16 | B) A fixed-size unit of data into which files are split |
| 17 | B) 128 MB |
| 18 | B) By copying blocks across multiple nodes and racks |
| 19 | C) Improved performance for large file reads |
| 20 | A) By using custom replication factors |
| 21 | B) hdfs dfsck |
| 22 | B) Increased memory usage on the NameNode |
| 23 | B) By replicating the block from another node |
| 24 | C) hdfs fsck |
| 25 | A) New replicas are created and stored on additional DataNodes |