Sunday, July 30, 2017

Top 30 Hadoop Admin Interview Questions and Answers

The 30 most frequently asked Hadoop Admin interview questions and answers, for both freshers and experienced candidates.
1.  Which operating system(s) are supported for production Hadoop deployment?
2.  What is the role of the namenode?
3.  What happens on the namenode when a client tries to read a data file?
4.  What are the hardware requirements for a Hadoop cluster (primary and secondary namenodes and datanodes)?
5.  What mode(s) can Hadoop code be run in?
6.  How would a Hadoop administrator deploy various components of Hadoop in production?
7.  What is the best practice to deploy the secondary namenode?
8.  Is there a standard procedure to deploy Hadoop?
9.  What is the role of the secondary namenode?
10. What are the side effects of not running a secondary namenode?
11. What happens if a datanode loses network connection for a few minutes?
12. What happens if one of the datanodes has a much slower CPU?
13. What is speculative execution?
14. How many racks do you need to create a Hadoop cluster in order to make sure that the cluster operates reliably?
15. Are there any special requirements for the namenode?
16. What is distributed copy (distcp)?
17. What is replication factor?
18. What daemons run on master nodes?
19. What is rack awareness?
20. What is the role of the jobtracker in a Hadoop cluster?
21. How does the Hadoop cluster tolerate datanode failures?
22. What is the procedure for namenode recovery?
23. The web UI shows that half of the datanodes are in decommissioning mode. What does that mean? Is it safe to remove those nodes from the network?
24. What does the Hadoop administrator have to do after adding new datanodes to the Hadoop cluster?
25. If the Hadoop administrator needs to make a change, which configuration file does he need to change?
26. MapReduce jobs are failing on a cluster that was just restarted. They worked before the restart. What could be wrong?
27. MapReduce jobs take too long. What can be done to improve the performance of the cluster?
28. How often do you need to reformat the namenode?
29. After increasing the replication level, I still see that data is under-replicated. What could be wrong?
30. What is the procedure for namenode recovery?
