What is the difference between NFS and HDFS ?
Answer : NFS (Network File System) is one of the oldest and most popular...
Why do we need Data Locality in Hadoop ?
Answer : Datasets in HDFS are stored as blocks on DataNodes...
What are the running modes of Hadoop ?
Answer : The three running modes of Hadoop are as follows...
What are the configuration parameters in a “MapReduce” program ?
Answer : Input location of Jobs in the distributed file system...
What are the core components of Hadoop ?
Answer : Hadoop is an open source framework that is meant...
What are the features of Hadoop ?
Answer : Hadoop supports the storage and processing of big data...
What is the difference between Hadoop and RDBMS ?
Answer : Hadoop is based on ‘Schema on Read’...
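A minimal sketch of the ‘Schema on Read’ idea, not taken from the original answer. It assumes hypothetical comma-separated records and hypothetical field names (`user`, `age`): raw lines are stored untouched, and a schema is applied only when the data is read.

```python
# Raw records are stored exactly as they arrive; nothing is validated on write.
raw_store = []


def write_raw(line):
    """Append the raw line without checking its structure."""
    raw_store.append(line)


def read_with_schema(schema):
    """Apply a schema (a list of (name, type) pairs) while reading.

    Lines that do not fit the schema are skipped at read time instead of
    being rejected at write time.
    """
    rows = []
    for line in raw_store:
        parts = line.split(",")
        if len(parts) != len(schema):
            continue  # does not match this schema, but stays in storage
        try:
            rows.append(
                {name: cast(value) for (name, cast), value in zip(schema, parts)}
            )
        except ValueError:
            continue  # a field could not be converted to the requested type
    return rows


# Hypothetical sample data: the second record has a non-numeric age.
write_raw("alice,30")
write_raw("bob,unknown")
write_raw("carol,41")

# The same stored data can be read under two different schemas.
typed_rows = read_with_schema([("user", str), ("age", int)])
loose_rows = read_with_schema([("user", str), ("age", str)])
```

Under the typed schema the malformed record is dropped at read time, while the looser schema returns all three records. A ‘Schema on Write’ system would instead have rejected the second record when it was inserted.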
How do you recover a NameNode when it is down ?
Answer : Use the FsImage which is file system metadata...
What are the Six V’s of Big Data ?
Answer : Volume represents the amount of data that is growing...
What is fsck ?
Answer : Fsck stands for File System Check. This command is used...
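As a rough illustration, the sketch below builds an `hdfs fsck` command line and optionally runs it. The helper names `build_fsck_command` and `run_fsck` and the example path are hypothetical. The flags `-files`, `-blocks` and `-locations` are standard fsck options, and the sketch assumes the `hdfs` binary is on the PATH of a machine configured for the cluster.

```python
import subprocess


def build_fsck_command(path="/", files=False, blocks=False, locations=False):
    """Build the argument list for `hdfs fsck`.

    fsck nests its reporting flags: -locations refines -blocks, which in
    turn refines -files, so a deeper flag switches on the shallower ones.
    """
    if locations:
        blocks = True
    if blocks:
        files = True

    cmd = ["hdfs", "fsck", path]
    if files:
        cmd.append("-files")
    if blocks:
        cmd.append("-blocks")
    if locations:
        cmd.append("-locations")
    return cmd


def run_fsck(path="/", **flags):
    """Run fsck and return its text report (requires a working HDFS client)."""
    result = subprocess.run(
        build_fsck_command(path, **flags),
        capture_output=True,
        text=True,
        check=False,  # inspect the report rather than raising on a bad exit code
    )
    return result.stdout


# Hypothetical path; nothing is executed until run_fsck() is called.
example_cmd = build_fsck_command("/user/data", locations=True)
```

Calling `run_fsck("/user/data", blocks=True)` on a cluster node would return the health report for that directory, including per-file block listings.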
Why is Hadoop used for Big Data Analytics ?
Answer : Big data analytics is the process of examining large data...
What are the components of HDFS and YARN ?
Answer : NameNode is the master node for processing metadata information...
What are the steps to deploy a big data solution ?
Answer : There are three steps to deploy a Big Data ...
How is big data analysis helpful in increasing business revenue ?
Answer : Big data analysis has become very important for businesses...
What is Big Data ?
Answer : Big Data is a term associated with complex...
What is Shark ?
Answer : Shark is a tool, developed for people who are from...
What is the difference between Spark and Hadoop MapReduce ?
Answer : Apache Spark is an open-source distributed cluster-computing...
What is the purpose of the @outputSchema decorator in a Python UDF used in Apache Pig ?
Answer : A UDF has input and output. Here are the different ways you can specify the output format of a Python UDF through use...
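A small sketch of how `@outputSchema` is typically attached to a Python UDF. The function names `to_upper` and `str_length` are hypothetical. The `pig_util` import is an assumption about Pig's CPython streaming support, and the fallback decorator is only a stand-in so the file can be imported and tested outside Pig. It is not Pig's own implementation.

```python
try:
    # Assumption: available when the script runs under Pig's CPython
    # streaming support, which provides a pig_util module.
    from pig_util import outputSchema
except ImportError:
    # Stand-in for local testing: record the schema string on the function.
    def outputSchema(schema):
        def wrap(func):
            func.outputSchema = schema
            return func
        return wrap


@outputSchema("word:chararray")
def to_upper(word):
    """Return the input string upper-cased; None is passed through as null."""
    if word is None:
        return None
    return word.upper()


@outputSchema("length:int")
def str_length(word):
    """Return the length of the input string, or 0 for a null input."""
    return 0 if word is None else len(word)
```

The schema string tells Pig the name and type of the value the UDF returns, so later statements in the script can refer to the field by name, such as `word` or `length`.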
What is the main difference between Pig, Hive and SQL ?
Answer : Pig does not have a dedicated metadata database. Hive uses a variant of the SQL DDL language...
What is the difference between Pig, Hive and HBase ?
Answer : Pig is used for semi-structured data, Hive is a query engine, and HBase is a data store particularly suited to unstructured data.