<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>what is big data hadoop - Wikitechy</title>
	<atom:link href="https://www.wikitechy.com/interview-questions/tag/what-is-big-data-hadoop/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.wikitechy.com/interview-questions/tag/what-is-big-data-hadoop/</link>
	<description>Interview Questions</description>
	<lastBuildDate>Wed, 22 Sep 2021 05:53:05 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9</generator>

<image>
	<url>https://www.wikitechy.com/interview-questions/wp-content/uploads/2025/10/cropped-wikitechy-icon-32x32.png</url>
	<title>what is big data hadoop - Wikitechy</title>
	<link>https://www.wikitechy.com/interview-questions/tag/what-is-big-data-hadoop/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>What is the difference between Apache Hadoop and Cloudera in big data ?</title>
		<link>https://www.wikitechy.com/interview-questions/cloudera-impala/what-is-the-difference-between-apache-hadoop-and-cloudera-in-big-data/</link>
					<comments>https://www.wikitechy.com/interview-questions/cloudera-impala/what-is-the-difference-between-apache-hadoop-and-cloudera-in-big-data/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 19 Jul 2021 06:38:24 +0000</pubDate>
				<category><![CDATA[Cloudera Impala]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[Agreeya Solutions interview questions and answers]]></category>
		<category><![CDATA[apache hadoop vs cloudera]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[big data analytics stages]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[cloudera admin interview questions]]></category>
		<category><![CDATA[cloudera hadoop distribution]]></category>
		<category><![CDATA[cloudera hadoop download]]></category>
		<category><![CDATA[cloudera hiring process]]></category>
		<category><![CDATA[cloudera manager interview questions]]></category>
		<category><![CDATA[cloudera questions]]></category>
		<category><![CDATA[cloudera software engineer interview questions]]></category>
		<category><![CDATA[cloudera vs hortonworks 2016]]></category>
		<category><![CDATA[cloudera vs hortonworks 2017]]></category>
		<category><![CDATA[cloudera vs hortonworks certification]]></category>
		<category><![CDATA[difference between bigdata and hadoop]]></category>
		<category><![CDATA[does mapr supports streaming data ingestion]]></category>
		<category><![CDATA[FIS Global Business Solutions India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop admin and cloudera certification interview question]]></category>
		<category><![CDATA[hadoop distributions comparison]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[IHS Markit interview questions and answers]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[interview questions on cloudera hadoop]]></category>
		<category><![CDATA[Iris Software interview questions and answers]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[Nagarro Software Pvt. Ltd interview questions and answers]]></category>
		<category><![CDATA[Prokarma Softech Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Sapient Consulting Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Standard Chartered Global Business Services Private Limited interview questions and answers]]></category>
		<category><![CDATA[Synechron Technologies Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[ValueLabs interview questions and answers]]></category>
		<category><![CDATA[what is big data and hadoop]]></category>
		<category><![CDATA[what is big data hadoop]]></category>
		<category><![CDATA[what is cloudera]]></category>
		<category><![CDATA[what is cloudera hadoop]]></category>
		<category><![CDATA[what is hadoop technology]]></category>
		<category><![CDATA[what is hadoop used for]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=1023</guid>

					<description><![CDATA[Answer : Apache Hadoop is the Hadoop distribution from Apache group...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="difference-between-apache-hadoop-and-cloudera-in-big-data" class="color-purple" style="text-align: justify;">Difference between Apache Hadoop and Cloudera in big data</h2>
</div>
</div>
<div class="row">
<div class="col-sm-12">
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Apache Hadoop is the Hadoop distribution from Apache group.</li>
<li>Cloudera Hadoop has its own supply of Hadoop which is designed on top of Apache Hadoop. so it does not have latest release of Hadoop.</li>
<li>Cloudera Hadoop contains extra tools. while Hadoop distributions like Cloudera Search, Impala, Cloudera Navigator and Cloudera Manager these are not involved in Cloudera Hadoop.</li>
<li>These additional tools turns out Cloudera Hadoop to be slightly open source and proprietary.</li>
</ul>
</div>
</div>
<div class="text-center row" style="text-align: justify;">
<div class="col-sm-12">
<div id="bsa-zone_1590522538159-8_123456"></div>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img fetchpriority="high" decoding="async" class="aligncenter size-medium" src="https://cdn.wikitechy.com/interview-questions/cloudera impala/distrbution-hadoop.png" alt="Distribution hadoop" width="768" height="315" /></div>
</div>
<div class="Content">
<div class="hddn">
<ul>
<li style="text-align: justify;">Apache Hadoop is very important to use to cope up with any data, no matter how large or what type or complex it is. It works like a fluid way or can say it is very faster, cheaper and reliable analytic tool than others.</li>
<li style="text-align: justify;">Even, it can easily work on unstructured, and schemaless data and people can easily ground their data into a format without reformating or putting any other efforts.</li>
<li style="text-align: justify;">Apart from this, via Apache Hadoop various programming model has been simplified to allow you to quickly write and test the software in distributed systems.</li>
<li style="text-align: justify;">This is an open-source Apache Hadoop distribution usually, focus on enterprise-class deployments.</li>
<li style="text-align: justify;">Cloudera Inc is a very popular American software company, known for providing Apache Hadoop-based software, services, and full support for the same.</li>
<li style="text-align: justify;">With Cloudera, it is said that Hadoop is just a starting to create your data management strategy, and you can easily use the same to add the security and other various functions to create an enterprise-grade foundation for your data.</li>
</ul>
</div>
</div>
</div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/cloudera-impala/what-is-the-difference-between-apache-hadoop-and-cloudera-in-big-data/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Why do we need Data Locality in Hadoop ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 18:21:39 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[3 data locality]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[apache hadoop]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[azure hadoop]]></category>
		<category><![CDATA[big data hadoop]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[data flow in mapreduces]]></category>
		<category><![CDATA[data locality]]></category>
		<category><![CDATA[data locality c++]]></category>
		<category><![CDATA[data locality definition]]></category>
		<category><![CDATA[data locality in cloud computing]]></category>
		<category><![CDATA[Data locality in Hadoop]]></category>
		<category><![CDATA[data locality in spark]]></category>
		<category><![CDATA[data locality in yarn]]></category>
		<category><![CDATA[data locality nutanix]]></category>
		<category><![CDATA[data locality optimization in hadoop]]></category>
		<category><![CDATA[data localization in hadoop]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[distributed file system]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop cluster]]></category>
		<category><![CDATA[hadoop data partitioning]]></category>
		<category><![CDATA[hadoop database]]></category>
		<category><![CDATA[hadoop distributed file system]]></category>
		<category><![CDATA[hadoop ecosystem]]></category>
		<category><![CDATA[hadoop file system]]></category>
		<category><![CDATA[hadoop framework]]></category>
		<category><![CDATA[hadoop mapreduce]]></category>
		<category><![CDATA[hadoop optimization techniques]]></category>
		<category><![CDATA[hdfs architecture]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Importance of Data Locality]]></category>
		<category><![CDATA[Improving Data Processing Performance with Hadoop Data Locality]]></category>
		<category><![CDATA[in the local disk of the name node the files which are stored persistently are]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[Introduction to Data Locality in Hadoop MapReduce]]></category>
		<category><![CDATA[Job scheduling for optimizing data locality in Hadoop clusters]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[locality optimization in compiler design]]></category>
		<category><![CDATA[mapreduce data locality]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[rack awareness in hadoop]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what is big data and hadoop]]></category>
		<category><![CDATA[what is big data hadoop]]></category>
		<category><![CDATA[What is Data Locality]]></category>
		<category><![CDATA[what is data locality in hadoop]]></category>
		<category><![CDATA[What is Data Locality in HadoopWhat does the term 'data locality' mean in Hadoop]]></category>
		<category><![CDATA[What is Data locality optimization in hadoop]]></category>
		<category><![CDATA[what is data localization in hadoop]]></category>
		<category><![CDATA[what is hadoop]]></category>
		<category><![CDATA[what is hadoop used for]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[yarn hadoop]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=287</guid>

					<description><![CDATA[Answer : Datasets in HDFS store as blocks in DataNodes...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="why-do-we-need-data-locality-in-hadoop" class="color-pink" style="text-align: justify;">Why do we need Data Locality in Hadoop ?</h2>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/why-we-need-data-locality-in-hadoop.png" alt=" Data Locality in Hadoop " /></div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Datasets in <a href="https://www.wikitechy.com/tutorials/sqoop/sqoop-vs-hdfs" target="_blank" rel="noopener">HDFS</a> store as blocks in DataNodes the Hadoop cluster.</li>
<li>During the execution of a <a href="https://www.wikitechy.com/tutorials/hive/hive-mapreduce-hadoop-mapreduce" target="_blank" rel="noopener">MapReduce</a> job the individual Mapper processes the blocks (Input Splits).</li>
<li>If the data does not reside in the same node where the Mapper is executing the job, the data needs to be copied from the DataNode over the <a href="https://www.wikitechy.com/errors-and-fixes/sql/cluster-network-name-showing-netbios-status-as-the-system-cannot-find-the-file-specified" target="_blank" rel="noopener">network</a> to the mapper DataNode.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/data-locality-in-hadoop.gif" alt="Datasets in HDFS - Data Locality in Hadoop" /></div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Now if a MapReduce job has more than 100 Mapper and each Mapper tries to copy the data from other DataNode in the cluster simultaneously, it would cause serious network congestion which is a big performance issue of the overall system.</li>
<li>Hence, data proximity to the computation is an effective and cost-effective solution which is technically termed as <a href="https://www.wikitechy.com/interview-questions/hadoop/what-are-the-features-of-hadoop/" target="_blank" rel="noopener">Data locality in Hadoop</a>. It helps to increase the overall throughput of the system.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/mapreduce-job-data-locality.gif" alt=" " /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="types-of-data-locality" class="color-green">Types of data locality</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Data local</b>
<ul>
<li>In this type data and the mapper resides on the same node. This is the closest proximity of data and the most preferred scenario.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Rack Local</b>
<ul>
<li>In this type data and the mapper resides on the same node. This is the closest proximity of data and the most preferred scenario.</li>
<li>In this scenarios mapper and data reside on the same rack but on the different data nodes.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Different Rack</b>
<ul>
<li>In this scenario mapper and data reside on the different racks.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="ImageContent">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/types-of-data-locality.jpg" alt="Types of data locality" /></div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>What are the features of Hadoop ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/what-are-the-features-of-hadoop/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/what-are-the-features-of-hadoop/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 17:55:46 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[9 Features Of Hadoop]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[advantages of hadoop over data warehouse]]></category>
		<category><![CDATA[apache hadoop]]></category>
		<category><![CDATA[applications of hadoopa]]></category>
		<category><![CDATA[architecture of hadoop]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[big data hadoop]]></category>
		<category><![CDATA[bigdata and hadoop]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[characteristics of big data]]></category>
		<category><![CDATA[characteristics of hadoop]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[components of hadoop]]></category>
		<category><![CDATA[conclusion of hadoop]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[distributed file system]]></category>
		<category><![CDATA[dvantages of hadoop ecosystem]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[features of big data]]></category>
		<category><![CDATA[Features of Hadoop]]></category>
		<category><![CDATA[features of hdfs]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[functionalities of hadoop cluster]]></category>
		<category><![CDATA[functionality of hadoop cluster]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoop architecture]]></category>
		<category><![CDATA[hadoop certification]]></category>
		<category><![CDATA[hadoop commands]]></category>
		<category><![CDATA[hadoop ecosystem]]></category>
		<category><![CDATA[hadoop explained]]></category>
		<category><![CDATA[hadoop features and advantages]]></category>
		<category><![CDATA[hadoop hive]]></category>
		<category><![CDATA[hadoop mapreduce]]></category>
		<category><![CDATA[hadoop the definitive guide]]></category>
		<category><![CDATA[hdfs]]></category>
		<category><![CDATA[hdfs architecture]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[key distinctions of hadoop are mcq]]></category>
		<category><![CDATA[key features of big data]]></category>
		<category><![CDATA[Key Features of Hadoop]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[mapreduce]]></category>
		<category><![CDATA[mapreduce features]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[pig hadoop]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[special features of hadoop]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Top 10 Features of Big Data Hadoop]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[What are the main features of Hadoop]]></category>
		<category><![CDATA[What are the most important features of Hadoop]]></category>
		<category><![CDATA[what is big data hadoop]]></category>
		<category><![CDATA[what is hadoop]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[yarn architecture]]></category>
		<category><![CDATA[yarn hadoop]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=279</guid>

					<description><![CDATA[Answer : Hadoop supports the storage and processing of big data...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="what-are-the-features-of-hadoop" class="color-pink" style="text-align: justify;">What are the features of Hadoop ?</h2>
</div>
</div>
<p style="text-align: justify;">Hadoop supports the storage and processing of big data. It is the best solution for handling big data challenges. Some important features of Hadoop are –</p>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/what-are-the-features-of-hadoop.png" alt="What are the features of Hadoop" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="open-source" class="color-green">Open Source</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Hadoop is an <a href="https://www.wikitechy.com/technology/10-great-open-source-apps-for-android/" target="_blank" rel="noopener">open source</a> framework which means it is available free of cost.</li>
<li>Also, the users are allowed to change the source code as per their requirements.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="distributed-processing" class="color-green">Distributed Processing</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><a href="https://www.wikitechy.com/tutorials/apache-pig/apache-pig-tutorial/satellite-image-processing-using-hadoop.php" target="_blank" rel="noopener">Hadoop</a> supports distributed processing of data i.e. faster processing.</li>
<li>The data in Hadoop HDFS is stored in a distributed manner and MapReduce is responsible for the parallel processing of data.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="fault-tolerance" class="color-green">Fault Tolerance</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Hadoop is highly <a href="https://www.wikitechy.com/interview-questions/vsphere/what-is-vmware-fault-tolerance" target="_blank" rel="noopener">fault-tolerant</a>. It creates three replicas for each block at different nodes, by default.</li>
<li>This number can be changed according to the requirement. So, we can recover the data from another node if one node fails.</li>
<li>The detection of node failure and recovery of data is done automatically.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="reliability" class="color-green">Reliability</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Hadoop stores data on the cluster in a reliable manner that is independent of machine.</li>
<li>So, the data stored in Hadoop environment is not affected by the failure of the machine.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="scalability" class="color-green">Scalability</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Another important feature of Hadoop is the scalability. It is compatible with the other hardware and we can easily ass the new hardware to the nodes.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/scalibility-in-hadoop.gif" alt="Scalability in hadoop" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="economic" class="color-green">Economic</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Apache Hadoop is not very expensive as it runs on a cluster of commodity hardware.</li>
<li>Hadoop also provides huge cost saving also as it is very easy to add more nodes on the fly here. So if requirement increases, then you can increase nodes as well without any downtime and without requiring much of pre-planning.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/economic-in-hadoop.gif" alt="Economic in hadoop" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="easy-to-use" class="color-green">Easy to use</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>No need of client to deal with distributed computing, the framework takes care of all the things. So this feature of Hadoop is easy to use.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="data-locality" class="color-green">Data Locality</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>This one is a unique features of Hadoop that made it easily handle the Big Data. Hadoop works on data locality principle which states that move computation to data instead of data to computation.</li>
<li>When a client submits the MapReduce algorithm, this algorithm is moved to data in the cluster rather than bringing data to the location where the algorithm is submitted and then processing it.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/data-locality-in-hadoop.gif" alt="Data Locality " /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="high-availability" class="color-green">High Availability</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>The data stored in Hadoop is available to access even after the hardware failure. In case of hardware failure, the data can be accessed from another path.</li>
</ul>
</div>
</div>
<div class="ImageContent">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/high-availability-in-hadoop.gif" alt="High Availability " /></div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/what-are-the-features-of-hadoop/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>What is Big Data ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/what-is-big-data/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/what-is-big-data/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 16:21:38 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[Altimetrik India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[big data]]></category>
		<category><![CDATA[big data analysis course]]></category>
		<category><![CDATA[big data analytics]]></category>
		<category><![CDATA[big data analytics courses]]></category>
		<category><![CDATA[big data analytics examples]]></category>
		<category><![CDATA[big data analytics tools]]></category>
		<category><![CDATA[big data analytics tutorial]]></category>
		<category><![CDATA[big data and analytics]]></category>
		<category><![CDATA[big data applications]]></category>
		<category><![CDATA[big data architecture]]></category>
		<category><![CDATA[big data definition]]></category>
		<category><![CDATA[big data example]]></category>
		<category><![CDATA[big data examples]]></category>
		<category><![CDATA[big data implementation examples]]></category>
		<category><![CDATA[big data management]]></category>
		<category><![CDATA[big data platform]]></category>
		<category><![CDATA[big data practical example]]></category>
		<category><![CDATA[big data processing]]></category>
		<category><![CDATA[big data solutions]]></category>
		<category><![CDATA[big data tutorial]]></category>
		<category><![CDATA[big data what is it]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[characteristics of big data]]></category>
		<category><![CDATA[Collabera Technologies interview questions and answers]]></category>
		<category><![CDATA[cool big data examples]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[examples of big data in healthcare]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[JPMorgan Chase & Co interview questions and answers]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[Oracle Corporation interview questions and answers]]></category>
		<category><![CDATA[Prokarma Softech Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[que es big data]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Reliance Industries Ltd interview questions and answers]]></category>
		<category><![CDATA[Sapient Consulting Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[show big is big data]]></category>
		<category><![CDATA[Synechron Te interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[types of big data]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what is big data]]></category>
		<category><![CDATA[what is big data analytics]]></category>
		<category><![CDATA[what is big data and hadoop]]></category>
		<category><![CDATA[what is big data concept]]></category>
		<category><![CDATA[what is big data definition]]></category>
		<category><![CDATA[what is big data hadoop]]></category>
		<category><![CDATA[what is big data in marketing]]></category>
		<category><![CDATA[what is big data technology]]></category>
		<category><![CDATA[what is big data used for]]></category>
		<category><![CDATA[what is considered big data]]></category>
		<category><![CDATA[what is meant by big data]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Yodlee Infotech Pvt Ltd interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=256</guid>

					<description><![CDATA[Answer : Big Data is a term associated with complex...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="what-is-big-data" class="color-pink" style="text-align: justify;">What is Big Data ?</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Big Data is a term associated with complex and large datasets.</li>
<li>A <a href="https://www.wikitechy.com/tutorials/sqoop/sqoop-import-the-results-of-a-query-from-a-relational-database-into-hdfs" target="_blank" rel="noopener">relational database</a> cannot handle big data, and that’s why special tools and methods are used to perform operations on a vast collection of data.</li>
<li>Big data enables companies to understand their business better and helps them derive meaningful information from the unstructured and raw data collected on a regular basis.</li>
<li>Big data also allows companies to make better business decisions backed by data.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block" src="https://cdn.wikitechy.com/interview-questions/big-data/what-is-big-data.gif" alt="What is Big Data" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="characteristics-of-big-data" class="color-purple">Characteristics Of Big Data</h2>
</div>
</div>
<div class="Content">
<div class="hddn">
<ul>
<li style="text-align: justify;"><b>Volume</b> &#8211; The Size of data plays a very essential role in determining value out of data.</li>
<li style="text-align: justify;"><b>Variety</b> &#8211; The next aspect of Big Data is its variety. Variety refers to heterogeneous sources and the nature of data, both structured and unstructured.</li>
<li style="text-align: justify;"><b>Velocity</b> &#8211; The term &#8216;velocity&#8217; refers to the speed of generation of data.</li>
<li style="text-align: justify;"><b>Variability</b> &#8211; The process of being able to handle and manage the data effectively.</li>
</ul>
</div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/what-is-big-data/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
