<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>what is hadoop used for - Wikitechy</title>
	<atom:link href="https://www.wikitechy.com/interview-questions/tag/what-is-hadoop-used-for/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.wikitechy.com/interview-questions/tag/what-is-hadoop-used-for/</link>
	<description>Interview Questions</description>
	<lastBuildDate>Wed, 22 Sep 2021 05:53:05 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://www.wikitechy.com/interview-questions/wp-content/uploads/2025/10/cropped-wikitechy-icon-32x32.png</url>
	<title>what is hadoop used for - Wikitechy</title>
	<link>https://www.wikitechy.com/interview-questions/tag/what-is-hadoop-used-for/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>What is the difference between Apache Hadoop and Cloudera in big data ?</title>
		<link>https://www.wikitechy.com/interview-questions/cloudera-impala/what-is-the-difference-between-apache-hadoop-and-cloudera-in-big-data/</link>
					<comments>https://www.wikitechy.com/interview-questions/cloudera-impala/what-is-the-difference-between-apache-hadoop-and-cloudera-in-big-data/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 19 Jul 2021 06:38:24 +0000</pubDate>
				<category><![CDATA[Cloudera Impala]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[Agreeya Solutions interview questions and answers]]></category>
		<category><![CDATA[apache hadoop vs cloudera]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[big data analytics stages]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[cloudera admin interview questions]]></category>
		<category><![CDATA[cloudera hadoop distribution]]></category>
		<category><![CDATA[cloudera hadoop download]]></category>
		<category><![CDATA[cloudera hiring process]]></category>
		<category><![CDATA[cloudera manager interview questions]]></category>
		<category><![CDATA[cloudera questions]]></category>
		<category><![CDATA[cloudera software engineer interview questions]]></category>
		<category><![CDATA[cloudera vs hortonworks 2016]]></category>
		<category><![CDATA[cloudera vs hortonworks 2017]]></category>
		<category><![CDATA[cloudera vs hortonworks certification]]></category>
		<category><![CDATA[difference between bigdata and hadoop]]></category>
		<category><![CDATA[does mapr supports streaming data ingestion]]></category>
		<category><![CDATA[FIS Global Business Solutions India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop admin and cloudera certification interview question]]></category>
		<category><![CDATA[hadoop distributions comparison]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[IHS Markit interview questions and answers]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[interview questions on cloudera hadoop]]></category>
		<category><![CDATA[Iris Software interview questions and answers]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[Nagarro Software Pvt. Ltd interview questions and answers]]></category>
		<category><![CDATA[Prokarma Softech Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Sapient Consulting Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Standard Chartered Global Business Services Private Limited interview questions and answers]]></category>
		<category><![CDATA[Synechron Technologies Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[ValueLabs interview questions and answers]]></category>
		<category><![CDATA[what is big data and hadoop]]></category>
		<category><![CDATA[what is big data hadoop]]></category>
		<category><![CDATA[what is cloudera]]></category>
		<category><![CDATA[what is cloudera hadoop]]></category>
		<category><![CDATA[what is hadoop technology]]></category>
		<category><![CDATA[what is hadoop used for]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=1023</guid>

					<description><![CDATA[Answer : Apache Hadoop is the Hadoop distribution from the Apache Software Foundation...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="difference-between-apache-hadoop-and-cloudera-in-big-data" class="color-purple" style="text-align: justify;">Difference between Apache Hadoop and Cloudera in big data</h2>
</div>
</div>
<div class="row">
<div class="col-sm-12">
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Apache Hadoop is the Hadoop distribution from the Apache Software Foundation.</li>
<li>Cloudera ships its own distribution of Hadoop, built on top of Apache Hadoop, so it does not always include the latest Apache Hadoop release.</li>
<li>Cloudera's distribution contains extra tools such as Cloudera Search, Impala, Cloudera Navigator and Cloudera Manager, which are not part of Apache Hadoop.</li>
<li>These additional tools make Cloudera's distribution a mix of open-source and proprietary software.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img fetchpriority="high" decoding="async" class="aligncenter size-medium" src="https://cdn.wikitechy.com/interview-questions/cloudera impala/distrbution-hadoop.png" alt="Distribution hadoop" width="768" height="315" /></div>
</div>
<div class="Content">
<div class="hddn">
<ul>
<li style="text-align: justify;">Apache Hadoop helps you cope with data of almost any size, type, or complexity. It is often a faster, cheaper, and more reliable analytics tool than traditional alternatives.</li>
<li style="text-align: justify;">It also works well with unstructured and schemaless data, so data can be loaded in its existing format without reformatting or other extra effort.</li>
<li style="text-align: justify;">In addition, Apache Hadoop offers a simplified programming model that lets you quickly write and test software for distributed systems.</li>
<li style="text-align: justify;">Cloudera's distribution builds on open-source Apache Hadoop and usually focuses on enterprise-class deployments.</li>
<li style="text-align: justify;">Cloudera Inc. is an American software company known for providing Apache Hadoop-based software, services, and support.</li>
<li style="text-align: justify;">Cloudera positions Hadoop as just the starting point of a data management strategy, on top of which you can add security and other functions to build an enterprise-grade foundation for your data.</li>
</ul>
</div>
</div>
</div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/cloudera-impala/what-is-the-difference-between-apache-hadoop-and-cloudera-in-big-data/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>What is the use of hive language ?</title>
		<link>https://www.wikitechy.com/interview-questions/hive/what-is-the-use-of-hive-language/</link>
					<comments>https://www.wikitechy.com/interview-questions/hive/what-is-the-use-of-hive-language/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Tue, 13 Jul 2021 21:20:39 +0000</pubDate>
				<category><![CDATA[Hive]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[Altimetrik India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[ANI Technologies Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[apache hive]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologies interview questions and answers]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop pig hive]]></category>
		<category><![CDATA[hadoop query language]]></category>
		<category><![CDATA[hive architecture in hadoop]]></category>
		<category><![CDATA[hive documentation]]></category>
		<category><![CDATA[hive query based interview questions]]></category>
		<category><![CDATA[hive scenario based interview questions]]></category>
		<category><![CDATA[how does hive work]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Impetus Technologies interview questions and answers]]></category>
		<category><![CDATA[Indiabulls Technology Solutions Ltd interview questions and answers]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[Prokarma Softech Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[Reliance Industries Ltd interview questions and answers]]></category>
		<category><![CDATA[Synechron Te interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what do hives look like]]></category>
		<category><![CDATA[what does hive stand for]]></category>
		<category><![CDATA[what is hadoop used for]]></category>
		<category><![CDATA[what is hives means]]></category>
		<category><![CDATA[what to use for hives]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Yash Technologies interview questions and answers]]></category>
		<category><![CDATA[Yodlee Infotech Pvt Ltd interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=564</guid>

					<description><![CDATA[Answer : Hive is often used as the interface to an Apache Hadoop based data warehouse]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="uses-of-hive-language" class="color-green">Uses of Hive language</h2>
</div>
</div>
<div class="Content">
<div class="hddn">
<ul>
<li><b>Hive</b> is often <b>used</b> as the interface to an Apache Hadoop based data warehouse.</li>
<li><b>Hive</b> is considered friendlier and more familiar to users who are <b>used</b> to using SQL for querying data.</li>
<li>It is a platform for writing SQL-like (HiveQL) scripts that are executed as MapReduce operations.</li>
</ul>
</div>
</div>
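<p>As a rough illustration of how a SQL-style query relates to MapReduce, the sketch below runs a <code>GROUP BY</code> count in two ways: once as SQL and once as a hand-written map and reduce step. It uses Python's built-in <code>sqlite3</code> purely as a stand-in for Hive, and the <code>employees</code> table and its rows are hypothetical. It is not how Hive itself generates jobs.</p>

```python
import sqlite3
from collections import Counter

# Hypothetical rows of an "employees" table: (name, dept)
ROWS = [("asha", "sales"), ("ben", "eng"), ("chen", "eng"),
        ("dev", "sales"), ("eli", "eng")]

def count_with_sql(rows):
    """Count employees per department with a SQL query (sqlite3 stands in for Hive)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE employees (name TEXT, dept TEXT)")
    conn.executemany("INSERT INTO employees VALUES (?, ?)", rows)
    result = dict(conn.execute(
        "SELECT dept, COUNT(*) FROM employees GROUP BY dept"))
    conn.close()
    return result

def count_with_map_reduce(rows):
    """The same count written by hand as a map step and a reduce step."""
    pairs = [(dept, 1) for _name, dept in rows]   # map: emit (dept, 1)
    totals = Counter()
    for dept, one in pairs:                       # reduce: sum per key
        totals[dept] += one
    return dict(totals)

print(count_with_sql(ROWS))
print(count_with_map_reduce(ROWS))
```

<p>Both functions return the same counts. The SQL version states what is wanted in one line, which is the convenience Hive offers to users who already know SQL.</p>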
<div class="ImageContent">
<div class="hddn"><img decoding="async" class="aligncenter size-medium" src="https://cdn.wikitechy.com/interview-questions/hive/why-is-hive-language-used-for.png" alt="why is hive language used for" width="917" height="519" /></div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/hive/what-is-the-use-of-hive-language/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Why do we need Data Locality in Hadoop ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 18:21:39 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[3 data locality]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[apache hadoop]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[azure hadoop]]></category>
		<category><![CDATA[big data hadoop]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[data flow in mapreduces]]></category>
		<category><![CDATA[data locality]]></category>
		<category><![CDATA[data locality c++]]></category>
		<category><![CDATA[data locality definition]]></category>
		<category><![CDATA[data locality in cloud computing]]></category>
		<category><![CDATA[Data locality in Hadoop]]></category>
		<category><![CDATA[data locality in spark]]></category>
		<category><![CDATA[data locality in yarn]]></category>
		<category><![CDATA[data locality nutanix]]></category>
		<category><![CDATA[data locality optimization in hadoop]]></category>
		<category><![CDATA[data localization in hadoop]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[distributed file system]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop cluster]]></category>
		<category><![CDATA[hadoop data partitioning]]></category>
		<category><![CDATA[hadoop database]]></category>
		<category><![CDATA[hadoop distributed file system]]></category>
		<category><![CDATA[hadoop ecosystem]]></category>
		<category><![CDATA[hadoop file system]]></category>
		<category><![CDATA[hadoop framework]]></category>
		<category><![CDATA[hadoop mapreduce]]></category>
		<category><![CDATA[hadoop optimization techniques]]></category>
		<category><![CDATA[hdfs architecture]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Importance of Data Locality]]></category>
		<category><![CDATA[Improving Data Processing Performance with Hadoop Data Locality]]></category>
		<category><![CDATA[in the local disk of the name node the files which are stored persistently are]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[Introduction to Data Locality in Hadoop MapReduce]]></category>
		<category><![CDATA[Job scheduling for optimizing data locality in Hadoop clusters]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[locality optimization in compiler design]]></category>
		<category><![CDATA[mapreduce data locality]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[rack awareness in hadoop]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what is big data and hadoop]]></category>
		<category><![CDATA[what is big data hadoop]]></category>
		<category><![CDATA[What is Data Locality]]></category>
		<category><![CDATA[what is data locality in hadoop]]></category>
		<category><![CDATA[What is Data Locality in HadoopWhat does the term 'data locality' mean in Hadoop]]></category>
		<category><![CDATA[What is Data locality optimization in hadoop]]></category>
		<category><![CDATA[what is data localization in hadoop]]></category>
		<category><![CDATA[what is hadoop]]></category>
		<category><![CDATA[what is hadoop used for]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[yarn hadoop]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=287</guid>

					<description><![CDATA[Answer : Datasets in HDFS are stored as blocks on DataNodes...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="why-do-we-need-data-locality-in-hadoop" class="color-pink" style="text-align: justify;">Why do we need Data Locality in Hadoop ?</h2>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/why-we-need-data-locality-in-hadoop.png" alt=" Data Locality in Hadoop " /></div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Datasets in <a href="https://www.wikitechy.com/tutorials/sqoop/sqoop-vs-hdfs" target="_blank" rel="noopener">HDFS</a> are stored as blocks on the DataNodes of the Hadoop cluster.</li>
<li>During the execution of a <a href="https://www.wikitechy.com/tutorials/hive/hive-mapreduce-hadoop-mapreduce" target="_blank" rel="noopener">MapReduce</a> job, each Mapper processes the blocks (input splits) assigned to it.</li>
<li>If the data does not reside on the node where the Mapper is running, it must be copied over the <a href="https://www.wikitechy.com/errors-and-fixes/sql/cluster-network-name-showing-netbios-status-as-the-system-cannot-find-the-file-specified" target="_blank" rel="noopener">network</a> from the DataNode that holds it to the node running the Mapper.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/data-locality-in-hadoop.gif" alt="Datasets in HDFS - Data Locality in Hadoop" /></div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>If a MapReduce job has more than 100 Mappers and each Mapper tries to copy its data from another DataNode in the cluster at the same time, the result is serious network congestion, which hurts the performance of the whole system.</li>
<li>Hence, keeping the computation close to the data is an effective and inexpensive solution, technically termed <a href="https://www.wikitechy.com/interview-questions/hadoop/what-are-the-features-of-hadoop/" target="_blank" rel="noopener">Data locality in Hadoop</a>. It helps to increase the overall throughput of the system.</li>
</ul>
</div>
</div>
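<p style="text-align: justify;">To get a feel for the congestion argument above, here is a back-of-the-envelope sketch. It assumes every Mapper reads exactly one block and that blocks are 128 MB. Both are illustrative assumptions, and the function name is hypothetical.</p>

```python
def network_traffic_mb(num_mappers, num_local, block_mb=128):
    """Estimate the MB copied over the network by mappers that are not data-local.

    Assumes each mapper reads exactly one block of block_mb megabytes.
    """
    remote_mappers = num_mappers - num_local
    return remote_mappers * block_mb

# 100 mappers, none of them data-local: every block crosses the network.
print(network_traffic_mb(100, 0))
# 100 mappers, 90 of them data-local: only 10 blocks cross the network.
print(network_traffic_mb(100, 90))
```

<p style="text-align: justify;">Under these assumptions, scheduling 90 of the 100 Mappers next to their data cuts the copied volume from 12800 MB to 1280 MB.</p>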
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/mapreduce-job-data-locality.gif" alt=" " /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="types-of-data-locality" class="color-green">Types of data locality</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Data local</b>
<ul>
<li>In this type, the data and the mapper reside on the same node. This is the closest proximity of data and the most preferred scenario.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Rack Local</b>
<ul>
<li>In this scenario, the mapper and the data reside on the same rack but on different data nodes.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Different Rack</b>
<ul>
<li>In this scenario, the mapper and the data reside on different racks.</li>
</ul>
</li>
</ul>
</div>
</div>
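<p style="text-align: justify;">The three types above can be sketched as a small classifier. The topology map, node names, and function names are hypothetical, and real Hadoop schedulers weigh more factors than this. The sketch only shows the order of preference.</p>

```python
# Hypothetical cluster topology: node name -> rack name
TOPOLOGY = {"node1": "rackA", "node2": "rackA", "node3": "rackB"}

# Locality types in order of preference, best first
PREFERENCE = ["data local", "rack local", "different rack"]

def locality(mapper_node, data_node, topology=TOPOLOGY):
    """Classify how close a mapper is to the block it has to read."""
    if mapper_node == data_node:
        return "data local"
    if topology[mapper_node] == topology[data_node]:
        return "rack local"
    return "different rack"

def pick_node(candidates, data_node, topology=TOPOLOGY):
    """Pick the candidate node with the best locality for the given block."""
    return min(
        candidates,
        key=lambda node: PREFERENCE.index(locality(node, data_node, topology)),
    )

print(locality("node1", "node1"))
print(locality("node2", "node1"))
print(locality("node3", "node1"))
print(pick_node(["node3", "node2"], "node1"))
```

<p style="text-align: justify;">With the block on <code>node1</code> and only <code>node2</code> and <code>node3</code> free, the sketch picks <code>node2</code>, because a rack-local read is preferred over a read from a different rack.</p>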
<div class="ImageContent">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/types-of-data-locality.jpg" alt="Types of data locality" /></div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Why Hadoop used for Big Data Analytics ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/why-hadoop-used-for-big-data-analytics/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/why-hadoop-used-for-big-data-analytics/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 17:16:29 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[big data analytics]]></category>
		<category><![CDATA[big data hadoop]]></category>
		<category><![CDATA[big data hadoop certification]]></category>
		<category><![CDATA[big data hadoop tutorial]]></category>
		<category><![CDATA[big data notes]]></category>
		<category><![CDATA[big data toolshow big data and hadoop are linked]]></category>
		<category><![CDATA[big data tutorial]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[difference between big data and data science]]></category>
		<category><![CDATA[difference between big data and hadoop]]></category>
		<category><![CDATA[difference between hadoop and spark]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop architecture]]></category>
		<category><![CDATA[hadoop as big data solution]]></category>
		<category><![CDATA[hadoop database]]></category>
		<category><![CDATA[hadoop example]]></category>
		<category><![CDATA[hadoop modules]]></category>
		<category><![CDATA[hadoop storage]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what is big data]]></category>
		<category><![CDATA[what is hadoop used for]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=269</guid>

					<description><![CDATA[Answer : Big data analytics is the process of examining large data...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="why-hadoop-used-for-big-data-analytics" class="color-pink" style="text-align: justify;">Why Hadoop used for Big Data Analytics ?</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><a href="https://www.wikitechy.com/interview-questions/hadoop/what-is-big-data/" target="_blank" rel="noopener">Big data</a> analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations, market trends, customer preferences and other useful business information.</li>
<li>Hadoop is a framework to store and process big data. Hadoop is specifically designed to provide the distributed storage and parallel data processing that big data requires.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="hadoop-is-the-best-solution-for-storing-and-processing-big-data-because" class="color-blue">Hadoop is the best solution for storing and processing big data because:</h2>
</div>
</div>
<p style="text-align: justify;">Hadoop stores huge files as they are (raw) without specifying any schema.</p>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>High scalability</b> &#8211; Nodes can be added to the cluster as needed, increasing storage capacity and processing performance.</li>
<li><b>High availability</b> &#8211; In <a href="https://www.wikitechy.com/interview-questions/apache-pig/what-is-the-advantages-of-pig-in-hadoop/" target="_blank" rel="noopener">hadoop</a> data is highly available despite hardware failure. If a machine or a few hardware components crash, the data can still be accessed from another path.</li>
<li><b>Reliable</b> &#8211; Data is reliably stored on the cluster despite machine failures.</li>
<li><b>Economic</b> &#8211; Hadoop runs on a cluster of commodity hardware, which is not very expensive.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="what-is-hadoop" class="color-purple">What is Hadoop ?</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><a href="https://www.wikitechy.com/interview-questions/apache-pig/what-is-the-difference-between-pig-hive-and-mapreduce" target="_blank" rel="noopener">Hadoop</a> is an open source project from Apache Software Foundation.</li>
<li>It provides a software framework for distributing and running applications on clusters of servers, inspired by Google’s Map-Reduce programming model and the Google File System (GFS).</li>
<li>Hadoop was originally written for the Nutch search engine project.</li>
<li>Hadoop is an open source framework written in Java. It efficiently processes large volumes of data on a cluster of commodity hardware.</li>
<li>Hadoop can be set up on a single machine, but its real power comes with a cluster of machines: it can be scaled from a single machine to thousands of nodes. Hadoop consists of two key parts,
<ul>
<li>Hadoop Distributed File System (HDFS)</li>
<li>Map-Reduce.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/hadoop-overview.png" alt="Hadoop Overview" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="hadoop-distributed-file-systemhdfs" class="color-blue">Hadoop Distributed File System(HDFS)</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>HDFS is a highly fault tolerant, distributed, reliable, scalable file system for data storage.</li>
<li>HDFS stores multiple copies of data on different nodes; a file is split up into blocks (64 MB by default in Hadoop 1.x, 128 MB in Hadoop 2.x and later) and stored across multiple machines.</li>
<li>A Hadoop cluster typically has a single NameNode and a number of DataNodes that together form the HDFS cluster.</li>
</ul>
</div>
</div>
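<p style="text-align: justify;">The splitting and replication described above can be sketched in a few lines. The 64 MB block size comes from the text, and the replication factor of 3 is the usual HDFS default. The round-robin placement and the DataNode names are simplifying assumptions, since real HDFS placement is rack-aware.</p>

```python
def split_into_blocks(file_size_mb, block_mb=64):
    """Return the sizes of the blocks a file of file_size_mb is split into."""
    full_blocks, remainder = divmod(file_size_mb, block_mb)
    sizes = [block_mb] * full_blocks
    if remainder:
        sizes.append(remainder)  # the last block may be smaller
    return sizes

def place_replicas(num_blocks, nodes, replication=3):
    """Assign each block to `replication` nodes, round-robin.

    Assumes len(nodes) is at least `replication`, so the replicas of one
    block land on distinct nodes. This is NOT the real HDFS policy.
    """
    return [
        [nodes[(block + r) % len(nodes)] for r in range(replication)]
        for block in range(num_blocks)
    ]

blocks = split_into_blocks(200)
print(blocks)
print(place_replicas(len(blocks), ["dn1", "dn2", "dn3", "dn4"]))
```

<p style="text-align: justify;">A 200 MB file becomes three full 64 MB blocks plus one 8 MB block, and each block is held by three DataNodes, so losing any single node leaves every block readable.</p>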
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="map-reduce" class="color-blue">Map-Reduce</h2>
</div>
</div>
<div class="Content">
<div class="hddn">
<ul>
<li style="text-align: justify;">Map-Reduce is a programming model designed for processing large volumes of data in parallel by dividing the work into a set of independent tasks.</li>
<li style="text-align: justify;">It is also a paradigm for distributed processing of large data sets over a cluster of nodes.</li>
</ul>
</div>
</div>
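<p style="text-align: justify;">A minimal word-count sketch of the model, run on one machine: each input split is handled by an independent map task, the intermediate pairs are grouped by key, and a reduce task sums each group. The function names are hypothetical and a thread pool stands in for the cluster. This is not the Hadoop API.</p>

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor

def map_task(split):
    """Map: emit a (word, 1) pair for every word in one input split."""
    return [(word, 1) for word in split.split()]

def shuffle(mapped):
    """Shuffle: group all emitted values by key across the map outputs."""
    groups = defaultdict(list)
    for pairs in mapped:
        for key, value in pairs:
            groups[key].append(value)
    return groups

def reduce_task(key, values):
    """Reduce: combine all values for one key into a single result."""
    return key, sum(values)

def word_count(splits):
    # The map tasks are independent, so they can run in parallel.
    with ThreadPoolExecutor() as pool:
        mapped = list(pool.map(map_task, splits))
    return dict(reduce_task(k, v) for k, v in shuffle(mapped).items())

print(word_count(["big data big", "data hadoop"]))
```

<p style="text-align: justify;">Because no map task depends on another, adding more splits and more workers scales the map phase without changing the code, which is the property the model is built around.</p>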
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/why-hadoop-used-for-big-data-analytics/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
