<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>hadoop framework - Wikitechy</title>
	<atom:link href="https://www.wikitechy.com/interview-questions/tag/hadoop-framework/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.wikitechy.com/interview-questions/tag/hadoop-framework/</link>
	<description>Interview Questions</description>
	<lastBuildDate>Wed, 22 Sep 2021 05:53:05 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9</generator>

<image>
	<url>https://www.wikitechy.com/interview-questions/wp-content/uploads/2025/10/cropped-wikitechy-icon-32x32.png</url>
	<title>hadoop framework - Wikitechy</title>
	<link>https://www.wikitechy.com/interview-questions/tag/hadoop-framework/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Difference between nfs and hdfs ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/difference-between-nfs-and-hdfs/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/difference-between-nfs-and-hdfs/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 18:26:28 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[advantages of nfs and afs]]></category>
		<category><![CDATA[afs vs nfs in distributed system]]></category>
		<category><![CDATA[apache hdfs]]></category>
		<category><![CDATA[apache hdfs nfs]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[compare between nfs and afs]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[dfs vs nfs]]></category>
		<category><![CDATA[difference between dfs and hdfs]]></category>
		<category><![CDATA[difference between distributed file system and hadoop distributed file system]]></category>
		<category><![CDATA[difference between hadoop and hdfs]]></category>
		<category><![CDATA[difference between hdfs dfs -ls and hdfs dfs -ls]]></category>
		<category><![CDATA[difference between nas and hdfs]]></category>
		<category><![CDATA[difference between nfs and coda file system]]></category>
		<category><![CDATA[difference between nfs and dfs]]></category>
		<category><![CDATA[difference between nfs and hdfs]]></category>
		<category><![CDATA[difference between normal file system and hdfs]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[ewhat is nfs]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop framework]]></category>
		<category><![CDATA[hadoop fs example]]></category>
		<category><![CDATA[hadoop nfs performance]]></category>
		<category><![CDATA[hdfs architecture]]></category>
		<category><![CDATA[hdfs nfs example]]></category>
		<category><![CDATA[hdfs vs nfs performance]]></category>
		<category><![CDATA[hdfs vs ntfs]]></category>
		<category><![CDATA[how does hdfs work]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[nfs and afs difference]]></category>
		<category><![CDATA[nfs export point]]></category>
		<category><![CDATA[nfs file system]]></category>
		<category><![CDATA[nfs vs afs differenc]]></category>
		<category><![CDATA[nfs vs afs difference]]></category>
		<category><![CDATA[nfs vs hdfs]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[the hadoop distributed file system]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[understanding hdfs nfs]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what is hdfs replication factor]]></category>
		<category><![CDATA[what is nfs]]></category>
		<category><![CDATA[where is hdfs used]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=289</guid>

					<description><![CDATA[Answer : NFS (Network File System) is one of the oldest and popular...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="difference-between-nfs-and-hdfs" class="color-pink" style="text-align: justify;">Difference between nfs and hdfs ?</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>NFS (Network File System) is one of the oldest and popular distributed file storage systems. Whereas HDFS (Hadoop Distributed File System) is the recently used and popular one to handle big data.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/difference-between-nfs-and-hdfs.png" alt="Difference between nfs and hdfs" /></div>
</div>
<div class="hddn">
<table class="table-bordered table-striped table table-responsive">
<tbody>
<tr style="text-align: justify;">
<th>CRITERIA</th>
<th>NFS</th>
<th>HDFS</th>
</tr>
<tr style="text-align: justify;">
<td class="text-leftalign">Data Size Support</td>
<td class="text-leftalign"><a href="https://www.wikitechy.com/interview-questions/mapr/what-is-maprs-direct-access-nfs/" target="_blank" rel="noopener">NFS</a> can store and process small<br />
amount of data.</td>
<td class="text-leftalign">HDFS is mainly use to store and process big data.</td>
</tr>
<tr style="text-align: justify;">
<td class="text-leftalign">Data Storage</td>
<td class="text-leftalign">Data is stored on a single dedicated<br />
hardware.</td>
<td class="text-leftalign">The data blocks are distributed on the local drives of hardware.</td>
</tr>
<tr style="text-align: justify;">
<td class="text-leftalign">Reliability</td>
<td class="text-leftalign">No reliability, data is not available<br />
in the case of machine failure.</td>
<td class="text-leftalign">Data is stored reliably, data is available even after machine failure.</td>
</tr>
<tr style="text-align: justify;">
<td class="text-leftalign">Data Redundancy</td>
<td class="text-leftalign">NFS runs on single machine,<br />
no chances of data redundancy.</td>
<td class="text-leftalign">HDFS runs on a cluster of different machines, data redundancy may occur due to replication protocol.</td>
</tr>
<tr style="text-align: justify;">
<td class="text-leftalign">Target Users</td>
<td class="text-leftalign">Workgroup.</td>
<td class="text-leftalign">Larger than AFS.</td>
</tr>
<tr style="text-align: justify;">
<td class="text-leftalign">Domain</td>
<td class="text-leftalign">Single Domain.</td>
<td class="text-leftalign">Multi Domain.</td>
</tr>
<tr style="text-align: justify;">
<td class="text-leftalign">Client Server Trust</td>
<td class="text-leftalign">Client identity is trusted by default.</td>
<td class="text-leftalign">Client identity is what os tells. No Kerberos Auth.</td>
</tr>
<tr>
<td class="text-leftalign" style="text-align: justify;">Compatability with O/S</td>
<td class="text-leftalign" style="text-align: justify;">Same System calls as of O/S.</td>
<td class="text-leftalign" style="text-align: justify;">Different Calls.Mainly used for non interactive programs.</td>
</tr>
</tbody>
</table>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/difference-between-nfs-and-hdfs/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Why do we need Data Locality in Hadoop ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 18:21:39 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[3 data locality]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[apache hadoop]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[azure hadoop]]></category>
		<category><![CDATA[big data hadoop]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[data flow in mapreduces]]></category>
		<category><![CDATA[data locality]]></category>
		<category><![CDATA[data locality c++]]></category>
		<category><![CDATA[data locality definition]]></category>
		<category><![CDATA[data locality in cloud computing]]></category>
		<category><![CDATA[Data locality in Hadoop]]></category>
		<category><![CDATA[data locality in spark]]></category>
		<category><![CDATA[data locality in yarn]]></category>
		<category><![CDATA[data locality nutanix]]></category>
		<category><![CDATA[data locality optimization in hadoop]]></category>
		<category><![CDATA[data localization in hadoop]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[distributed file system]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop cluster]]></category>
		<category><![CDATA[hadoop data partitioning]]></category>
		<category><![CDATA[hadoop database]]></category>
		<category><![CDATA[hadoop distributed file system]]></category>
		<category><![CDATA[hadoop ecosystem]]></category>
		<category><![CDATA[hadoop file system]]></category>
		<category><![CDATA[hadoop framework]]></category>
		<category><![CDATA[hadoop mapreduce]]></category>
		<category><![CDATA[hadoop optimization techniques]]></category>
		<category><![CDATA[hdfs architecture]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Importance of Data Locality]]></category>
		<category><![CDATA[Improving Data Processing Performance with Hadoop Data Locality]]></category>
		<category><![CDATA[in the local disk of the name node the files which are stored persistently are]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[Introduction to Data Locality in Hadoop MapReduce]]></category>
		<category><![CDATA[Job scheduling for optimizing data locality in Hadoop clusters]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[locality optimization in compiler design]]></category>
		<category><![CDATA[mapreduce data locality]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[rack awareness in hadoop]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what is big data and hadoop]]></category>
		<category><![CDATA[what is big data hadoop]]></category>
		<category><![CDATA[What is Data Locality]]></category>
		<category><![CDATA[what is data locality in hadoop]]></category>
		<category><![CDATA[What is Data Locality in HadoopWhat does the term 'data locality' mean in Hadoop]]></category>
		<category><![CDATA[What is Data locality optimization in hadoop]]></category>
		<category><![CDATA[what is data localization in hadoop]]></category>
		<category><![CDATA[what is hadoop]]></category>
		<category><![CDATA[what is hadoop used for]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[yarn hadoop]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=287</guid>

					<description><![CDATA[Answer : Datasets in HDFS store as blocks in DataNodes...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="why-do-we-need-data-locality-in-hadoop" class="color-pink" style="text-align: justify;">Why do we need Data Locality in Hadoop ?</h2>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/why-we-need-data-locality-in-hadoop.png" alt=" Data Locality in Hadoop " /></div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Datasets in <a href="https://www.wikitechy.com/tutorials/sqoop/sqoop-vs-hdfs" target="_blank" rel="noopener">HDFS</a> store as blocks in DataNodes the Hadoop cluster.</li>
<li>During the execution of a <a href="https://www.wikitechy.com/tutorials/hive/hive-mapreduce-hadoop-mapreduce" target="_blank" rel="noopener">MapReduce</a> job the individual Mapper processes the blocks (Input Splits).</li>
<li>If the data does not reside in the same node where the Mapper is executing the job, the data needs to be copied from the DataNode over the <a href="https://www.wikitechy.com/errors-and-fixes/sql/cluster-network-name-showing-netbios-status-as-the-system-cannot-find-the-file-specified" target="_blank" rel="noopener">network</a> to the mapper DataNode.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/data-locality-in-hadoop.gif" alt="Datasets in HDFS - Data Locality in Hadoop" /></div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Now if a MapReduce job has more than 100 Mapper and each Mapper tries to copy the data from other DataNode in the cluster simultaneously, it would cause serious network congestion which is a big performance issue of the overall system.</li>
<li>Hence, data proximity to the computation is an effective and cost-effective solution which is technically termed as <a href="https://www.wikitechy.com/interview-questions/hadoop/what-are-the-features-of-hadoop/" target="_blank" rel="noopener">Data locality in Hadoop</a>. It helps to increase the overall throughput of the system.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/mapreduce-job-data-locality.gif" alt=" " /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="types-of-data-locality" class="color-green">Types of data locality</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Data local</b>
<ul>
<li>In this type data and the mapper resides on the same node. This is the closest proximity of data and the most preferred scenario.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Rack Local</b>
<ul>
<li>In this type data and the mapper resides on the same node. This is the closest proximity of data and the most preferred scenario.</li>
<li>In this scenarios mapper and data reside on the same rack but on the different data nodes.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>Different Rack</b>
<ul>
<li>In this scenario mapper and data reside on the different racks.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="ImageContent">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/types-of-data-locality.jpg" alt="Types of data locality" /></div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/why-do-we-need-data-locality-in-hadoop/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>What are the core components of Hadoop ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/what-are-the-core-components-of-hadoop/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/what-are-the-core-components-of-hadoop/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 18:01:53 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[apache hadoop components]]></category>
		<category><![CDATA[apache hadoop core components were inspired bycomponents of hadoop ecosystemapache hadoop's core components inspired by]]></category>
		<category><![CDATA[apart from its basic components apache hadoop also provides]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[basic components of big data]]></category>
		<category><![CDATA[big data architecture components]]></category>
		<category><![CDATA[big data components]]></category>
		<category><![CDATA[big data ecosystem components]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[components of apache hadoop]]></category>
		<category><![CDATA[components of big data analytics]]></category>
		<category><![CDATA[components of hadoop architecture]]></category>
		<category><![CDATA[components of hadoop cluster based on hdfs]]></category>
		<category><![CDATA[components of hadoop framework]]></category>
		<category><![CDATA[components of hadoop in big data]]></category>
		<category><![CDATA[core apache components in hadoop]]></category>
		<category><![CDATA[core components of hadoop]]></category>
		<category><![CDATA[core components of hadoop ques]]></category>
		<category><![CDATA[core hadoop components]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[explain hadoop architecture and its components with proper diagram]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop and its components]]></category>
		<category><![CDATA[hadoop architecture components]]></category>
		<category><![CDATA[hadoop components]]></category>
		<category><![CDATA[hadoop components diagram]]></category>
		<category><![CDATA[hadoop components explained]]></category>
		<category><![CDATA[hadoop ecosystem components]]></category>
		<category><![CDATA[hadoop framework]]></category>
		<category><![CDATA[hadoop framework components]]></category>
		<category><![CDATA[hadoop includes open source components and closed source ones]]></category>
		<category><![CDATA[hadoop layers]]></category>
		<category><![CDATA[hadoop lower levels]]></category>
		<category><![CDATA[hadoop offers built in redundancy feature for both hdfs and mapreduce]]></category>
		<category><![CDATA[hadoop storage]]></category>
		<category><![CDATA[hdfs components]]></category>
		<category><![CDATA[hortonworks hadoop components]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[list of hadoop components]]></category>
		<category><![CDATA[main components of hadoop]]></category>
		<category><![CDATA[mapreduce components]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[not a big data component]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[two major components of the mapreduce layer]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what are the components of hadoop]]></category>
		<category><![CDATA[what are the core components of hadoop]]></category>
		<category><![CDATA[what is hadoop]]></category>
		<category><![CDATA[which component of hadoop ecosystem supports updation]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=281</guid>

					<description><![CDATA[Answer : Hadoop is an open source framework that is meant...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="what-are-the-core-components-of-hadoop" class="color-pink" style="text-align: justify;">What are the core components of Hadoop ?</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Hadoop is an open source framework that is meant for storage and processing of big data in a distributed manner.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/what-are-the-core-components-of-hadoop.png" alt="What are the core components of Hadoop" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="hdfs-hadoop-distributed-file-system" class="color-purple">HDFS (Hadoop Distributed File System)</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><a href="https://www.wikitechy.com/interview-questions/hadoop/what-are-the-components-of-hdfs-and-yarn" target="_blank" rel="noopener">HDFS</a> is the basic storage system of Hadoop.</li>
<li>The large data files running on a cluster of commodity hardware are stored in HDFS.</li>
<li>It can store data in a reliable manner even when hardware fails.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/hdfs.png" alt="HDFS (Hadoop Distributed File System)" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="core-components-of-hadoop" class="color-orange">Core Components of Hadoop</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Hadoop MapReduce</li>
<li>YARN</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="hadoop-mapreduce" class="color-green">Hadoop MapReduce</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><a href="https://www.wikitechy.com/tutorials/hive/hive-mapreduce-hadoop-mapreduce" target="_blank" rel="noopener">MapReduce</a> is the Hadoop layer that is responsible for data processing. It writes an application to process unstructured and structured data stored in HDFS.</li>
<li>It is responsible for the parallel processing of high volume of data by dividing data into independent tasks.</li>
<li>The processing is done in two phases Map and Reduce.</li>
<li>The Map is the first phase of processing that specifies complex logic code and the Reduce is the second phase of processing that specifies light-weight operations.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/hadoop-mapreduce.gif" alt=" Hadoop MapReduce" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="yarn" class="color-green">YARN</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>The processing framework in Hadoop is <a href="https://forums.wikitechy.com/question/how-to-prevent-spark-executors-from-getting-lost-when-using-yarn-client-mode/" target="_blank" rel="noopener">YARN</a>.</li>
<li>It is used for resource management and provides multiple data processing engines i.e. data science, real-time streaming, and batch processing.</li>
</ul>
</div>
</div>
<div class="ImageContent">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/yarn.jpg" alt=" YARN" /></div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/what-are-the-core-components-of-hadoop/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
