<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>hadoop architecture - Wikitechy</title>
	<atom:link href="https://www.wikitechy.com/interview-questions/tag/hadoop-architecture/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.wikitechy.com/interview-questions/tag/hadoop-architecture/</link>
	<description>Interview Questions</description>
	<lastBuildDate>Tue, 14 Sep 2021 09:47:12 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://www.wikitechy.com/interview-questions/wp-content/uploads/2025/10/cropped-wikitechy-icon-32x32.png</url>
	<title>hadoop architecture - Wikitechy</title>
	<link>https://www.wikitechy.com/interview-questions/tag/hadoop-architecture/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>What are the features of Hadoop ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/what-are-the-features-of-hadoop/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/what-are-the-features-of-hadoop/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 17:55:46 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[9 Features Of Hadoop]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[advantages of hadoop over data warehouse]]></category>
		<category><![CDATA[apache hadoop]]></category>
		<category><![CDATA[applications of hadoopa]]></category>
		<category><![CDATA[architecture of hadoop]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[big data hadoop]]></category>
		<category><![CDATA[bigdata and hadoop]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[characteristics of big data]]></category>
		<category><![CDATA[characteristics of hadoop]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[components of hadoop]]></category>
		<category><![CDATA[conclusion of hadoop]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[distributed file system]]></category>
		<category><![CDATA[dvantages of hadoop ecosystem]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[features of big data]]></category>
		<category><![CDATA[Features of Hadoop]]></category>
		<category><![CDATA[features of hdfs]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[functionalities of hadoop cluster]]></category>
		<category><![CDATA[functionality of hadoop cluster]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop]]></category>
		<category><![CDATA[hadoop architecture]]></category>
		<category><![CDATA[hadoop certification]]></category>
		<category><![CDATA[hadoop commands]]></category>
		<category><![CDATA[hadoop ecosystem]]></category>
		<category><![CDATA[hadoop explained]]></category>
		<category><![CDATA[hadoop features and advantages]]></category>
		<category><![CDATA[hadoop hive]]></category>
		<category><![CDATA[hadoop mapreduce]]></category>
		<category><![CDATA[hadoop the definitive guide]]></category>
		<category><![CDATA[hdfs]]></category>
		<category><![CDATA[hdfs architecture]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[key distinctions of hadoop are mcq]]></category>
		<category><![CDATA[key features of big data]]></category>
		<category><![CDATA[Key Features of Hadoop]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[mapreduce]]></category>
		<category><![CDATA[mapreduce features]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[pig hadoop]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[special features of hadoop]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Top 10 Features of Big Data Hadoop]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[What are the main features of Hadoop]]></category>
		<category><![CDATA[What are the most important features of Hadoop]]></category>
		<category><![CDATA[what is big data hadoop]]></category>
		<category><![CDATA[what is hadoop]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[yarn architecture]]></category>
		<category><![CDATA[yarn hadoop]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=279</guid>

					<description><![CDATA[Answer : Hadoop supports the storage and processing of big data...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="what-are-the-features-of-hadoop" class="color-pink" style="text-align: justify;">What are the features of Hadoop ?</h2>
</div>
</div>
<p style="text-align: justify;">Hadoop supports the storage and processing of big data. It is the best solution for handling big data challenges. Some important features of Hadoop are –</p>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/what-are-the-features-of-hadoop.png" alt="What are the features of Hadoop" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="open-source" class="color-green">Open Source</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Hadoop is an <a href="https://www.wikitechy.com/technology/10-great-open-source-apps-for-android/" target="_blank" rel="noopener">open source</a> framework which means it is available free of cost.</li>
<li>Also, the users are allowed to change the source code as per their requirements.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="distributed-processing" class="color-green">Distributed Processing</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><a href="https://www.wikitechy.com/tutorials/apache-pig/apache-pig-tutorial/satellite-image-processing-using-hadoop.php" target="_blank" rel="noopener">Hadoop</a> supports distributed processing of data i.e. faster processing.</li>
<li>The data in Hadoop HDFS is stored in a distributed manner and MapReduce is responsible for the parallel processing of data.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="fault-tolerance" class="color-green">Fault Tolerance</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Hadoop is highly <a href="https://www.wikitechy.com/interview-questions/vsphere/what-is-vmware-fault-tolerance" target="_blank" rel="noopener">fault-tolerant</a>. It creates three replicas for each block at different nodes, by default.</li>
<li>This number can be changed according to the requirement. So, we can recover the data from another node if one node fails.</li>
<li>The detection of node failure and recovery of data is done automatically.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="reliability" class="color-green">Reliability</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Hadoop stores data on the cluster in a reliable manner that is independent of machine.</li>
<li>So, the data stored in Hadoop environment is not affected by the failure of the machine.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="scalability" class="color-green">Scalability</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Another important feature of Hadoop is the scalability. It is compatible with the other hardware and we can easily ass the new hardware to the nodes.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/scalibility-in-hadoop.gif" alt="Scalability in hadoop" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="economic" class="color-green">Economic</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Apache Hadoop is not very expensive as it runs on a cluster of commodity hardware.</li>
<li>Hadoop also provides huge cost saving also as it is very easy to add more nodes on the fly here. So if requirement increases, then you can increase nodes as well without any downtime and without requiring much of pre-planning.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/economic-in-hadoop.gif" alt="Economic in hadoop" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="easy-to-use" class="color-green">Easy to use</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>No need of client to deal with distributed computing, the framework takes care of all the things. So this feature of Hadoop is easy to use.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="data-locality" class="color-green">Data Locality</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>This one is a unique features of Hadoop that made it easily handle the Big Data. Hadoop works on data locality principle which states that move computation to data instead of data to computation.</li>
<li>When a client submits the MapReduce algorithm, this algorithm is moved to data in the cluster rather than bringing data to the location where the algorithm is submitted and then processing it.</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/data-locality-in-hadoop.gif" alt="Data Locality " /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="high-availability" class="color-green">High Availability</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>The data stored in Hadoop is available to access even after the hardware failure. In case of hardware failure, the data can be accessed from another path.</li>
</ul>
</div>
</div>
<div class="ImageContent">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/high-availability-in-hadoop.gif" alt="High Availability " /></div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/what-are-the-features-of-hadoop/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>Why Hadoop used for Big Data Analytics ?</title>
		<link>https://www.wikitechy.com/interview-questions/big-data/why-hadoop-used-for-big-data-analytics/</link>
					<comments>https://www.wikitechy.com/interview-questions/big-data/why-hadoop-used-for-big-data-analytics/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Mon, 12 Jul 2021 17:16:29 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[AT&T interview questions and answers]]></category>
		<category><![CDATA[Atos interview questions and answers]]></category>
		<category><![CDATA[big data analytics]]></category>
		<category><![CDATA[big data hadoop]]></category>
		<category><![CDATA[big data hadoop certification]]></category>
		<category><![CDATA[big data hadoop tutorial]]></category>
		<category><![CDATA[big data notes]]></category>
		<category><![CDATA[big data toolshow big data and hadoop are linked]]></category>
		<category><![CDATA[big data tutorial]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologiesinterview questions and answers]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[difference between big data and data science]]></category>
		<category><![CDATA[difference between big data and hadoop]]></category>
		<category><![CDATA[difference between hadoop and spark]]></category>
		<category><![CDATA[Ernst & Young interview questions and answers]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hadoop architecture]]></category>
		<category><![CDATA[hadoop as big data solution]]></category>
		<category><![CDATA[hadoop database]]></category>
		<category><![CDATA[hadoop example]]></category>
		<category><![CDATA[hadoop modules]]></category>
		<category><![CDATA[hadoop storage]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Indecomm Global Services interview questions and answers]]></category>
		<category><![CDATA[L&T Infotech interview questions and answers]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[RBS India Development Centre Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[SAP Labs India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what is big data]]></category>
		<category><![CDATA[what is hadoop used for]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Xoriant Solutions Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[ZS Associates interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=269</guid>

					<description><![CDATA[Answer : Big data analytics is the process of examining large data...]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="why-hadoop-used-for-big-data-analytics" class="color-pink" style="text-align: justify;">Why Hadoop used for Big Data Analytics ?</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><a href="https://www.wikitechy.com/interview-questions/hadoop/what-is-big-data/" target="_blank" rel="noopener">Big data</a> analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations, market trends, customer preferences and other useful business information.</li>
<li>Hadoop is a framework to store and process big data. Hadoop specifically designed to provide distributed storage and parallel data processing that big data requires.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="hadoop-is-the-best-solution-for-storing-and-processing-big-data-because" class="color-blue">Hadoop is the best solution for storing and processing big data because:</h2>
</div>
</div>
<p style="text-align: justify;">Hadoop stores huge files as they are (raw) without specifying any schema.</p>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><b>High scalability</b> &#8211; We can add any number of nodes, hence enhancing performance dramatically.</li>
<li><b>High availability</b> &#8211; In <a href="https://www.wikitechy.com/interview-questions/apache-pig/what-is-the-advantages-of-pig-in-hadoop/" target="_blank" rel="noopener">hadoop</a> data is highly available despite hardware failure. If a machine or few hardware crashes, then we can access data from another path.</li>
<li><b>Reliable</b> &#8211; Data is reliably stored on the cluster despite of machine failure.</li>
<li><b>Economic</b> &#8211; Hadoop runs on a cluster of commodity hardware which is not very expensive.</li>
</ul>
</div>
</div>
<div class="text-center row" style="text-align: justify;"></div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="what-is-hadoop" class="color-purple">What is Hadoop ?</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li><a href="https://www.wikitechy.com/interview-questions/apache-pig/what-is-the-difference-between-pig-hive-and-mapreduce" target="_blank" rel="noopener">Hadoop</a> is an open source project from Apache Software Foundation.</li>
<li>It provides a software framework for distributing and running applications on clusters of servers that is inspired by Google’s Map-Reduce programming model as well as its file system(GFS).</li>
<li>Hadoop was originally written for the nutch search engine project.</li>
<li>Hadoop is open source framework written in Java. It efficiently processes large volumes of data on a cluster of commodity hardware.</li>
<li>Hadoop can be setup on single machine , but the real power of Hadoop comes with a cluster of machines , it can be scaled from a single machine to thousands of nodes. Hadoop consists of two key parts,
<ul>
<li>Hadoop Distributes File System(HDFS)</li>
<li>Map-Reduce.</li>
</ul>
</li>
</ul>
</div>
</div>
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img decoding="async" class="img-responsive center-block aligncenter" src="https://cdn.wikitechy.com/interview-questions/hadoop/hadoop-overview.png" alt="Hadoop Overview" /></div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="hadoop-distributed-file-systemhdfs" class="color-blue">Hadoop Distributed File System(HDFS)</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>HDFS is a highly fault tolerant, distributed, reliable, scalable file system for data storage.</li>
<li>HDFS stores multiple copies of data on different nodes; a file is split up into blocks (Default 64 MB) and stored across multiple machines.</li>
<li>Hadoop cluster typically has a single namenode and number of datanodes to form the HDFS cluster.</li>
</ul>
</div>
</div>
<div class="TextHeading" style="text-align: justify;">
<div class="hddn">
<h2 id="map-reduce" class="color-blue">Map-Reduce</h2>
</div>
</div>
<div class="Content">
<div class="hddn">
<ul>
<li style="text-align: justify;">Map-Reduce is a programming model designed for processing large volumes of data in parallel by dividing the work into a set of independent tasks.</li>
<li style="text-align: justify;">It is also a paradigm for distributed processing of large data set over a cluster of nodes.</li>
</ul>
</div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/big-data/why-hadoop-used-for-big-data-analytics/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
