<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>what is partitioning - Wikitechy</title>
	<atom:link href="https://www.wikitechy.com/interview-questions/tag/what-is-partitioning/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.wikitechy.com/interview-questions/tag/what-is-partitioning/</link>
	<description>Interview Questions</description>
	<lastBuildDate>Mon, 13 Sep 2021 05:19:08 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9.4</generator>

<image>
	<url>https://www.wikitechy.com/interview-questions/wp-content/uploads/2025/10/cropped-wikitechy-icon-32x32.png</url>
	<title>what is partitioning - Wikitechy</title>
	<link>https://www.wikitechy.com/interview-questions/tag/what-is-partitioning/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>What is the result of clustering a partitioned table in Hive?</title>
		<link>https://www.wikitechy.com/interview-questions/hive/what-is-the-result-of-clustering-a-partitioned-table-in-hive/</link>
					<comments>https://www.wikitechy.com/interview-questions/hive/what-is-the-result-of-clustering-a-partitioned-table-in-hive/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Tue, 13 Jul 2021 21:50:50 +0000</pubDate>
				<category><![CDATA[Hive]]></category>
		<category><![CDATA[Accenture interview questions and answers]]></category>
		<category><![CDATA[Altimetrik India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[ANI Technologies Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Capgemini interview questions and answers]]></category>
		<category><![CDATA[CASTING NETWORKS INDIA PVT LIMITED interview questions and answers]]></category>
		<category><![CDATA[CGI Group Inc interview questions and answers]]></category>
		<category><![CDATA[Collabera Technologies interview questions and answers]]></category>
		<category><![CDATA[create external table]]></category>
		<category><![CDATA[create partition table]]></category>
		<category><![CDATA[Dell International Services India Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Flipkart interview questions and answers]]></category>
		<category><![CDATA[Genpact interview questions and answers]]></category>
		<category><![CDATA[hive add partition]]></category>
		<category><![CDATA[hive create database]]></category>
		<category><![CDATA[hive create table]]></category>
		<category><![CDATA[hive external table]]></category>
		<category><![CDATA[hive insert into table values]]></category>
		<category><![CDATA[hive list tables]]></category>
		<category><![CDATA[hive partition]]></category>
		<category><![CDATA[hive query based interview questions]]></category>
		<category><![CDATA[hive scenario based interview questions]]></category>
		<category><![CDATA[hive show databases]]></category>
		<category><![CDATA[how to create table in hive]]></category>
		<category><![CDATA[IBM interview questions and answers]]></category>
		<category><![CDATA[Impetus Technologies interview questions and answers]]></category>
		<category><![CDATA[Indiabulls Technology Solutions Ltd interview questions and answers]]></category>
		<category><![CDATA[Mindtree interview questions and answers]]></category>
		<category><![CDATA[NetApp interview questions and answers]]></category>
		<category><![CDATA[partition table]]></category>
		<category><![CDATA[Prokarma Softech Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[R Systems interview questions and answers]]></category>
		<category><![CDATA[Reliance Industries Ltd interview questions and answers]]></category>
		<category><![CDATA[show partitions hive]]></category>
		<category><![CDATA[Synechron Te interview questions and answers]]></category>
		<category><![CDATA[Tata Consultancy Service interview questions and answers]]></category>
		<category><![CDATA[Tech Mahindra interview questions and answers]]></category>
		<category><![CDATA[Trigent Software interview questions and answers]]></category>
		<category><![CDATA[UnitedHealth Group interview questions and answers]]></category>
		<category><![CDATA[Virtusa Consulting Services Pvt Ltd interview questions and answers]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[what is partitioning]]></category>
		<category><![CDATA[Wipro Infotech interview questions and answers]]></category>
		<category><![CDATA[Wipro interview questions and answers]]></category>
		<category><![CDATA[Yash Technologies interview questions and answers]]></category>
		<category><![CDATA[Yodlee Infotech Pvt Ltd interview questions and answers]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=578</guid>

					<description><![CDATA[Answer: Partitioning distributes load horizontally and helps organize data in a logical fashion.]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="result-of-clustering-a-partitioned-table-in-hive" class="color-green" style="text-align: justify;">Result of clustering a partitioned table in Hive</h2>
</div>
</div>
<div class="Content" style="text-align: justify;">
<div class="hddn">
<ul>
<li>Partitioning distributes load horizontally and helps organize data in a logical fashion.</li>
<li>Example: Assume a user manages a large employee table and often runs queries with WHERE clauses that restrict the results to a particular country or department.</li>
<li>For a quicker query response, the Hive table can be PARTITIONED BY (country STRING, DEPT STRING). Partitioning a table changes how Hive structures the data storage: Hive now creates subdirectories reflecting the partitioning structure, like &#8230;/employees/country=ABC/DEPT=XYZ.</li>
<li>If a query is limited to employees from country=ABC, Hive scans only the contents of the one directory country=ABC.</li>
<li>This can dramatically improve query performance, but only if the partitioning scheme reflects common filtering.</li>
<li>A design that creates too many partitions may optimize some queries but be detrimental to other important queries.</li>
<li>A disadvantage of having too many partitions is the large number of Hadoop files and directories that are created, because the NameNode must keep all metadata for the file system in memory.</li>
</ul>
</div>
</div>
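<div class="Content" style="text-align: justify;">
<div class="hddn">
<p>The directory layout and the pruning effect described above can be sketched in a few lines of Python. This is only an illustration, not Hive code: the /warehouse root, the extra partition values and the helper names partition_dir and prune are assumptions made for the example.</p>
<pre><code class="language-python">from pathlib import PurePosixPath


def partition_dir(table_root, partition_values):
    """Build a Hive-style directory: one key=value level per partition column."""
    path = PurePosixPath(table_root)
    for column, value in partition_values:
        path = path / f"{column}={value}"
    return str(path)


def prune(directories, column, value):
    """Keep only the directories that contain the path component column=value."""
    wanted = f"{column}={value}"
    return [d for d in directories if wanted in PurePosixPath(d).parts]


# Hypothetical table root and partition values, for illustration only.
dirs = [
    partition_dir("/warehouse/employees", [("country", "ABC"), ("DEPT", "XYZ")]),
    partition_dir("/warehouse/employees", [("country", "ABC"), ("DEPT", "QRS")]),
    partition_dir("/warehouse/employees", [("country", "DEF"), ("DEPT", "XYZ")]),
]

# A filter on country=ABC only needs the directories under country=ABC.
selected = prune(dirs, "country", "ABC")
</code></pre>
<p>Here selected holds only the two directories under country=ABC, so the data for country=DEF is never read.</p>
</div>
</div>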
<div class="ImageContent" style="text-align: justify;">
<div class="hddn"><img fetchpriority="high" decoding="async" class="aligncenter size-medium" src="https://cdn.wikitechy.com/interview-questions/hive/partitionned-table-in-hive.png" alt="partitioned table in Hive" width="602" height="331" /></div>
</div>
<div class="Content">
<div class="hddn">
<ul>
<li style="text-align: justify;">Bucketing is another technique for decomposing data sets into more manageable parts.</li>
<li style="text-align: justify;">For example, a table that uses date as the top-level partition and employee_id as the second-level partition leads to too many small partitions.</li>
<li style="text-align: justify;">If the table is instead bucketed on the employee_id column, the value of that column is hashed into a user-defined number of buckets.</li>
<li style="text-align: justify;">Records with the same employee_id are always kept within the same bucket.</li>
<li style="text-align: justify;">Assuming the number of distinct employee_id values is much larger than the number of buckets, every bucket holds many employee_id values.</li>
<li style="text-align: justify;">While creating the table, you can specify CLUSTERED BY (employee_id) INTO XX BUCKETS, where XX is the number of buckets.</li>
<li style="text-align: justify;">Bucketing has several benefits. The number of buckets is fixed, so it does not fluctuate with the data.</li>
<li style="text-align: justify;">If two tables are bucketed by employee_id, Hive can create a logically correct sample. Bucketing also aids efficient map-side joins.</li>
</ul>
</div>
</div>
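<div class="Content">
<div class="hddn">
<p style="text-align: justify;">The bucket assignment described above can be illustrated with a minimal Python sketch. The hash used here (the id modulo the bucket count) is a simplified stand-in: Hive's actual hash function depends on the column type and the Hive version. The bucket count of 4 and the sample employee_id values are assumptions made for the example.</p>
<pre><code class="language-python">from collections import defaultdict

NUM_BUCKETS = 4  # hypothetical value standing in for XX


def bucket_for(employee_id, num_buckets=NUM_BUCKETS):
    """Simplified stand-in for Hive's hash: non-negative id modulo bucket count."""
    return abs(employee_id) % num_buckets


# Made-up employee_id values; 101 appears twice on purpose.
buckets = defaultdict(list)
for employee_id in [101, 102, 103, 104, 105, 101, 109]:
    buckets[bucket_for(employee_id)].append(employee_id)
</code></pre>
<p style="text-align: justify;">Both records with employee_id 101 land in the same bucket, and that bucket also holds other ids (105 and 109), while the number of buckets stays at 4 however many records are added.</p>
</div>
</div>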
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/hive/what-is-the-result-of-clustering-a-partitioned-table-in-hive/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
