{"id":578,"date":"2021-07-13T21:50:50","date_gmt":"2021-07-13T21:50:50","guid":{"rendered":"https:\/\/www.wikitechy.com\/interview-questions\/?p=578"},"modified":"2021-09-13T05:19:08","modified_gmt":"2021-09-13T05:19:08","slug":"what-is-the-result-of-clustering-a-partitioned-table-in-hive","status":"publish","type":"post","link":"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/","title":{"rendered":"What is the result of clustering a partitioned table in Hive ?"},"content":{"rendered":"<div class=\"TextHeading\">\n<div class=\"hddn\">\n<h2 id=\"result-of-clustering-a-partitioned-table-in-hive\" class=\"color-green\" style=\"text-align: justify;\">Result of clustering a partitioned table in Hive<\/h2>\n<\/div>\n<\/div>\n<div class=\"Content\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<ul>\n<li>Partitioning data is often used to distribute load horizontally, and it helps organize data in a logical fashion.<\/li>\n<li>Example: assume a user manages a large employee table and often runs queries with WHERE clauses that restrict the results to a particular country or department.<\/li>\n<li>For a quicker query response, the Hive table can be declared as PARTITIONED BY (country STRING, DEPT STRING). 
Partitioning a table changes how Hive structures the data storage: Hive now creates subdirectories reflecting the partitioning structure, like &#8230;\/employees\/country=ABC\/DEPT=XYZ.<\/li>\n<li>If a query filters for employees from country=ABC, Hive scans only the contents of the one directory country=ABC.<\/li>\n<li>This can dramatically improve query performance, but only if the partitioning scheme reflects common filtering.<\/li>\n<li>A design that creates too many partitions may optimize some queries but be detrimental to other important queries.<\/li>\n<li>Another drawback of too many partitions is the huge number of Hadoop files and directories they create, because the NameNode must keep all metadata for the file system in memory.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<div class=\"text-center row\" style=\"text-align: justify;\">\n<div class=\"col-sm-12\">\n<div id=\"bsa-zone_1590522538159-8_123456\"><\/div>\n<\/div>\n<\/div>\n<div class=\"ImageContent\" style=\"text-align: justify;\">\n<div class=\"hddn\"><img fetchpriority=\"high\" decoding=\"async\" class=\"aligncenter size-medium\" src=\"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png\" alt=\"partitionned table in hive\" width=\"602\" height=\"331\" \/><\/div>\n<\/div>\n<div class=\"Content\">\n<div class=\"hddn\">\n<ul>\n<li style=\"text-align: justify;\">Bucketing is another technique for decomposing data sets into more manageable parts.<\/li>\n<li style=\"text-align: justify;\">For example, a table using date as the top-level partition and employee_id as the second-level partition leads to too many small partitions.<\/li>\n<li style=\"text-align: justify;\">If the table is instead bucketed on employee_id, the value of this column is hashed into a user-defined number of buckets.<\/li>\n<li style=\"text-align: justify;\">Records with the same employee_id are always kept within the same bucket.<\/li>\n<li style=\"text-align: justify;\">Assuming the number of employee_id values is much larger than the number of buckets, every bucket will hold several 
employee_id values.<\/li>\n<li style=\"text-align: justify;\">When creating the table, you can specify CLUSTERED BY (employee_id) INTO XX BUCKETS, where XX is the number of buckets.<\/li>\n<li style=\"text-align: justify;\">Bucketing has several benefits. The number of buckets is fixed, so it does not fluctuate with the data.<\/li>\n<li style=\"text-align: justify;\">If two tables are bucketed by employee_id, Hive can create a logically correct sampling. Bucketing also aids in doing efficient map-side joins.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Answer: Partitioning data is often used to distribute load horizontally, and it helps organize data in a logical fashion.<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"passster_activate_protection":false,"passster_protect_child_pages":"","passster_protection_type":"password","passster_password":"","passster_activate_overwrite_defaults":"","passster_headline":"","passster_instruction":"","passster_placeholder":"","passster_button":"","passster_id":"","passster_activate_misc_settings":"","passster_redirect_url":"","passster_hide":"no","passster_area_shortcode":"","gtb_hide_title":false,"gtb_wrap_title":false,"gtb_class_title":"","gtb_remove_headerfooter":false,"footnotes":""},"categories":[3782],"tags":[195,971,3799,203,199,214,209,3897,3895,205,222,484,3888,3803,3890,3892,3898,3894,3891,621,620,3885,3899,196,3716,712,970,366,3889,208,288,483,3896,976,975,200,974,197,280,364,3893,968,216,2119,969],"class_list":["post-578","post","type-post","status-publish","format-standard","hentry","category-hive","tag-accenture-interview-questions-and-answers","tag-altimetrik-india-pvt-ltd-interview-questions-and-answers","tag-ani-technologies-pvt-ltd-interview-questions-and-answers","tag-capgemini-interview-questions-and-answers","tag-casting-networks-india-pvt-limited-int
erview-questions-and-answers","tag-cgi-group-inc-interview-questions-and-answers","tag-collabera-technologies-interview-questions-and-answers","tag-create-external-table","tag-create-partition-table","tag-dell-international-services-india-pvt-ltd-interview-questions-and-answers","tag-flipkart-interview-questions-and-answers","tag-genpact-interview-questions-and-answers","tag-hive-add-partition","tag-hive-create-database","tag-hive-create-table","tag-hive-external-table","tag-hive-insert-into-table-values","tag-hive-list-tables","tag-hive-partition","tag-hive-query-based-interview-questions","tag-hive-scenario-based-interview-questions","tag-hive-show-databases","tag-how-to-create-table-in-hive","tag-ibm-interview-questions-and-answers","tag-impetus-technologies-interview-questions-and-answers","tag-indiabulls-technology-solutions-ltd-interview-questions-and-answers","tag-mindtree-interview-questions-and-answers","tag-netapp-interview-questions-and-answers","tag-partition-table","tag-prokarma-softech-pvt-ltd-interview-questions-and-answers","tag-r-systems-interview-questions-and-answers","tag-reliance-industries-ltd-interview-questions-and-answers","tag-show-partitions-hive","tag-synechron-te-interview-questions-and-answers","tag-tata-consultancy-service-interview-questions-and-answers","tag-tech-mahindra-interview-questions-and-answers","tag-trigent-software-interview-questions-and-answers","tag-unitedhealth-group-interview-questions-and-answers","tag-virtusa-consulting-services-pvt-ltd-interview-questions-and-answers","tag-wells-fargo-interview-questions-and-answers","tag-what-is-partitioning","tag-wipro-infotech-interview-questions-and-answers","tag-wipro-interview-questions-and-answers","tag-yash-technologies-interview-questions-and-answers","tag-yodlee-infotech-pvt-ltd-interview-questions-and-answers"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>What is the result of 
clustering a partitioned table in Hive ?<\/title>\n<meta name=\"description\" content=\"What is the result of clustering a partitioned table in Hive ? - apache hive interview questions - Partitioning data is often used for distributing load horizontally.\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What is the result of clustering a partitioned table in Hive ?\" \/>\n<meta property=\"og:description\" content=\"What is the result of clustering a partitioned table in Hive ? - apache hive interview questions - Partitioning data is often used for distributing load horizontally.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/\" \/>\n<meta property=\"og:site_name\" content=\"Wikitechy\" \/>\n<meta property=\"article:published_time\" content=\"2021-07-13T21:50:50+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-09-13T05:19:08+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png\" \/>\n<meta name=\"author\" content=\"Editor\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Editor\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/\",\"url\":\"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/\",\"name\":\"What is the result of clustering a partitioned table in Hive ?\",\"isPartOf\":{\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png\",\"datePublished\":\"2021-07-13T21:50:50+00:00\",\"dateModified\":\"2021-09-13T05:19:08+00:00\",\"author\":{\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/4d5a581fb5470d1560324bddc5e8b757\"},\"description\":\"What is the result of clustering a partitioned table in Hive ? 
- apache hive interview questions - Partitioning data is often used for distributing load horizontally.\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/#primaryimage\",\"url\":\"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png\",\"contentUrl\":\"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png\"},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#website\",\"url\":\"https:\/\/www.wikitechy.com\/interview-questions\/\",\"name\":\"Wikitechy\",\"description\":\"Interview Questions\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.wikitechy.com\/interview-questions\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/4d5a581fb5470d1560324bddc5e8b757\",\"name\":\"Editor\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e9531079fe7e07841b7b156c04d65e5f39d4adfd18b6ffe3edfff8ca5aab85b5?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e9531079fe7e07841b7b156c04d65e5f39d4adfd18b6ffe3edfff8ca5aab85b5?s=96&d=mm&r=g\",\"caption\":\"Editor\"},\"url\":\"https:\/\/www.wikitechy.com\/interview-questions\/author\/editor\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. 
-->","yoast_head_json":{"title":"What is the result of clustering a partitioned table in Hive ?","description":"What is the result of clustering a partitioned table in Hive ? - apache hive interview questions - Partitioning data is often used for distributing load horizontally.","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/","og_locale":"en_US","og_type":"article","og_title":"What is the result of clustering a partitioned table in Hive ?","og_description":"What is the result of clustering a partitioned table in Hive ? - apache hive interview questions - Partitioning data is often used for distributing load horizontally.","og_url":"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/","og_site_name":"Wikitechy","article_published_time":"2021-07-13T21:50:50+00:00","article_modified_time":"2021-09-13T05:19:08+00:00","og_image":[{"url":"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png"}],"author":"Editor","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Editor","Est. 
reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/","url":"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/","name":"What is the result of clustering a partitioned table in Hive ?","isPartOf":{"@id":"https:\/\/www.wikitechy.com\/interview-questions\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/#primaryimage"},"image":{"@id":"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/#primaryimage"},"thumbnailUrl":"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png","datePublished":"2021-07-13T21:50:50+00:00","dateModified":"2021-09-13T05:19:08+00:00","author":{"@id":"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/4d5a581fb5470d1560324bddc5e8b757"},"description":"What is the result of clustering a partitioned table in Hive ? 
- apache hive interview questions - Partitioning data is often used for distributing load horizontally.","inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.wikitechy.com\/interview-questions\/hive\/what-is-the-result-of-clustering-a-partitioned-table-in-hive\/#primaryimage","url":"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png","contentUrl":"https:\/\/cdn.wikitechy.com\/interview-questions\/hive\/partitionned-table-in-hive.png"},{"@type":"WebSite","@id":"https:\/\/www.wikitechy.com\/interview-questions\/#website","url":"https:\/\/www.wikitechy.com\/interview-questions\/","name":"Wikitechy","description":"Interview Questions","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.wikitechy.com\/interview-questions\/?s={search_term_string}"},"query-input":"required 
name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/4d5a581fb5470d1560324bddc5e8b757","name":"Editor","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e9531079fe7e07841b7b156c04d65e5f39d4adfd18b6ffe3edfff8ca5aab85b5?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e9531079fe7e07841b7b156c04d65e5f39d4adfd18b6ffe3edfff8ca5aab85b5?s=96&d=mm&r=g","caption":"Editor"},"url":"https:\/\/www.wikitechy.com\/interview-questions\/author\/editor\/"}]}},"_links":{"self":[{"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/posts\/578","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/comments?post=578"}],"version-history":[{"count":3,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/posts\/578\/revisions"}],"predecessor-version":[{"id":3458,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/posts\/578\/revisions\/3458"}],"wp:attachment":[{"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/media?parent=578"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/categories?post=578"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/tags?post=578"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templa
ted":true}]}}