<br />
<b>Warning</b>:  Undefined array key "global_protection_id" in <b>/home/wikitechy/public_html/interview-questions/wp-content/plugins/content-protector/inc/class-ps-rest-handler.php</b> on line <b>51</b><br />
{"id":269,"date":"2021-07-12T17:16:29","date_gmt":"2021-07-12T17:16:29","guid":{"rendered":"https:\/\/www.wikitechy.com\/interview-questions\/?p=269"},"modified":"2021-09-14T09:47:12","modified_gmt":"2021-09-14T09:47:12","slug":"why-hadoop-used-for-big-data-analytics","status":"publish","type":"post","link":"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/","title":{"rendered":"Why Hadoop used for Big Data Analytics ?"},"content":{"rendered":"<div class=\"TextHeading\">\n<div class=\"hddn\">\n<h2 id=\"why-hadoop-used-for-big-data-analytics\" class=\"color-pink\" style=\"text-align: justify;\">Why Hadoop used for Big Data Analytics ?<\/h2>\n<\/div>\n<\/div>\n<div class=\"Content\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<ul>\n<li><a href=\"https:\/\/www.wikitechy.com\/interview-questions\/hadoop\/what-is-big-data\/\" target=\"_blank\" rel=\"noopener\">Big data<\/a>\u00a0analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations, market trends, customer preferences and other useful business information.<\/li>\n<li>Hadoop\u00a0is a framework to store and process big data. Hadoop specifically designed to provide distributed storage and parallel data processing that big data requires.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<div class=\"TextHeading\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<h2 id=\"hadoop-is-the-best-solution-for-storing-and-processing-big-data-because\" class=\"color-blue\">Hadoop is the best solution for storing and processing big data because:<\/h2>\n<\/div>\n<\/div>\n<p style=\"text-align: justify;\">Hadoop stores huge files as they are (raw) without specifying any schema.<\/p>\n<div class=\"Content\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<ul>\n<li><b>High scalability<\/b>\u00a0&#8211; We can add any number of nodes, hence enhancing performance dramatically.<\/li>\n<li><b>High availability<\/b>\u00a0&#8211; In\u00a0<a href=\"https:\/\/www.wikitechy.com\/interview-questions\/apache-pig\/what-is-the-advantages-of-pig-in-hadoop\/\" target=\"_blank\" rel=\"noopener\">hadoop<\/a>\u00a0data is highly available despite hardware failure. If a machine or few hardware crashes, then we can access data from another path.<\/li>\n<li><b>Reliable<\/b>\u00a0&#8211; Data is reliably stored on the cluster despite of machine failure.<\/li>\n<li><b>Economic<\/b>\u00a0&#8211; Hadoop runs on a cluster of commodity hardware which is not very expensive.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<div class=\"text-center row\" style=\"text-align: justify;\"><\/div>\n<div class=\"TextHeading\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<h2 id=\"what-is-hadoop\" class=\"color-purple\">What is Hadoop ?<\/h2>\n<\/div>\n<\/div>\n<div class=\"Content\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<ul>\n<li><a href=\"https:\/\/www.wikitechy.com\/interview-questions\/apache-pig\/what-is-the-difference-between-pig-hive-and-mapreduce\" target=\"_blank\" rel=\"noopener\">Hadoop<\/a>\u00a0is an open source project from Apache Software Foundation.<\/li>\n<li>It provides a software framework for distributing and running applications on clusters of servers that is inspired by Google\u2019s Map-Reduce programming model as well as its file system(GFS).<\/li>\n<li>Hadoop was originally written for the nutch search engine project.<\/li>\n<li>Hadoop is open source framework written in Java. It efficiently processes large volumes of data on a cluster of commodity hardware.<\/li>\n<li>Hadoop can be setup on single machine , but the real power of Hadoop comes with a cluster of machines , it can be scaled from a single machine to thousands of nodes. Hadoop consists of two key parts,\n<ul>\n<li>Hadoop Distributes File System(HDFS)<\/li>\n<li>Map-Reduce.<\/li>\n<\/ul>\n<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<div class=\"ImageContent\" style=\"text-align: justify;\">\n<div class=\"hddn\"><img decoding=\"async\" class=\"img-responsive center-block aligncenter\" src=\"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png\" alt=\"Hadoop Overview\" \/><\/div>\n<\/div>\n<div class=\"TextHeading\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<h2 id=\"hadoop-distributed-file-systemhdfs\" class=\"color-blue\">Hadoop Distributed File System(HDFS)<\/h2>\n<\/div>\n<\/div>\n<div class=\"Content\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<ul>\n<li>HDFS is a highly fault tolerant, distributed, reliable, scalable file system for data storage.<\/li>\n<li>HDFS stores multiple copies of data on different nodes; a file is split up into blocks (Default 64 MB) and stored across multiple machines.<\/li>\n<li>Hadoop cluster typically has a single namenode and number of datanodes to form the HDFS cluster.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n<div class=\"TextHeading\" style=\"text-align: justify;\">\n<div class=\"hddn\">\n<h2 id=\"map-reduce\" class=\"color-blue\">Map-Reduce<\/h2>\n<\/div>\n<\/div>\n<div class=\"Content\">\n<div class=\"hddn\">\n<ul>\n<li style=\"text-align: justify;\">Map-Reduce is a programming model designed for processing large volumes of data in parallel by dividing the work into a set of independent tasks.<\/li>\n<li style=\"text-align: justify;\">It is also a paradigm for distributed processing of large data set over a cluster of nodes.<\/li>\n<\/ul>\n<\/div>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>Answer : Big data analytics is the process of examining large data&#8230;<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"passster_activate_protection":false,"passster_protect_child_pages":"","passster_protection_type":"password","passster_password":"","passster_activate_overwrite_defaults":"","passster_headline":"","passster_instruction":"","passster_placeholder":"","passster_button":"","passster_id":"","passster_activate_misc_settings":"","passster_redirect_url":"","passster_hide":"no","passster_area_shortcode":"","gtb_hide_title":false,"gtb_wrap_title":false,"gtb_class_title":"","gtb_remove_headerfooter":false,"footnotes":""},"categories":[1065],"tags":[195,1119,360,1041,1168,1170,1175,1179,1174,1034,203,199,214,1120,205,1173,948,947,485,222,484,1171,1172,1169,1176,1177,1178,196,212,286,970,366,288,367,206,975,200,974,197,280,364,1031,1167,968,216,285,1121],"class_list":["post-269","post","type-post","status-publish","format-standard","hentry","category-big-data","tag-accenture-interview-questions-and-answers","tag-att-interview-questions-and-answers","tag-atos-interview-questions-and-answers","tag-big-data-analytics","tag-big-data-hadoop","tag-big-data-hadoop-certification","tag-big-data-hadoop-tutorial","tag-big-data-notes","tag-big-data-toolshow-big-data-and-hadoop-are-linked","tag-big-data-tutorial","tag-capgemini-interview-questions-and-answers","tag-casting-networks-india-pvt-limited-interview-questions-and-answers","tag-cgi-group-inc-interview-questions-and-answers","tag-collabera-technologiesinterview-questions-and-answers","tag-dell-international-services-india-pvt-ltd-interview-questions-and-answers","tag-difference-between-big-data-and-data-science","tag-difference-between-big-data-and-hadoop","tag-difference-between-hadoop-and-spark","tag-ernst-young-interview-questions-and-answers","tag-flipkart-interview-questions-and-answers","tag-genpact-interview-questions-and-answers","tag-hadoop-architecture","tag-hadoop-as-big-data-solution","tag-hadoop-database","tag-hadoop-example","tag-hadoop-modules","tag-hadoop-storage","tag-ibm-interview-questions-and-answers","tag-indecomm-global-services-interview-questions-and-answers","tag-lt-infotech-interview-questions-and-answers","tag-mindtree-interview-questions-and-answers","tag-netapp-interview-questions-and-answers","tag-r-systems-interview-questions-and-answers","tag-rbs-india-development-centre-pvt-ltd-interview-questions-and-answers","tag-sap-labs-india-pvt-ltd-interview-questions-and-answers","tag-tata-consultancy-service-interview-questions-and-answers","tag-tech-mahindra-interview-questions-and-answers","tag-trigent-software-interview-questions-and-answers","tag-unitedhealth-group-interview-questions-and-answers","tag-virtusa-consulting-services-pvt-ltd-interview-questions-and-answers","tag-wells-fargo-interview-questions-and-answers","tag-what-is-big-data","tag-what-is-hadoop-used-for","tag-wipro-infotech-interview-questions-and-answers","tag-wipro-interview-questions-and-answers","tag-xoriant-solutions-pvt-ltd-interview-questions-and-answers","tag-zs-associates-interview-questions-and-answers"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v22.6 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Why Hadoop used for Big Data Analytics ? - Big Data<\/title>\n<meta name=\"description\" content=\"Why Hadoop used for Big Data Analytics ? - Big data analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Why Hadoop used for Big Data Analytics ? - Big Data\" \/>\n<meta property=\"og:description\" content=\"Why Hadoop used for Big Data Analytics ? - Big data analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/\" \/>\n<meta property=\"og:site_name\" content=\"Wikitechy\" \/>\n<meta property=\"article:published_time\" content=\"2021-07-12T17:16:29+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-09-14T09:47:12+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png\" \/>\n<meta name=\"author\" content=\"Editor\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Editor\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"2 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/\",\"url\":\"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/\",\"name\":\"Why Hadoop used for Big Data Analytics ? - Big Data\",\"isPartOf\":{\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png\",\"datePublished\":\"2021-07-12T17:16:29+00:00\",\"dateModified\":\"2021-09-14T09:47:12+00:00\",\"author\":{\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/4d5a581fb5470d1560324bddc5e8b757\"},\"description\":\"Why Hadoop used for Big Data Analytics ? - Big data analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations\",\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/#primaryimage\",\"url\":\"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png\",\"contentUrl\":\"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png\"},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#website\",\"url\":\"https:\/\/www.wikitechy.com\/interview-questions\/\",\"name\":\"Wikitechy\",\"description\":\"Interview Questions\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.wikitechy.com\/interview-questions\/?s={search_term_string}\"},\"query-input\":\"required name=search_term_string\"}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/4d5a581fb5470d1560324bddc5e8b757\",\"name\":\"Editor\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/e9531079fe7e07841b7b156c04d65e5f39d4adfd18b6ffe3edfff8ca5aab85b5?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/e9531079fe7e07841b7b156c04d65e5f39d4adfd18b6ffe3edfff8ca5aab85b5?s=96&d=mm&r=g\",\"caption\":\"Editor\"},\"url\":\"https:\/\/www.wikitechy.com\/interview-questions\/author\/editor\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Why Hadoop used for Big Data Analytics ? - Big Data","description":"Why Hadoop used for Big Data Analytics ? - Big data analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/","og_locale":"en_US","og_type":"article","og_title":"Why Hadoop used for Big Data Analytics ? - Big Data","og_description":"Why Hadoop used for Big Data Analytics ? - Big data analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations","og_url":"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/","og_site_name":"Wikitechy","article_published_time":"2021-07-12T17:16:29+00:00","article_modified_time":"2021-09-14T09:47:12+00:00","og_image":[{"url":"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png"}],"author":"Editor","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Editor","Est. reading time":"2 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/","url":"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/","name":"Why Hadoop used for Big Data Analytics ? - Big Data","isPartOf":{"@id":"https:\/\/www.wikitechy.com\/interview-questions\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/#primaryimage"},"image":{"@id":"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/#primaryimage"},"thumbnailUrl":"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png","datePublished":"2021-07-12T17:16:29+00:00","dateModified":"2021-09-14T09:47:12+00:00","author":{"@id":"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/4d5a581fb5470d1560324bddc5e8b757"},"description":"Why Hadoop used for Big Data Analytics ? - Big data analytics is the process of examining large data sets to uncover hidden patterns, unknown correlations","inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.wikitechy.com\/interview-questions\/big-data\/why-hadoop-used-for-big-data-analytics\/#primaryimage","url":"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png","contentUrl":"https:\/\/cdn.wikitechy.com\/interview-questions\/hadoop\/hadoop-overview.png"},{"@type":"WebSite","@id":"https:\/\/www.wikitechy.com\/interview-questions\/#website","url":"https:\/\/www.wikitechy.com\/interview-questions\/","name":"Wikitechy","description":"Interview Questions","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.wikitechy.com\/interview-questions\/?s={search_term_string}"},"query-input":"required name=search_term_string"}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/4d5a581fb5470d1560324bddc5e8b757","name":"Editor","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.wikitechy.com\/interview-questions\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/e9531079fe7e07841b7b156c04d65e5f39d4adfd18b6ffe3edfff8ca5aab85b5?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/e9531079fe7e07841b7b156c04d65e5f39d4adfd18b6ffe3edfff8ca5aab85b5?s=96&d=mm&r=g","caption":"Editor"},"url":"https:\/\/www.wikitechy.com\/interview-questions\/author\/editor\/"}]}},"_links":{"self":[{"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/posts\/269","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/comments?post=269"}],"version-history":[{"count":5,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/posts\/269\/revisions"}],"predecessor-version":[{"id":3693,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/posts\/269\/revisions\/3693"}],"wp:attachment":[{"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/media?parent=269"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/categories?post=269"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.wikitechy.com\/interview-questions\/wp-json\/wp\/v2\/tags?post=269"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}