<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>kafka vs flume vs scribe - Wikitechy</title>
	<atom:link href="https://www.wikitechy.com/interview-questions/tag/kafka-vs-flume-vs-scribe/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.wikitechy.com/interview-questions/tag/kafka-vs-flume-vs-scribe/</link>
	<description>Interview Questions</description>
	<lastBuildDate>Wed, 15 Sep 2021 05:39:35 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.9</generator>

<image>
	<url>https://www.wikitechy.com/interview-questions/wp-content/uploads/2025/10/cropped-wikitechy-icon-32x32.png</url>
	<title>kafka vs flume vs scribe - Wikitechy</title>
	<link>https://www.wikitechy.com/interview-questions/tag/kafka-vs-flume-vs-scribe/</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>What is the difference between flume and Kafka ?</title>
		<link>https://www.wikitechy.com/interview-questions/apache-flume/what-is-the-difference-between-flume-and-kafka/</link>
					<comments>https://www.wikitechy.com/interview-questions/apache-flume/what-is-the-difference-between-flume-and-kafka/#respond</comments>
		
		<dc:creator><![CDATA[Editor]]></dc:creator>
		<pubDate>Sun, 11 Jul 2021 15:37:08 +0000</pubDate>
				<category><![CDATA[Apache Flume]]></category>
		<category><![CDATA[apache flume disadvantages]]></category>
		<category><![CDATA[apache flume limitations]]></category>
		<category><![CDATA[apache flume vs sqoop]]></category>
		<category><![CDATA[apache flume vs storm]]></category>
		<category><![CDATA[disadvantages of flume]]></category>
		<category><![CDATA[flume interview question]]></category>
		<category><![CDATA[flume kafka integration]]></category>
		<category><![CDATA[flume vs kafka vs spark]]></category>
		<category><![CDATA[flume vs spark streaming]]></category>
		<category><![CDATA[flume vs streaming]]></category>
		<category><![CDATA[kafka vs flume vs scribe]]></category>
		<category><![CDATA[kafka vs sqoop]]></category>
		<category><![CDATA[Wells Fargo interview questions and answers]]></category>
		<category><![CDATA[which type of channel will provide high throughput]]></category>
		<guid isPermaLink="false">https://www.wikitechy.com/interview-questions/?p=135</guid>

					<description><![CDATA[Answer-Kafka can support data streams for multiple applications]]></description>
										<content:encoded><![CDATA[<div class="TextHeading">
<div class="hddn">
<h2 id="difference-between-flume-and-kafka" class="color-green" style="text-align: justify;">Difference between flume and Kafka</h2>
</div>
</div>
<table class="table-bordered table-striped table table-responsive">
<tbody>
<tr>
<th>Flume</th>
<th>Kafka</th>
</tr>
<tr>
<td class="text-leftalign">The Flume is a Distributed reliable system for collecting,<br />
aggregating and moving large amount of data to<br />
centralized datastore like HDFS or Hbase</td>
<td class="text-leftalign">General purpose publish &#8211; subscribe<br />
model messaging system</td>
</tr>
<tr>
<td class="text-leftalign">Adding more consumers means to change the<br />
design of flume pipeline and replicating the channel to<br />
deliver messages to new sink which needs<br />
downtime</td>
<td class="text-leftalign">Easy to add more consumers without downtime</td>
</tr>
<tr>
<td class="text-leftalign">Supports many built-in sources and sinks<br />
out of box</td>
<td class="text-leftalign">Sometimes need to write own producer and consumer code though Spark and Storm have now come up with built-in integrations to Kafka</td>
</tr>
<tr>
<td class="text-leftalign">Flume pushes data into sink and hence<br />
consumers do not have to maintain offset</td>
<td class="text-leftalign">Subscribers are responsible for pulling data<br />
and also maintaining pointer to offset</td>
</tr>
<tr>
<td class="text-leftalign">Events are lost in case the agent goes down</td>
<td class="text-leftalign">Provides fault tolerance</td>
</tr>
<tr>
<td class="text-leftalign">Does not support partitioning</td>
<td class="text-leftalign">Supports partitioning</td>
</tr>
<tr>
<td class="text-leftalign">Flume pushes data to the sink because of<br />
which writes to sink can overwhelm data reads from sink</td>
<td class="text-leftalign">Since kafka does not push data, writes from producer to broker and reads from broker to consumers can happen at their own pace</td>
</tr>
<tr>
<td class="text-leftalign">It is tightly integrated with Hadoop</td>
<td class="text-leftalign">General purpose</td>
</tr>
</tbody>
</table>
<div class="ImageContent">
<div class="hddn"><img fetchpriority="high" decoding="async" class="aligncenter size-medium" src="https://cdn.wikitechy.com/interview-questions/apache-flume/what-is-the-difference-between-flume-and-kafka.png" alt="what is the difference between flume and kafka" width="1097" height="378" /></div>
</div>
]]></content:encoded>
					
					<wfw:commentRss>https://www.wikitechy.com/interview-questions/apache-flume/what-is-the-difference-between-flume-and-kafka/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
