The beauty of Flume is Consolidation,
it collect data from different sources even it’s different flume Agents. Flume
source can collect all data flow from different sources and flows through
channel and sink. Finally send this data to HDFS or target destination.
Showing posts with label Flume. Show all posts
Showing posts with label Flume. Show all posts
What is Flume event?
A unit of data with set of string attribute called Flume event. The external source like web-server send events to the source. Internally Flume has inbuilt functionality to understand the source format.
Each log file is consider as an event. Each event has header and value sectors, which has header information and appropriate value that assign to particular header.
What is flume agent?
A flume agent is JVM holds the flume core components (source, channel, sink) through which events flow from an external source like web-servers to destination like HDFS. Agent is heart of the Apache Flume.
Subscribe to:
Posts (Atom)
-
Main driver class which provides job configuration parameters. Mapper class which must extend org.apache.hadoop.mapredu...
-
This will be used to extract various date formats. The available date formats as follows. Syntax: to_char ( date , fo...