03-Flume的配置说明及案例演示

properties of avro-AppSrv-source

agent_foo. sources.avro-AppSrv-source.type = avro

agent_foo.sources.avro-AppSrv-source.bind = localhost

agent_foo.sources.avro-AppSrv-source.port = 10000

properties of mem-channel-1

agent_foo.channels.mem-channel-1.type = memory

agent_foo.channels.mem-channel-1.capacity = 1000

agent_foo .channels.mem-channel-1.transactionCapacity = 100

properties of hdfs-Cluster1-sink

agent_foo.sinks.hdfs-Cluster1-sink.type = hdfs

agent_foo.sinks. hdfs-Cluster1-sink.hdfs.path = hdfs://namenode /flume/webdata

#…

常用的source和sink种类
常用的flume sources

Avro source:

avro

Syslog TCP source:

syslogtcp

Syslog uDP Source:

syslogudp

HTTP Source:

http

Exec source:

exec

JMS source:

jms

Thrift source:

thrift

Spooling directory source:

spooldir

Kafka source:

org.apache.flume. source.kafka, KafkaSource

常见的flume channels

Memory Channel

memory

JDBC Channel

jdbc

Kafka Channel

org.apache.flume.channel.kafka.KafkaChannel

File Channel

file

常用的flume sinks

HDFS Sink

hdfs

HIVE Sink

hive

Logger Sink

logger

Avro Sink

avro

Kafka Sink

org.apache.flume.sink.kafka.KafkaSink

Hbase Sink

hbase

案例演示

======================================================================

案例演示:avro+memory+logger


Avro Source:监听一个指定的Avro端口,通过Awro端口可以获取到Avro client发送过来的文件,即只要应用程序通过Avro端口发送文件,source组件就可以获取到该文件中的内容,输出位置为Logger

1.1 编写采集方案

[root@tianqinglong01 flume]# mkdir flumeconf

[root@tianqinglong01 flume]# cd flumeconf

[root@tianqinglong01 flumeconf]# vi avro-logger.conf

#定义各个组件的名字

a1.sources=avro-sour1

a1.channels=mem-chan1

a1.sinks=logger-sink1

#定义sources组件的相关属性

a1.sources.avro-sour1.type=avro

a1.sources.avro-sour1.bind=tianqinglong01

a1.sources.avro-sour1.port=9999

#定义channels组件的相关属性

a1.channels.mem-chan1.type=memory

#定义sinks组件的相关属性

a1.sinks.logger-sink1.type=logger

a1.sinks.logger-sink1.maxBytesToLog=100

#组件之间进行绑定

a1.sources.avro-sour1.channels=mem-chan1

a1.sinks.logger-sink1.channel=mem-chan1

1.2 启动Agent

[root@tianqinglong01 flumeconf]# flume-ng agent -c …/cong -f ./avro-logger.conf -n a1 -Dflume.root.logger-INFO,console

再开一个客户端

[root@tianqinglong01 ~]# echo “hello flume” >> text

[root@tianqinglong01 ~]# flume-ng avro-client -c $FLUME_HOME/conf -H tianqinglong01 -p 9999 -F ./text

案例演示 实时采集(监听文件): exec+memory+hdfs


Exec Source:监听一个指定的命令,获取一条命令的结果作为它的数据源

#常用的是tail -F file指令,即只要应用程序向日志(文件)里面写数据,source组件就可以获取到日志(文件)中最新的内容

memory:传输数据的Channel为Memory

hdfs是输出目标为Hdfs

配置方案

[root@tianqinglong flumeconf]# vi exec-hdfs.conf

a1.sources=r1

a1.sources.r1.type=exec

a1.sources.r1.command=tail -F /root/flume-test-exec-hdfs

a1.sinks=k1

a1.sinks.k1.type=hdfs

a1.sinks.k1.hdfs.path=hdfs://qianfeng01:8020/flume/tailout/%Y-%m-%d

a1.sinks.k1.hdfs.filePrefix=events

a1.sinks.k1.hdfs.round=true

a1.sinks.k1.hdfs.roundValue=10

a1.sinks.k1.hdfs.roundUnit=second

a1.sinks.k1.hdfs.rollInterval=3

a1.sinks.k1.hdfs.rollSize=20

a1.sinks.k1.hdfs.rollCount=5

a1.sinks.k1.hdfs.batchSize=1

a1.sinks.k1.hdfs.useLocalTimeStamp=true

a1.sinks.k1.hdfs.fileType=DataStream

a1.channels=c1

a1.channels.c1.type=memory

a1.channels.c1.capacity=1000

a1.channels.c1.transactionCapacity=100

a1.sources.r1.channels=c1

a1.sinks.k1.channel=c1

启动Agent

[root@tianqinglong flumeconf]# flume-ng agent -c …/conf -f ./exec-hdfs.conf -n a1 -Dflume.root.logger-INFO,console

测试数据

[root@tianqinglong ~]# echo “hello world” >> flume-test-exec-hdfs

案例演示 实时采集(监听目录): spool+mem+logger


spool: Source来源于目录,有文件进入目录就摄取。mem:通过内存来传输数据

logger:是传送数据到日志

配置方案

[root@tianqinglong01 flumeconf]# vi spool-logger.conf

a1.sources = r1

a1.channels = c1

a1.sinks = s1

a1.sources.r1.type=spooldir

a1.sources.r1.spoolDir = /home/flume/spool

a1.sources.r1.fileSuffix =.COMPLETED

a1.sources.r1.deletePolicy=never

ai.sources.r1.fileHeader=false

a1.sources.r1.fileHeaderKey=file

a1.sources.r1.basenameHeader=false

a1.sources.r1.basenameHeaderKey=basename

a1.sources.r1.batchSize=100

a1.sources.r1.inputCharset=UTF-8

a1.sources.r1.bufferMaxLines=1000

a1.channels.c1.type=memory

a1.sinks.s1.type=logger

a1.sinks.s1.maxBytesToLog=16

a1.sources.r1.channels=c1

a1.sinks.s1.channel=c1

启动Agent

[root@tianqinglong flumeconf]# flume-ng agent -c …/conf -f ./spool-logger.conf -n a1 -Dflume.root.logger=INFO,console

测试

[root@tianqinglong ~]# for i in seq 1 10 ; do echo i > > / h o m e / f l u m e / s p o o l / i >> /home/flume/spool/ i>>/home/flume/spool/i;done

案例演示:http+mem+logger


http:表示数据来源是http网络协议,一般接收的请求为get或post请求.所有的http请求会通过插件格式的Handle转化为一个flume的Event数据.

mem:表示用内存传输通道

logger:表示输出格式为Logger格式

配置方案

[root@qianfengo1 flumeconf]# vi http-logger.conf

a1.sources = r1

a1.channels = c1

a1.sinks = s1

a1.sources.r1.type=http

a1.sources.r1.bind = tianqinglong01

a1.sources.r1.port = 6666

a1.sources.r1.handler = org.apache.flume. source.http.JSONHandler

a1.channels.c1.type=memory

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值