First, start ZooKeeper:
zkServer.sh start
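To confirm ZooKeeper is running (an optional check; this assumes zkServer.sh is on the PATH, as in the start command above):
zkServer.sh status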
Then start Kafka:
bin/kafka-server-start.sh config/server.properties
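Started this way, kafka-server-start.sh keeps the terminal attached. As an alternative, the script also accepts a -daemon flag to run the broker in the background, after which jps should list a Kafka process; for the three-broker cluster addressed below, repeat the start on each node:
bin/kafka-server-start.sh -daemon config/server.properties
jps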
Create a topic:
bin/kafka-topics.sh --create --bootstrap-server 192.168.35.125:9092,192.168.35.126:9092,192.168.35.127:9092 --replication-factor 3 --partitions 3 --topic testkafka
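Optionally, verify that the topic exists with the expected partition and replica counts (any one of the brokers can be queried):
bin/kafka-topics.sh --describe --bootstrap-server 192.168.35.125:9092 --topic testkafka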
In Flume's conf directory, create a file named flume-kafka.conf with the following content:
# define
a1.sources = r1
a1.sinks = k1
a1.channels = c1
# source
a1.sources.r1.type = exec
# path of the log file to tail
a1.sources.r1.command = tail -F -c +0 /opt/logs/test.log
a1.sources.r1.shell = /bin/bash -c
# channel
a1.channels.c1.type = memory
a1.channels.c1.capacity = 1000
a1.channels.c1.transactionCapacity = 100
# sink
a1.sinks.k1.type = org.apache.flume.sink.kafka.KafkaSink
a1.sinks.k1.kafka.topic = testkafka
a1.sinks.k1.kafka.bootstrap.servers = 192.168.35.125:9092,192.168.35.126:9092,192.168.35.127:9092
a1.sinks.k1.kafka.flumeBatchSize = 20
a1.sinks.k1.kafka.producer.acks = 1
a1.sinks.k1.kafka.producer.linger.ms = 1
# bind the source and the sink to the channel
a1.sources.r1.channels = c1
a1.sinks.k1.channel = c1
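With the configuration in place, start the Flume agent from the Flume installation directory (the agent name a1 matches the configuration above). One way to verify the pipeline is to append a sample line to the tailed log and read it back with a Kafka console consumer; the echoed text below is only an illustration:
bin/flume-ng agent --conf conf --conf-file conf/flume-kafka.conf --name a1 -Dflume.root.logger=INFO,console
echo "hello flume kafka" >> /opt/logs/test.log
bin/kafka-console-consumer.sh --bootstrap-server 192.168.35.125:9092,192.168.35.126:9092,192.168.35.127:9092 --topic testkafka --from-beginning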
Flume-Kafka-Flink data integration in practice: this post covers configuring the flume-kafka.conf file, starting the Flume agent, and verifying that data is pulled into Kafka, and then demonstrates a WordCount analysis in IDEA of the content received from Kafka, with dependencies managed through pom.xml.