Flume Multi-Source Aggregation Example

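1. Requirements analysis

        Flume1 on hadoop103 monitors the file /opt/module/group.log, and Flume2 on hadoop102 monitors the data stream on port 44444. Both send their data to Flume3 on hadoop104, which prints the merged stream to the console.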

2. Implementation steps

1. Preparation

        Distribute Flume

        [zz@hadoop102 module]$ xsync flume

        Create a group3 folder under /opt/module/flume/job on hadoop102, hadoop103, and hadoop104.

        [zz@hadoop102 job]$ mkdir group3

        [zz@hadoop103 job]$ mkdir group3

        [zz@hadoop104 job]$ mkdir group3
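
The three mkdir commands above can also be issued from a single node. The following is a minimal sketch, assuming passwordless ssh between the nodes; make_job_dirs and the RUN switch are hypothetical names introduced here for illustration. By default it only prints the commands it would run.

```shell
#!/usr/bin/env bash
# Hypothetical helper: create the job directory on every node in one pass.
# Assumption: the current user can ssh to each host without a password.
HOSTS="hadoop102 hadoop103 hadoop104"
JOB_DIR=/opt/module/flume/job/group3

# Prints one command per host; set RUN=1 to execute them over ssh instead.
make_job_dirs() {
  local host
  for host in $HOSTS; do
    if [ "${RUN:-0}" = "1" ]; then
      ssh "$host" "mkdir -p $JOB_DIR"
    else
      echo "ssh $host mkdir -p $JOB_DIR"
    fi
  done
}
```

Calling make_job_dirs shows the planned commands; RUN=1 make_job_dirs executes them. mkdir -p is used so that rerunning the helper does not fail if the folder already exists.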

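2. Create flume1-logger-flume.conf

        Configure a Source to monitor the group.log file and a Sink to forward the data to the next-level Flume.

        Create the configuration file on hadoop103 and open it

        [zz@hadoop103 group3]$ touch flume1-logger-flume.conf

        [zz@hadoop103 group3]$ vim flume1-logger-flume.conf

        Add the following content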

        # Name the components on this agent

        a1.sources = r1

        a1.sinks = k1

        a1.channels = c1

        # Describe/configure the source

        a1.sources.r1.type = exec

        a1.sources.r1.command = tail -F /opt/module/group.log

        a1.sources.r1.shell = /bin/bash -c

        # Describe the sink

        a1.sinks.k1.type = avro

        a1.sinks.k1.hostname = hadoop104

        a1.sinks.k1.port = 4141

        # Describe the channel

        a1.channels.c1.type = memory

        a1.channels.c1.capacity = 1000

        a1.channels.c1.transactionCapacity = 100

        # Bind the source and sink to the channel

        a1.sources.r1.channels = c1

        a1.sinks.k1.channel = c1

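3. Create flume2-netcat-flume.conf

        Configure a Source to monitor the data stream on port 44444 and a Sink to forward the data to the next-level Flume:

        Create the configuration file on hadoop102 and open it

        [zz@hadoop102 group3]$ touch flume2-netcat-flume.conf

        [zz@hadoop102 group3]$ vim flume2-netcat-flume.conf

        Add the following content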

        # Name the components on this agent

        a2.sources = r1

        a2.sinks = k1

        a2.channels = c1

        # Describe/configure the source

        a2.sources.r1.type = netcat

        a2.sources.r1.bind = hadoop102

        a2.sources.r1.port = 44444

        # Describe the sink

        a2.sinks.k1.type = avro

        a2.sinks.k1.hostname = hadoop104

        a2.sinks.k1.port = 4141

        # Use a channel which buffers events in memory

        a2.channels.c1.type = memory

        a2.channels.c1.capacity = 1000

        a2.channels.c1.transactionCapacity = 1000

        # Bind the source and sink to the channel

        a2.sources.r1.channels = c1

        a2.sinks.k1.channel = c1

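4. Create flume3-flume-logger.conf

        Configure a Source to receive the data streams sent by flume1 and flume2, and a Sink that writes the merged stream to the console.

        Create the configuration file on hadoop104 and open it

        [zz@hadoop104 group3]$ touch flume3-flume-logger.conf

        [zz@hadoop104 group3]$ vim flume3-flume-logger.conf

        Add the following content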

        # Name the components on this agent

        a3.sources = r1

        a3.sinks = k1

        a3.channels = c1

        # Describe/configure the source

        a3.sources.r1.type = avro

        a3.sources.r1.bind = hadoop104

        a3.sources.r1.port = 4141

        # Describe the sink

        a3.sinks.k1.type = logger

        # Describe the channel

        a3.channels.c1.type = memory

        a3.channels.c1.capacity = 1000

        a3.channels.c1.transactionCapacity = 100

        # Bind the source and sink to the channel

        a3.sources.r1.channels = c1

        a3.sinks.k1.channel = c1
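
All three configurations use a memory channel, and a memory channel's transactionCapacity must not exceed its capacity. The sketch below is one way to check a configuration file for this before starting an agent. check_channel_caps is a hypothetical helper, not part of Flume, and it only inspects channels that set both properties explicitly.

```shell
#!/usr/bin/env bash
# check_channel_caps FILE
# Hypothetical helper: returns non-zero if any channel in FILE sets
# transactionCapacity higher than capacity. Comment lines are skipped.
check_channel_caps() {
  awk -F'[ \t]*=[ \t]*' '
    /^[ \t]*#/ { next }
    {
      key = $1; gsub(/^[ \t]+|[ \t]+$/, "", key)
      val = $2; gsub(/[ \t\r]+$/, "", val)
    }
    key ~ /\.channels\.[^.]+\.capacity$/ {
      ch = key; sub(/\.capacity$/, "", ch); cap[ch] = val + 0
    }
    key ~ /\.channels\.[^.]+\.transactionCapacity$/ {
      ch = key; sub(/\.transactionCapacity$/, "", ch); tx[ch] = val + 0
    }
    END {
      bad = 0
      for (ch in tx) {
        if ((ch in cap) && tx[ch] > cap[ch]) {
          print ch ": transactionCapacity " tx[ch] " > capacity " cap[ch]
          bad = 1
        }
      }
      exit bad
    }' "$1"
}
```

For example, check_channel_caps job/group3/flume2-netcat-flume.conf succeeds for the a2 configuration, where both values are 1000.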

3. Run the configuration files

        Start the agents in this order: flume3-flume-logger.conf, flume2-netcat-flume.conf, flume1-logger-flume.conf.

        [zz@hadoop104 flume]$ bin/flume-ng agent --conf conf/ --name a3 --conf-file job/group3/flume3-flume-logger.conf -Dflume.root.logger=INFO,console

        [zz@hadoop102 flume]$ bin/flume-ng agent --conf conf/ --name a2 --conf-file job/group3/flume2-netcat-flume.conf

        [zz@hadoop103 flume]$ bin/flume-ng agent --conf conf/ --name a1 --conf-file job/group3/flume1-logger-flume.conf
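
The downstream agent is started first so that its Avro Source on hadoop104:4141 is already listening when the two upstream Avro Sinks try to connect. The sketch below polls a TCP port until it accepts connections. wait_for_port is a hypothetical helper, not part of Flume, and it assumes bash with /dev/tcp support.

```shell
#!/usr/bin/env bash
# wait_for_port HOST PORT [TRIES]
# Hypothetical helper: tries a TCP connect once per second, up to TRIES
# times (default 30). Returns 0 as soon as a connect succeeds, 1 otherwise.
wait_for_port() {
  local host=$1 port=$2 tries=${3:-30} i
  for ((i = 0; i < tries; i++)); do
    # The subshell opens and immediately closes the connection.
    if (exec 3<>"/dev/tcp/$host/$port") 2>/dev/null; then
      return 0
    fi
    sleep 1
  done
  return 1
}
```

On hadoop102 or hadoop103 it could be used as wait_for_port hadoop104 4141 && bin/flume-ng agent ... so that the upstream agent starts only once the port is reachable.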

4. On hadoop103, append content to group.log in the /opt/module directory

        [zz@hadoop103 module]$ echo 'hello' >> group.log
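
Since the Exec Source follows the file with tail -F, test data is best appended to the file rather than written over it. The sketch below appends several numbered lines in one call. append_test_lines is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# append_test_lines FILE COUNT
# Hypothetical helper: appends COUNT numbered lines to FILE.
# Uses >> so that existing content is never truncated.
append_test_lines() {
  local file=$1 count=$2 i=1
  while [ "$i" -le "$count" ]; do
    echo "hello $i" >> "$file"
    i=$((i + 1))
  done
}
```

For example, append_test_lines /opt/module/group.log 5 adds five lines, each of which should appear as a separate event on the hadoop104 console.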

5. On hadoop102, send data to port 44444

        [zz@hadoop102 flume]$ telnet hadoop102 44444

6. Check the data on hadoop104