栏目分类:
子分类:
返回
名师互学网用户登录
快速导航关闭
当前搜索
当前分类
子分类
实用工具
热门搜索
名师互学网 > IT > 前沿技术 > 大数据 > 大数据系统

spark使用外部配置文件yarn cluster模式提交sparkstructedstreaming任务,解决流式日志过大问题

spark使用外部配置文件yarn cluster模式提交sparkstructedstreaming任务,解决流式日志过大问题

spark 提交脚本:

nohup /opt/soft/spark3/bin/spark-submit 
--master yarn 
--deploy-mode cluster 
--driver-memory 1g 
--num-executors 3 
--total-executor-cores 2 
--executor-memory 2g 
--queue spark 

--conf spark.eventLog.enabled=false 
--conf spark.driver.extraJavaOptions=-Dlog4j.configuration=file:driver-log4j.properties 
--conf spark.executor.extraJavaOptions=-Dlog4j.configuration=file:executor-log4j.properties 

--files ./driver-log4j.properties,./executor-log4j.properties 
--class streaming.SSSHudiETL 
--jars /opt/soft/hudi/hudi-0.9.0/packaging/hudi-spark-bundle/target/hudi-spark3-bundle_2.12-0.9.0.jar  --packages org.apache.spark:spark-avro_2.12:3.0.2 streaming-1.0-SNAPSHOT-jar-with-dependencies.jar &


driver-log4j.properties :

log4j.rootLogger =warn,stdout

log4j.appender.stdout = org.apache.log4j.ConsoleAppender
log4j.appender.stdout.Target = System.out
log4j.appender.stdout.layout = org.apache.log4j.PatternLayout
log4j.appender.stdout.layout.ConversionPattern = %-d{yyyy-MM-dd HH:mm} %5p %t %c{2}:%L - %m%n


executor-log4j.properties :

log4j.rootLogger =warn,stdout,rolling

log4j.appender.stdout = org.apache.log4j.ConsoleAppender
log4j.appender.stdout.Target = System.out
log4j.appender.stdout.layout = org.apache.log4j.PatternLayout
log4j.appender.stdout.layout.ConversionPattern = %-d{yyyy-MM-dd HH:mm} %5p %t %c{2}:%L - %m%n

log4j.appender.rolling=org.apache.log4j.RollingFileAppender
log4j.appender.rolling.layout=org.apache.log4j.PatternLayout
log4j.appender.rolling.layout.conversionPattern=%-d{yyyy-MM-dd HH:mm:ss} %5p %t %c{2}:%L - %m%n
log4j.appender.rolling.maxFileSize=100MB
log4j.appender.rolling.maxBackupIndex=5
log4j.appender.rolling.file=${spark.yarn.app.container.log.dir}/stdout
log4j.appender.rolling.encoding=UTF-8


目录结构:

查看container 下日志

-rw-r--r-- 1 yarn hadoop 34840 Feb  9 13:00 directory.info
-rw-r----- 1 yarn hadoop  5709 Feb  9 13:00 launch_container.sh
-rw-r--r-- 1 yarn hadoop     0 Feb  9 13:00 prelaunch.err
-rw-r--r-- 1 yarn hadoop   100 Feb  9 13:00 prelaunch.out
-rw-r--r-- 1 yarn hadoop  4073 Feb  9 13:00 stderr
-rw-r--r-- 1 yarn hadoop     0 Feb  9 13:00 stdout
转载请注明:文章转载自 www.mshxw.com
本文地址:https://www.mshxw.com/it/742788.html
我们一直用心在做
关于我们 文章归档 网站地图 联系我们

版权所有 (c)2021-2022 MSHXW.COM

ICP备案号:晋ICP备2021003244-6号