
Dynamic partitioning of Hive tables with Spark SQL


1. Enable the Hive dynamic partition parameters
import org.apache.spark.sql.SparkSession

val spark = SparkSession
  .builder()
  .appName("test")
  .enableHiveSupport()
  .config("spark.sql.adaptive.enabled", true)
  .config("spark.sql.adaptive.shuffle.targetPostShuffleInputSize", 134217728L) // 128 MB
  .config("hive.exec.dynamic.partition", true) // enable Hive dynamic partitioning
  .config("hive.exec.dynamic.partition.mode", "nonstrict") // non-strict mode: every partition column may be dynamic
  .getOrCreate()
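If the session has already been created elsewhere (for example in a notebook or a shared job framework), the same two Hive settings can usually be applied at runtime instead of on the builder. The following is a minimal sketch of that idea; the object name `DynamicPartitionSettings` and the method `enable` are hypothetical helpers introduced here for illustration, not part of Spark.

```scala
import org.apache.spark.sql.SparkSession

object DynamicPartitionSettings {
  // Hypothetical helper: applies the two Hive dynamic-partition settings
  // to an already running, Hive-enabled SparkSession.
  def enable(spark: SparkSession): Unit = {
    // Allow dynamic partitions at all.
    spark.conf.set("hive.exec.dynamic.partition", "true")
    // Non-strict mode: no static partition value is required in the insert.
    spark.conf.set("hive.exec.dynamic.partition.mode", "nonstrict")
  }
}
```

Calling `DynamicPartitionSettings.enable(spark)` before the insert statement should have the same effect as the builder configuration above, assuming the session was created with `enableHiveSupport()`.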
2. Dynamic partition insert
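// Abridged query: the source tables aliased as a and b are not shown here.
spark.sql(
  "insert overwrite table test1 partition(day,hour) " +
  "select" +
  " a.platform," +
  " a.aid," +
  " a.request_num," +
  " b.play_num," +
  " b.accomplish_num," +
  " b.click_num," +
  " a.day," +  // dynamic partition column 1
  " a.hour" +  // dynamic partition column 2
  " from test"
)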
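Note: the dynamic partition columns day and hour must come last in the SELECT list. In testing, placing them earlier did not help: the last two selected columns were still used as the partition values. This shows that dynamic partition columns are matched by position, not by name.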
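Because the matching is positional, one way to guard against a misordered SELECT list is to build the column order programmatically. The sketch below is plain Scala with no Spark dependency; the object `DynamicPartitionColumns` and both of its methods are hypothetical helpers written for this illustration. It moves the partition columns to the end of the list, in the order they are declared in the `partition(...)` clause, and then assembles the insert statement.

```scala
object DynamicPartitionColumns {
  // Returns `columns` with the partition columns moved to the end,
  // in the order given by `partitionCols`.
  def partitionColumnsLast(columns: Seq[String], partitionCols: Seq[String]): Seq[String] = {
    val missing = partitionCols.filterNot(p => columns.contains(p))
    require(missing.isEmpty, s"partition columns missing from select list: ${missing.mkString(", ")}")
    columns.filterNot(c => partitionCols.contains(c)) ++ partitionCols
  }

  // Builds an INSERT OVERWRITE statement whose SELECT list ends with
  // the dynamic partition columns.
  def insertOverwriteSql(
      target: String,
      source: String,
      columns: Seq[String],
      partitionCols: Seq[String]): String = {
    val ordered = partitionColumnsLast(columns, partitionCols)
    s"insert overwrite table $target partition(${partitionCols.mkString(",")}) " +
      s"select ${ordered.mkString(", ")} from $source"
  }
}
```

The resulting string can be passed to `spark.sql(...)`. The same care applies to the DataFrame API: `DataFrameWriter.insertInto` also resolves columns by position rather than by name, so the DataFrame's columns need to be in the target table's order, with the partition columns last.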
