Implementing a Hive UDF for parsing the QQWry (纯真) IP address database

Parsing code for the QQWry (纯真) IP address database file qqwry.dat: https://github.com/difeng/qqwry

The Hive UDF below is built on that code. Having the lookup available as a UDF makes data analysis in Hive more convenient.

Add the Hive and Hadoop dependencies to pom.xml:

<dependency>
    <groupId>org.apache.hive</groupId>
    <artifactId>hive-exec</artifactId>
    <version>1.2.1</version>
</dependency>
<dependency>
    <groupId>org.apache.hadoop</groupId>
    <artifactId>hadoop-common</artifactId>
    <version>2.7.3</version>
</dependency>
<dependency>
    <groupId>org.apache.hadoop</groupId>
    <artifactId>hadoop-hdfs</artifactId>
    <version>2.7.3</version>
</dependency>

 

package common.udf.qqwry2;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.hive.ql.exec.UDF;
import java.io.*;
import java.net.URI;

public class IPLocation extends UDF {
    private static Configuration configuration;
    private static FileSystem fileSystem;
    private static InputStream in;
    private static byte[] data;
    private long firstIndexOffset;
    private long lastIndexOffset;
    private long totalIndexCount;
    private static final byte REDIRECT_MODE_1 = 0x01;
    private static final byte REDIRECT_MODE_2 = 0x02;
    static final long IP_RECORD_LENGTH = 7;
    private static Long lastModifyTime = 0L;
    public static boolean enableFileWatch = false;

    static {
        try {
            configuration = new Configuration();
            fileSystem = FileSystem.get(URI.create("hdfs:///data/qqwry.dat"), configuration);
            in = fileSystem.open(new Path("hdfs:///data/qqwry.dat"));
            ByteArrayOutputStream out = null;
            out = new ByteAr
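
The listing above is incomplete. As a rough sketch of what the remaining initialization presumably does, the example below drains an InputStream into a byte array and then reads the index header from it. It assumes the commonly described qqwry.dat layout: the first 4 bytes hold the offset of the first index record, the next 4 bytes the offset of the last one (both little-endian), and each index record is 7 bytes long, which matches IP_RECORD_LENGTH in the listing. The class QqwryHeaderSketch and its methods readAll, readUInt32LE and parseHeader are hypothetical names used only for illustration; they are not taken from the original class.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper for illustration; not part of the original IPLocation class.
class QqwryHeaderSketch {
    // Same value as IP_RECORD_LENGTH in the listing above.
    static final long IP_RECORD_LENGTH = 7;

    // Drain a stream into memory. Works with any InputStream, including
    // the one returned by FileSystem.open(...).
    static byte[] readAll(InputStream in) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        byte[] buffer = new byte[4096];
        int n;
        while ((n = in.read(buffer)) != -1) {
            out.write(buffer, 0, n);
        }
        return out.toByteArray();
    }

    // Read 4 bytes starting at `offset` as an unsigned little-endian integer.
    static long readUInt32LE(byte[] data, int offset) {
        return (data[offset] & 0xFFL)
                | (data[offset + 1] & 0xFFL) << 8
                | (data[offset + 2] & 0xFFL) << 16
                | (data[offset + 3] & 0xFFL) << 24;
    }

    // Assumed header layout: bytes 0-3 = offset of the first index record,
    // bytes 4-7 = offset of the last index record.
    // Returns {firstIndexOffset, lastIndexOffset, totalIndexCount}.
    static long[] parseHeader(byte[] data) {
        long first = readUInt32LE(data, 0);
        long last = readUInt32LE(data, 4);
        long count = (last - first) / IP_RECORD_LENGTH + 1;
        return new long[] {first, last, count};
    }

    public static void main(String[] args) throws IOException {
        // Synthetic 8-byte header (not real qqwry.dat data):
        // first index record at offset 8, last at offset 22.
        byte[] fake = {8, 0, 0, 0, 22, 0, 0, 0};
        long[] header = parseHeader(readAll(new ByteArrayInputStream(fake)));
        System.out.println(header[0] + " " + header[1] + " " + header[2]);
    }
}
```

In the UDF itself, the stream would come from fileSystem.open(...) as in the static block above, and the three values would populate firstIndexOffset, lastIndexOffset and totalIndexCount.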