往ElasticSearch中导入.csv数据
很多网上提供的数据集往往是csv文件格式的数据,为了能够直接向ElasticSearch中添加数据集,除了支持json文件意外以外,ES也可以支持csv文件格式。
注意: logstash的版本应当与ElasticSearch一致。
在logstash安装目录下创建xxx.conf文件(名字随意)
input
{
file{
path => ["/文件的绝对路径/Reviews.csv"]
start_position => "beginning"
}
}
filter{
csv{
separator => ","
columns => ["Id","ProductId","UserId","ProfileName","HelpfulnessNumerator","HelpfulnessDenominator","Score","Time","Summary","Text"]
}
mutate{
convert => {
"Id" => "integer"
"ProductId" => "string"
"ProfileName" => "string"
"HelpfulnessNumerator" => "integer"
"HelpfulnessDenominator" => "integer"
"score" => "integer"
"Time" => "integer"
"Summary" => "string"
"Text" => "string"
}
}
}
output{
elasticsearch