结构化文本查询

原创于 2022-07-11 11:19:44 发布 · 163 阅读

0 ·

CC 4.0 BY-SA版权

文章标签：

#文本 #结构化

JAVA计算专栏收录该内容

363 篇文章

订阅专栏

博客探讨了在处理tab-delimited文件时，如何检查第三列是否包含特定单词并根据第四列内容输出相应结果的问题。指出原始Python代码的不足，并推荐使用SPL语言进行更简洁的结构化查询，包括动态条件查询和在不同应用程序中调用SPL脚本的方法。

【问题】

I'm trying to check whether column 3 of a tab-delimited file contains a certain word. If it does not, it should continue reading. If it does contain the word, it should check column 4. Depending on whether there is content in column 4, the output should be something found or something not found.

I'm not stuck on the second part of this, i.e. checking column 4. My output gives me"something found" when there is in fact no content there.

for line in f:

if line.strip()split("\t")[2] == "word":

print ("word")

if line.strip().split("\t")[3] is not None:

print ("something found")

else:

print("nothing found")

The file looks like this:

reference #1 reference #2 notword content ...(more columns)
reference #1 reference #2 word content ...
reference #1 reference #2 word noContent ...