tika1.16支持的文件格式

Apache Tika 是一个强大的解析库,能够识别和解析多种文件格式。包括但不限于 AppleSingle、ASM 类文件、音频文件(如 WAV、AIFF)、CHM 文件、源代码文件、PKCS7 签名、TSD 时间戳数据、DBF 数据库、DIF 文件、DWG 图形、Epub 电子书、可执行文件、Feed(RSS/ATOM)、字体文件、GDAL 数据、Geo 信息文件、HTML、图像(如 JPG、PNG、TIF、WebP)、IPTC 元数据、MATLAB 数据、Microsoft Office 文件(如 DOCX、XLSX、PPTX)、MBOX 邮件存档、MP3 音频、MP4 视频、NetCDF 数据、ODF 开放文档、PDF、RAR 压缩包、RTF 文本、TXT 文本、视频文件、WordPerfect 文件、XML 文件等。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

Full list of Supported Formats

    org.apache.tika.parser.apple.AppleSingleFileParser
        application/applefile
    org.apache.tika.parser.asm.ClassParser
        application/java-vm
    org.apache.tika.parser.audio.AudioParser
        audio/vnd.wave
        audio/x-wav
        audio/basic
        audio/x-aiff
    org.apache.tika.parser.audio.MidiParser
        application/x-midi
        audio/midi
    org.apache.tika.parser.chm.ChmParser
        application/vnd.ms-htmlhelp
        application/x-chm
        application/chm
    org.apache.tika.parser.code.SourceCodeParser
        text/x-c++src
        text/x-groovy
        text/x-java-source
    org.apache.tika.parser.crypto.Pkcs7Parser
        application/pkcs7-signature
        application/pkcs7-mime
    org.apache.tika.parser.crypto.TSDParser
        application/timestamped-data
    org.apache.tika.parser.dbf.DBFParser
        application/x-dbf
    org.apache.tika.parser.dif.DIFParser
        application/dif+xml
    org.apache.tika.parser.dwg.DWGParser
        image/vnd.dwg
    org.apache.tika.parser.epub.EpubParser
        application/x-ibooks+zip
        application/epub+zip
    org.apache.tika.parser.executable.ExecutableParser
        application/x-msdownload
        application/x-sharedlib
        application/x-elf
        application/x-object
        application/x-executable
        application/x-coredump
    org.apache.tika.parser.feed.FeedParser
        application/atom+xml
        application/rss+xml
    org.apache.tika.parser.font.AdobeFontMetricParser
        application/x-font-adobe-metric
    org.apache.tika.parser.font.TrueTypeParser
        application/x-font-ttf
    org.apache.tika.parser.gdal.GDALParser
        application/x-gsc
        image/x-ozi
        application/x-pds
        image/eir
        application/x-usgs-dem
        application/aaigrid
        application/x-bag
        application/elas
        application/x-rs2
        application/x-tsx
        application/x-lcp
        image/geotiff
        application/x-mbtiles
        application/x-cappi
        application/x-netcdf
        application/x-gsag
        application/x-epsilon
        application/x-ace2
        application/jaxa-pal-sar
        image/x-pcraster
        application/x-msgn
        image/arg
        application/x-hdf
        image/x-mff
        application/x-kro
        image/x-hdf5-image
        image/x-dimap
        image/x-srp
        image/big-gif
        application/x-envi
        application/x-cosar
        application/x-ntv2
        image/bmp
        application/x-doq2
        application/x-bt
        application/x-kml
        application/x-gmt
        application/x-rst
        application/vrt
        application/pcisdk
        application/x-ctg
        application/x-e00-grid
        application/x-rik
        image/ida
        image/x-mff2
        application/sdts-raster
        application/x-snodas
        image/jp2
        image/sar-ceos
        application/terragen
        application/x-wcs
        application/leveller
        application/x-ingr
        application/x-gtx
        image/sgi
        application/x-pnm
        image/raster
        application/fits
        application/x-r
        image/gif
        application/x-envi-hdr
        application/x-http
        application/x-rmf
        application/x-ecrg-toc
        application/aig
        application/x-rpf-toc
        image/adrg
        application/x-srtmhgt
        application/x-generic-bin
        application/jdem
        image/x-airsar
        application/x-webp
        application/x-ngs-geoid
        application/x-pcidsk
        image/x-fujibas
        application/x-wms
        application/x-map
        image/ceos
        application/xpm
        application/x-zmap
        image/envisat
        application/x-ers
        application/x-doq1
        application/x-isis2
        application/x-nwt-grd
        application/x-ppi
        image/ilwis
        application/x-isis3
        application/x-nwt-grc
        application/x-blx
        application/gff
        application/x-ndf
        image/jpeg
        application/x-geo-pdf
        application/x-l1b
        image/fit
        application/x-gsbg
        application/x-sdat
        application/x-ctable2
        application/x-grib
        application/x-coasp
        application/x-dipex
        application/grass-ascii-grid
        image/fits
        application/x-til
        application/x-dods
        image/png
        application/x-gxf
        application/x-gs7bg
        application/x-cpg
        application/x-lan
        application/x-xyz
        image/bsb
        application/x-p-aux
        application/dted
        application/x-rasterlite
        image/nitf
        image/hfa
        application/x-fast
        application/x-los-las
    org.apache.tika.parser.geo.topic.GeoParser
        application/geotopic
    org.apache.tika.parser.geoinfo.GeographicInformationParser
        text/iso19139+xml
    org.apache.tika.parser.grib.GribParser
        application/x-grib2
    org.apache.tika.parser.hdf.HDFParser
        application/x-hdf
    org.apache.tika.parser.html.HtmlParser
        text/html
        application/vnd.wap.xhtml+xml
        application/x-asp
        application/xhtml+xml
    org.apache.tika.parser.image.BPGParser
        image/bpg
        image/x-bpg
    org.apache.tika.parser.image.ICNSParser
        image/icns
    org.apache.tika.parser.image.ImageParser
        image/png
        image/vnd.wap.wbmp
        image/bmp
        image/x-xcf
        image/gif
        image/x-icon
        image/x-ms-bmp
    org.apache.tika.parser.image.PSDParser
        image/vnd.adobe.photoshop
    org.apache.tika.parser.image.TiffParser
        image/tiff
    org.apache.tika.parser.image.WebPParser
        image/webp
    org.apache.tika.parser.iptc.IptcAnpaParser
        text/vnd.iptc.anpa
    org.apache.tika.parser.isatab.ISArchiveParser
        application/x-isatab
    org.apache.tika.parser.iwork.IWorkPackageParser
        application/vnd.apple.keynote
        application/vnd.apple.iwork
        application/vnd.apple.numbers
        application/vnd.apple.pages
    org.apache.tika.parser.jpeg.JpegParser
        image/jpeg
    org.apache.tika.parser.mail.RFC822Parser
        message/rfc822
    org.apache.tika.parser.mat.MatParser
        application/x-matlab-data
    org.apache.tika.parser.mbox.MboxParser
        application/mbox
    org.apache.tika.parser.mbox.OutlookPSTParser
        application/vnd.ms-outlook-pst
    org.apache.tika.parser.microsoft.EMFParser
        image/emf
    org.apache.tika.parser.microsoft.JackcessParser
        application/x-msaccess
    org.apache.tika.parser.microsoft.MSOwnerFileParser
        application/x-ms-owner
    org.apache.tika.parser.microsoft.OfficeParser
        application/x-tika-msoffice-embedded; format=ole10_native
        application/msword
        application/vnd.visio
        application/vnd.ms-project
        application/x-tika-msworks-spreadsheet
        application/x-mspublisher
        application/vnd.ms-powerpoint
        application/x-tika-msoffice
        application/sldworks
        application/x-tika-ooxml-protected
        application/vnd.ms-excel
        application/vnd.ms-outlook
    org.apache.tika.parser.microsoft.OldExcelParser
        application/vnd.ms-excel.workspace.3
        application/vnd.ms-excel.workspace.4
        application/vnd.ms-excel.sheet.2
        application/vnd.ms-excel.sheet.3
        application/vnd.ms-excel.sheet.4
    org.apache.tika.parser.microsoft.TNEFParser
        application/vnd.ms-tnef
        application/x-tnef
        application/ms-tnef
    org.apache.tika.parser.microsoft.WMFParser
        image/wmf
    org.apache.tika.parser.microsoft.ooxml.OOXMLParser
        application/vnd.ms-powerpoint.template.macroenabled.12
        application/vnd.ms-excel.addin.macroenabled.12
        application/vnd.openxmlformats-officedocument.wordprocessingml.template
        application/vnd.ms-excel.sheet.binary.macroenabled.12
        application/vnd.openxmlformats-officedocument.wordprocessingml.document
        application/vnd.ms-powerpoint.slide.macroenabled.12
        application/vnd.ms-visio.drawing
        application/vnd.ms-powerpoint.slideshow.macroenabled.12
        application/vnd.ms-powerpoint.presentation.macroenabled.12
        application/vnd.openxmlformats-officedocument.presentationml.slide
        application/vnd.ms-excel.sheet.macroenabled.12
        application/vnd.ms-word.template.macroenabled.12
        application/vnd.ms-word.document.macroenabled.12
        application/vnd.ms-powerpoint.addin.macroenabled.12
        application/vnd.openxmlformats-officedocument.spreadsheetml.template
        application/vnd.ms-xpsdocument
        application/vnd.ms-visio.drawing.macroenabled.12
        application/vnd.ms-visio.template.macroenabled.12
        model/vnd.dwfx+xps
        application/vnd.openxmlformats-officedocument.presentationml.template
        application/vnd.openxmlformats-officedocument.presentationml.presentation
        application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
        application/vnd.ms-visio.stencil
        application/vnd.ms-visio.template
        application/vnd.openxmlformats-officedocument.presentationml.slideshow
        application/vnd.ms-visio.stencil.macroenabled.12
        application/vnd.ms-excel.template.macroenabled.12
    org.apache.tika.parser.microsoft.ooxml.xwpf.ml2006.Word2006MLParser
        application/vnd.ms-word2006ml
    org.apache.tika.parser.microsoft.xml.SpreadsheetMLParser
        application/vnd.ms-spreadsheetml
    org.apache.tika.parser.microsoft.xml.WordMLParser
        application/vnd.ms-wordml
    org.apache.tika.parser.mp3.Mp3Parser
        audio/mpeg
    org.apache.tika.parser.mp4.MP4Parser
        video/x-m4v
        application/mp4
        video/3gpp
        video/3gpp2
        video/quicktime
        audio/mp4
        video/mp4
    org.apache.tika.parser.netcdf.NetCDFParser
        application/x-netcdf
    org.apache.tika.parser.odf.OpenDocumentParser
        application/x-vnd.oasis.opendocument.presentation
        application/vnd.oasis.opendocument.chart
        application/x-vnd.oasis.opendocument.text-web
        application/x-vnd.oasis.opendocument.image
        application/vnd.oasis.opendocument.graphics-template
        application/vnd.oasis.opendocument.text-web
        application/x-vnd.oasis.opendocument.spreadsheet-template
        application/vnd.oasis.opendocument.spreadsheet-template
        application/vnd.sun.xml.writer
        application/x-vnd.oasis.opendocument.graphics-template
        application/vnd.oasis.opendocument.graphics
        application/vnd.oasis.opendocument.spreadsheet
        application/x-vnd.oasis.opendocument.chart
        application/x-vnd.oasis.opendocument.spreadsheet
        application/vnd.oasis.opendocument.image
        application/x-vnd.oasis.opendocument.text
        application/x-vnd.oasis.opendocument.text-template
        application/vnd.oasis.opendocument.formula-template
        application/x-vnd.oasis.opendocument.formula
        application/vnd.oasis.opendocument.image-template
        application/x-vnd.oasis.opendocument.image-template
        application/x-vnd.oasis.opendocument.presentation-template
        application/vnd.oasis.opendocument.presentation-template
        application/vnd.oasis.opendocument.text
        application/vnd.oasis.opendocument.text-template
        application/vnd.oasis.opendocument.chart-template
        application/x-vnd.oasis.opendocument.chart-template
        application/x-vnd.oasis.opendocument.formula-template
        application/x-vnd.oasis.opendocument.text-master
        application/vnd.oasis.opendocument.presentation
        application/x-vnd.oasis.opendocument.graphics
        application/vnd.oasis.opendocument.formula
        application/vnd.oasis.opendocument.text-master
    org.apache.tika.parser.pdf.PDFParser
        application/pdf
    org.apache.tika.parser.pkg.CompressorParser
        application/zlib
        application/x-gzip
        application/x-lz4
        application/x-bzip2
        application/x-snappy
        application/x-compress
        application/x-java-pack200
        application/x-lzma
        application/gzip
        application/x-bzip
        application/x-xz
    org.apache.tika.parser.pkg.PackageParser
        application/x-tar
        application/java-archive
        application/x-arj
        application/x-archive
        application/zip
        application/x-cpio
        application/x-tika-unix-dump
        application/x-7z-compressed
    org.apache.tika.parser.pkg.RarParser
        application/x-rar-compressed
    org.apache.tika.parser.rtf.RTFParser
        application/rtf
    org.apache.tika.parser.txt.TXTParser
        text/plain
    org.apache.tika.parser.video.FLVParser
        video/x-flv
    org.apache.tika.parser.wordperfect.QuattroProParser
        application/x-quattro-pro; version=9
    org.apache.tika.parser.wordperfect.WordPerfectParser
        application/vnd.wordperfect; version=5.1
        application/vnd.wordperfect; version=5.0
        application/vnd.wordperfect; version=6.x
    org.apache.tika.parser.xml.DcXMLParser
        application/xml
        image/svg+xml
    org.apache.tika.parser.xml.FictionBookParser
        application/x-fictionbook+xml
    org.gagravarr.tika.FlacParser
        audio/x-oggflac
        audio/x-flac
    org.gagravarr.tika.OggParser
        audio/ogg
        application/kate
        application/ogg
        video/daala
        video/x-ogguvs
        video/x-ogm
        audio/x-oggpcm
        video/ogg
        video/x-dirac
        video/x-oggrgb
        video/x-oggyuv
    org.gagravarr.tika.OpusParser
        audio/opus
        audio/ogg; codecs=opus
    org.gagravarr.tika.SpeexParser
        audio/ogg; codecs=speex
        audio/speex
    org.gagravarr.tika.TheoraParser
        video/theora
    org.gagravarr.tika.VorbisParser
        audio/vorbis
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值