Spring配置项<context:annotation-config>的解释说明

本文介绍了Spring框架中使用&lt;context:annotation-config&gt;配置项注册BeanPostProcessor的方法,包括AutowiredAnnotationBeanPostProcessor等4种处理器的作用及依赖注解,并讨论了简化配置方式。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

今天在闲逛优快云论坛时,看到一位博主写的一篇关于《Spring中IOC的Annotation的实现》的文章, 于是点击进去看了下, 发现在说明中对Spring配置文件中的有些配置节点模凌两可,表述的存在问题,于是自己在Demo中实现并实际操作了下,再次做个笔录。

我们一般在含有Spring的项目中,可能会看到配置项中包含这个配置节点<context:annotation-config>,这是一条向Spring容器中注册

AutowiredAnnotationBeanPostProcessor

CommonAnnotationBeanPostProcessor

PersistenceAnnotationBeanPostProcessor

RequiredAnnotationBeanPostProcessor

这4个BeanPostProcessor.注册这4个BeanPostProcessor的作用,就是为了你的系统能够识别相应的注解。

那么那些注释依赖这些Bean呢。

如果想使用@ Resource 、@ PostConstruct、@ PreDestroy等注解就必须声明CommonAnnotationBeanPostProcessor。
如果想使用@PersistenceContext注解,就必须声明PersistenceAnnotationBeanPostProcessor的Bean。
如果想使用@Autowired注解,那么就必须事先在 Spring 容器中声明 AutowiredAnnotationBeanPostProcessor Bean。
如果想使用 @Required的注解,就必须声明RequiredAnnotationBeanPostProcessor的Bean。

同样,传统的声明方式如下: 

<bean class="org.springframework.beans.factory.annotation.AutowiredAnnotationBeanPostProcessor "/> 

 但是,就一般而言,这些注解我们是经常使用,比如Antowired,Resuource等注解,如果总是按照传统的方式一条一条的配置,感觉比较繁琐和机械。于是Spring给我们提供了<context:annotation-config/>的简化的配置方式,自动帮助你完成声明,并且还自动搜索@Component , @Controller , @Service , @Repository等标注的类。

<context:component-scan base-package="com.**.impl"/>

因此当使用 <context:component-scan/> 后,就可以将 <context:annotation-config/> 移除了。

转载于:https://www.cnblogs.com/_popc/p/3972212.html

<?xml version="1.0" encoding="UTF-8"?> <web-app xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns="http://java.sun.com/xml/ns/javaee" xsi:schemaLocation="http://java.sun.com/xml/ns/javaee http://java.sun.com/xml/ns/javaee/web-app_3_0.xsd" id="WebApp_ID" version="3.0"> <display-name>ssm</display-name> <!-- 编码过滤器开始 --> <filter> <filter-name>charset</filter-name> <filter-class>org.springframework.web.filter.CharacterEncodingFilter</filter-class> <init-param> <param-name>encoding</param-name> <param-value>UTF-8</param-value> </init-param> </filter> <filter-mapping> <filter-name>charset</filter-name> <!-- <url-pattern>*.action</url-pattern> --> <servlet-name>springmvc</servlet-name> </filter-mapping> <!-- 编码过滤器结束 --> <!-- 配置spring容器的监听器 开始 目地:在启用tomcat的时候加载 applicationContext.xml ApplicationContext context=new ClassPathXmlApplicationContext("classpath:applicationContext.xml') --> <listener> <listener-class>org.springframework.web.context.ContextLoaderListener</listener-class> </listener> <!-- 配置上下文的参数 contextConfigLocation 在org.springframework.web.context.ContextLoader里面--> <context-param> <param-name>contextConfigLocation</param-name> <param-value>classpath:applicationContext.xml</param-value> </context-param> <!-- 配置spring容器的监听器 结束--> <!-- 配置前端控制器开始 --> <servlet> <servlet-name>springmvc</servlet-name> <servlet-class>org.springframework.web.servlet.DispatcherServlet</servlet-class> <!--配置contextConfigLocation --> <init-param> <param-name>contextConfigLocation</param-name> <param-value>classpath:springmvc.xml</param-value> </init-param> <!-- load-on-startup 1 代表当tomcat启动加web.xml里就创建 DispatcherServlet的对象 --> <load-on-startup>1</load-on-startup> </servlet> <servlet-mapping> <servlet-name>springmvc</servlet-name> <!-- *.action:所以的在xxxx.action结尾的请求都交给DispatcherServlet /* 所以的请求都交给DispacherServlet 包含静态文件的地址 js css png gif,使用此种方式可以实现 RESTful风格的url --> <url-pattern>*.html</url-pattern> </servlet-mapping> <!-- 配置前端控制器结束 --> <!--欢迎页面 当tomcat启动的时候 直接执行的页面--> <welcome-file-list> <welcome-file>index.html</welcome-file> </welcome-file-list> </web-app> classpath填什么?
最新发布
08-08
package com.ljy.test; import com.ljy.domain.Book; import com.ljy.service.BookService; import com.ljy.config.SpringConfig; // 正确导入配置类 import org.junit.Test; import org.junit.runner.RunWith; import org.springframework.beans.factory.annotation.Autowired; import org.springframework.test.context.ContextConfiguration; import org.springframework.test.context.junit4.SpringJUnit4ClassRunner; @RunWith(SpringJUnit4ClassRunner.class) @ContextConfiguration(classes = SpringConfig.class) public class BookServiceTest { @Autowired private BookService bookService; @Test public void testFindBookById() { Book book = bookService.findBookById(1); System.out.println("图书id: " + book.getId()); // 确保 Book 类有 getId() } }无法解析符号 'test'无法解析符号 'SpringJUnit4ClassRunner'无法解析符号 'ContextConfiguration'<project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 http://maven.apache.org/xsd/maven-4.0.0.xsd"> <modelVersion>4.0.0</modelVersion> <groupId>com.ljy</groupId> <artifactId>ljy-book</artifactId> <version>1.0-SNAPSHOT</version> <packaging>war</packaging> <properties> <spring.version>5.3.28</spring.version> <mybatis.version>3.5.7</mybatis.version> <mybatis-spring.version>2.0.6</mybatis-spring.version> <mysql-connector.version>8.0.33</mysql-connector.version> <druid.version>1.2.16</druid.version> <servlet.version>3.1.0</servlet.version> <!-- 切换回Java EE版本 --> <jsp.version>2.2.1-b03</jsp.version> <!-- 切换回Java EE版本 --> <jstl.version>1.2</jstl.version> <maven.compiler.source>1.8</maven.compiler.source> <maven.compiler.target>1.8</maven.compiler.target> <project.build.sourceEncoding>UTF-8</project.build.sourceEncoding> </properties> <dependencyManagement> <dependencies> <dependency> <groupId>org.springframework</groupId> <artifactId>spring-framework-bom</artifactId> <version>${spring.version}</version> <type>pom</type> <scope>import</scope> </dependency> </dependencies> </dependencyManagement> <repositories> <repository> <id>aliyun-maven</id> <url>https://maven.aliyun.com/repository/central</url> <releases> <enabled>true</enabled> </releases> <snapshots> <enabled>false</enabled> </snapshots> </repository> <repository> <id>central</id> <url>https://repo.maven.apache.org/maven2</url> <releases> <enabled>true</enabled> </releases> <snapshots> <enabled>false</enabled> </snapshots> </repository> </repositories> <dependencies> <!-- Spring Core --> <dependency> <groupId>org.springframework</groupId> <artifactId>spring-context</artifactId> </dependency> <dependency> <groupId>org.springframework</groupId> <artifactId>spring-webmvc</artifactId> </dependency> <dependency> <groupId>org.springframework</groupId> <artifactId>spring-jdbc</artifactId> </dependency> <dependency> <groupId>org.springframework</groupId> <artifactId>spring-tx</artifactId> </dependency> <!-- MyBatis --> <dependency> <groupId>org.mybatis</groupId> <artifactId>mybatis</artifactId> <version>${mybatis.version}</version> </dependency> <dependency> <groupId>org.mybatis</groupId> <artifactId>mybatis-spring</artifactId> <version>${mybatis-spring.version}</version> </dependency> <!-- Database --> <dependency> <groupId>com.mysql</groupId> <artifactId>mysql-connector-j</artifactId> <version>${mysql-connector.version}</version> <scope>runtime</scope> </dependency> <dependency> <groupId>com.alibaba</groupId> <artifactId>druid</artifactId> <version>${druid.version}</version> </dependency> <!-- Servlet/JSP --> <dependency> <groupId>javax.servlet</groupId> <artifactId>javax.servlet-api</artifactId> <version>${servlet.version}</version> <scope>provided</scope> </dependency> <dependency> <groupId>javax.servlet.jsp</groupId> <artifactId>jsp-api</artifactId> <version>${jsp.version}</version> <scope>provided</scope> </dependency> <dependency> <groupId>javax.servlet</groupId> <artifactId>jstl</artifactId> <version>${jstl.version}</version> </dependency> <dependency> <groupId>org.projectlombok</groupId> <artifactId>lombok</artifactId> <version>1.18.28</version> <scope>provided</scope> </dependency> <!-- JSON Processing --> <dependency> <groupId>com.fasterxml.jackson.core</groupId> <artifactId>jackson-databind</artifactId> <version>2.15.2</version> </dependency> <!-- Logging --> <dependency> <groupId>org.slf4j</groupId> <artifactId>slf4j-api</artifactId> <version>1.7.36</version> </dependency> <dependency> <groupId>ch.qos.logback</groupId> <artifactId>logback-classic</artifactId> <version>1.2.11</version> </dependency> <!-- Testing --> <dependency> <groupId>junit</groupId> <artifactId>junit</artifactId> <version>4.13.2</version> <scope>test</scope> </dependency> <dependency> <groupId>org.springframework</groupId> <artifactId>spring-test</artifactId> <version>${spring.version}</version> <scope>test</scope> </dependency> </dependencies> <build> <plugins> <plugin> <groupId>org.apache.maven.plugins</groupId> <artifactId>maven-compiler-plugin</artifactId> <version>3.8.1</version> <configuration> <source>${maven.compiler.source}</source> <target>${maven.compiler.target}</target> </configuration> </plugin> <plugin> <groupId>org.apache.maven.plugins</groupId> <artifactId>maven-resources-plugin</artifactId> <version>3.2.0</version> <configuration> <resources> <resource> <directory>src/main/resources</directory> <filtering>true</filtering> <includes> <include>**/*.properties</include> <include>**/*.xml</include> </includes> </resource> </resources> </configuration> </plugin> <plugin> <groupId>org.apache.tomcat.maven</groupId> <artifactId>tomcat7-maven-plugin</artifactId> <version>2.2</version> <configuration> <port>8080</port> <path>/</path> </configuration> </plugin> </plugins> </build> </project>
05-15
下面是我的pom.xml <?xml version="1.0" encoding="UTF-8"?> <project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd"> <modelVersion>4.0.0</modelVersion> <parent> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-parent</artifactId> <version>3.5.3</version> <relativePath/> <!-- lookup parent from repository --> </parent> <groupId>com.eduassistant</groupId> <artifactId>eduassistant</artifactId> <version>0.0.1-SNAPSHOT</version> <name>eduassistant</name> <description>eduassistant</description> <url/> <licenses> <license/> </licenses> <developers> <developer/> </developers> <scm> <connection/> <developerConnection/> <tag/> <url/> </scm> <properties> <java.version>17</java.version> </properties> <dependencies> <!-- Spring Boot 基础依赖 --> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-web</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-data-jpa</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-data-redis</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-security</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-validation</artifactId> </dependency> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-actuator</artifactId> </dependency> <!-- 数据库 --> <dependency> <groupId>com.mysql</groupId> <artifactId>mysql-connector-j</artifactId> <scope>runtime</scope> </dependency> <!-- 数据库迁移 --> <dependency> <groupId>org.flywaydb</groupId> <artifactId>flyway-core</artifactId> </dependency> <dependency> <groupId>org.flywaydb</groupId> <artifactId>flyway-mysql</artifactId> </dependency> <!-- 工具 --> <dependency> <groupId>org.projectlombok</groupId> <artifactId>lombok</artifactId> <optional>true</optional> </dependency> <!-- JSON 处理 --> <dependency> <groupId>com.fasterxml.jackson.core</groupId> <artifactId>jackson-databind</artifactId> </dependency> <dependency> <groupId>com.fasterxml.jackson.datatype</groupId> <artifactId>jackson-datatype-jsr310</artifactId> </dependency> <!-- HTTP 客户端 --> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-webflux</artifactId> </dependency> <dependency> <groupId>org.hibernate.validator</groupId> <artifactId>hibernate-validator</artifactId> <version>6.2.0.Final</version> </dependency> <!-- DashScope SDK 依赖--> <dependency> <groupId>com.alibaba</groupId> <artifactId>dashscope-sdk-java</artifactId> <version>1.0.0</version> <!-- 请确认使用最新版本 --> </dependency> <!-- JWT --> <dependency> <groupId>io.jsonwebtoken</groupId> <artifactId>jjwt-api</artifactId> <version>0.11.5</version> </dependency> <dependency> <groupId>io.jsonwebtoken</groupId> <artifactId>jjwt-impl</artifactId> <version>0.11.5</version> <scope>runtime</scope> </dependency> <dependency> <groupId>io.jsonwebtoken</groupId> <artifactId>jjwt-jackson</artifactId> <version>0.11.5</version> <scope>runtime</scope> </dependency> <!-- 配置管理 --> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-configuration-processor</artifactId> <optional>true</optional> </dependency> <!-- 缓存支持 --> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-cache</artifactId> </dependency> <!-- Document Processing --> <dependency> <groupId>org.apache.poi</groupId> <artifactId>poi</artifactId> <version>5.2.3</version> </dependency> <dependency> <groupId>org.apache.poi</groupId> <artifactId>poi-ooxml</artifactId> <version>5.2.3</version> </dependency> <dependency> <groupId>org.apache.pdfbox</groupId> <artifactId>pdfbox</artifactId> <version>2.0.27</version> </dependency> <!-- Milvus --> <dependency> <groupId>io.milvus</groupId> <artifactId>milvus-sdk-java</artifactId> <version>2.3.4</version> </dependency> <!-- Swagger --> <dependency> <groupId>org.springdoc</groupId> <artifactId>springdoc-openapi-starter-webmvc-ui</artifactId> <version>2.1.0</version> </dependency> <!-- 测试 --> <dependency> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-starter-test</artifactId> <scope>test</scope> </dependency> <dependency> <groupId>org.springframework.security</groupId> <artifactId>spring-security-test</artifactId> <scope>test</scope> </dependency> <dependency> <groupId>org.testcontainers</groupId> <artifactId>junit-jupiter</artifactId> <version>1.19.3</version> <scope>test</scope> </dependency> </dependencies> <build> <plugins> <plugin> <groupId>org.apache.maven.plugins</groupId> <artifactId>maven-compiler-plugin</artifactId> <configuration> <annotationProcessorPaths> <path> <groupId>org.projectlombok</groupId> <artifactId>lombok</artifactId> </path> </annotationProcessorPaths> </configuration> </plugin> <plugin> <groupId>org.springframework.boot</groupId> <artifactId>spring-boot-maven-plugin</artifactId> <configuration> <excludes> <exclude> <groupId>org.projectlombok</groupId> <artifactId>lombok</artifactId> </exclude> </excludes> </configuration> </plugin> </plugins> </build> </project>
07-11
<think>我们有一个Spring Boot项目,现在需要添加Apache Tika的依赖。根据之前的讨论,我们需要添加`tika-core`和`tika-parsers`两个依赖。 但是,请注意,在现有的pom.xml中,我们已经有了`org.apache.poi`和`org.apache.pdfbox`的依赖,而Tika的解析器会依赖这些库,并且可能带来版本冲突。 因此,我们需要考虑排除Tika中自带的POI和PDFBox,以避免版本冲突,因为我们已经明确指定了这两个库的版本。 具体步骤: 1. 添加`tika-core`和`tika-parsers`依赖。 2. 在`tika-parsers`依赖中排除`poi`和`pdfbox`,因为我们已经单独引入了这两个依赖,并且版本与Tika可能不兼容(Tika 2.9.1默认依赖的POI版本可能低于5.2.3,PDFBox可能低于2.0.27)。 另外,我们也可以考虑添加`tika-parsers-standard-package`,但这里我们使用`tika-parsers`即可,因为它包含了所有标准解析器。 注意:Tika 2.9.1版本依赖的POI版本是5.2.0,PDFBox是2.0.29,但我们项目中已经指定了POI 5.2.3和PDFBox 2.0.27。为了避免冲突,我们排除Tika中的这两个依赖。 因此,我们在`tika-parsers`依赖中添加排除项(exclusions): - 排除`org.apache.poi:poi` - 排除`org.apache.poi:poi-ooxml` - 排除`org.apache.pdfbox:pdfbox` 这样,Tika解析器会使用我们项目中已经声明的POI和PDFBox版本。 添加后的依赖如下:</think>## 在 Spring Boot 项目中添加 Apache Tika 依赖 根据您提供的 `pom.xml` 文件,以下是添加 Apache Tika 依赖的最佳实践方案。我会解决可能出现的依赖冲突问题,并提供完整的集成方案。 ### 最佳依赖配置方案 在 `<dependencies>` 部分添加以下依赖: ```xml <!-- Apache Tika 依赖 --> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>2.9.1</version> </dependency> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>2.9.1</version> <!-- 排除冲突依赖 --> <exclusions> <exclusion> <groupId>org.apache.poi</groupId> <artifactId>poi</artifactId> </exclusion> <exclusion> <groupId>org.apache.poi</groupId> <artifactId>poi-ooxml</artifactId> </exclusion> <exclusion> <groupId>org.apache.pdfbox</groupId> <artifactId>pdfbox</artifactId> </exclusion> </exclusions> </dependency> ``` ### 完整的 pom.xml 更新示例 ```xml <?xml version="1.0" encoding="UTF-8"?> <project xmlns="http://maven.apache.org/POM/4.0.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xsi:schemaLocation="http://maven.apache.org/POM/4.0.0 https://maven.apache.org/xsd/maven-4.0.0.xsd"> <!-- 其他配置保持不变 --> <dependencies> <!-- 现有依赖保持不变 --> <!-- 添加 Apache Tika 依赖 --> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-core</artifactId> <version>2.9.1</version> </dependency> <dependency> <groupId>org.apache.tika</groupId> <artifactId>tika-parsers</artifactId> <version>2.9.1</version> <exclusions> <exclusion> <groupId>org.apache.poi</groupId> <artifactId>poi</artifactId> </exclusion> <exclusion> <groupId>org.apache.poi</groupId> <artifactId>poi-ooxml</artifactId> </exclusion> <exclusion> <groupId>org.apache.pdfbox</groupId> <artifactId>pdfbox</artifactId> </exclusion> </exclusions> </dependency> <!-- 其他依赖保持不变 --> </dependencies> <!-- build 部分保持不变 --> </project> ``` ### 依赖冲突解决方案 1. **POI 冲突解决**: - Tika 默认依赖 POI 5.2.0,但您已使用 5.2.3 - 通过 `<exclusions>` 移除 Tika 中的 POI 依赖 - 使用您指定的更高版本 POI 2. **PDFBox 冲突解决**: - Tika 依赖 PDFBox 2.0.29,但您使用 2.0.27 - 排除 Tika 中的 PDFBox 依赖 - 保持您指定的版本 3. **其他潜在冲突**: - Tika 可能引入其他库(如 commons-compress) - Spring Boot 的依赖管理会自动解决大部分冲突 - 如果出现问题,使用 `mvn dependency:tree` 分析依赖树 ### Spring Boot 集成示例 #### 1. 创建 Tika 服务类 ```java import org.apache.tika.Tika; import org.apache.tika.exception.TikaException; import org.springframework.stereotype.Service; import org.springframework.web.multipart.MultipartFile; import java.io.IOException; @Service public class TikaService { private final Tika tika = new Tika(); public String detectContentType(MultipartFile file) throws IOException { return tika.detect(file.getInputStream(), file.getOriginalFilename()); } public String extractText(MultipartFile file) throws IOException, TikaException { return tika.parseToString(file.getInputStream()); } public String extractTextWithMetadata(MultipartFile file) throws IOException, TikaException { // 高级用法:包含元数据提取 Metadata metadata = new Metadata(); metadata.set(Metadata.RESOURCE_NAME_KEY, file.getOriginalFilename()); return tika.parseToString(file.getInputStream(), metadata); } } ``` #### 2. 创建 REST 控制器 ```java import org.springframework.beans.factory.annotation.Autowired; import org.springframework.http.ResponseEntity; import org.springframework.web.bind.annotation.PostMapping; import org.springframework.web.bind.annotation.RequestParam; import org.springframework.web.bind.annotation.RestController; import org.springframework.web.multipart.MultipartFile; @RestController public class DocumentController { @Autowired private TikaService tikaService; @PostMapping("/extract-text") public ResponseEntity<String> extractText(@RequestParam("file") MultipartFile file) { try { String contentType = tikaService.detectContentType(file); System.out.println("Detected content type: " + contentType); String textContent = tikaService.extractText(file); return ResponseEntity.ok(textContent); } catch (Exception e) { return ResponseEntity.internalServerError().body("Error processing file: " + e.getMessage()); } } @PostMapping("/extract-with-metadata") public ResponseEntity<DocumentInfo> extractWithMetadata(@RequestParam("file") MultipartFile file) { try { String text = tikaService.extractTextWithMetadata(file); // 在实际应用中,您可能返回结构化数据 return ResponseEntity.ok(new DocumentInfo(text)); } catch (Exception e) { return ResponseEntity.internalServerError().body(null); } } // 返回数据结构 public static class DocumentInfo { private final String content; public DocumentInfo(String content) { this.content = content; } // getter } } ``` #### 3. 配置文件处理限制 在 `application.properties` 中添加: ```properties # 增加文件上传大小限制 spring.servlet.multipart.max-file-size=50MB spring.servlet.multipart.max-request-size=50MB # 配置 Tika 内存限制(可选) tika.max-string-length=10000000 # 10MB 文本限制 tika.max-embedded-resources=50 # 最大嵌入资源数 ``` ### 高级集成:自定义 Tika 配置 ```java import org.apache.tika.config.TikaConfig; import org.apache.tika.parser.Parser; import org.springframework.context.annotation.Bean; import org.springframework.context.annotation.Configuration; @Configuration public class TikaConfig { @Bean public Tika tika() { // 自定义 Tika 配置 TikaConfig config = TikaConfig.getDefaultConfig(); // 获取 PDF 解析器并配置 Parser pdfParser = config.getParser("application/pdf"); if (pdfParser instanceof org.apache.tika.parser.pdf.PDFParser) { ((org.apache.tika.parser.pdf.PDFParser) pdfParser).setExtractAnnotationText(true); } return new Tika(config); } } ``` ### 处理大文件的内存优化 ```java import org.apache.tika.parser.ParseContext; import org.apache.tika.parser.Parser; import org.apache.tika.sax.BodyContentHandler; import org.xml.sax.ContentHandler; import java.io.InputStream; public String extractLargeFile(MultipartFile file) throws Exception { Parser parser = new AutoDetectParser(); ContentHandler handler = new BodyContentHandler(-1); // -1 表示无限制 // 创建内存限制的上下文 ParseContext context = new ParseContext(); context.set(Parser.class, parser); try (InputStream stream = file.getInputStream()) { Metadata metadata = new Metadata(); metadata.set(Metadata.RESOURCE_NAME_KEY, file.getOriginalFilename()); parser.parse(stream, handler, metadata, context); return handler.toString(); } } ``` ### 测试依赖是否生效 创建测试类验证 Tika 功能: ```java import org.junit.jupiter.api.Test; import org.springframework.beans.factory.annotation.Autowired; import org.springframework.boot.test.context.SpringBootTest; import org.springframework.mock.web.MockMultipartFile; import static org.junit.jupiter.api.Assertions.*; @SpringBootTest class TikaIntegrationTest { @Autowired private TikaService tikaService; @Test void testPdfExtraction() throws Exception { // 创建模拟 PDF 文件 String content = "%PDF-1.4\n..."; // 简化 PDF 内容 MockMultipartFile file = new MockMultipartFile( "document", "test.pdf", "application/pdf", content.getBytes() ); String extracted = tikaService.extractText(file); assertTrue(extracted.contains("PDF")); } @Test void testDocxExtraction() throws Exception { // 创建模拟 DOCX 文件 byte[] docxContent = Files.readAllBytes(Paths.get("src/test/resources/sample.docx")); MockMultipartFile file = new MockMultipartFile( "document", "test.docx", "application/vnd.openxmlformats-officedocument.wordprocessingml.document", docxContent ); String extracted = tikaService.extractText(file); assertTrue(extracted.contains("Sample Document")); } } ``` ### 常见问题解决 1. **依赖冲突**: ```bash mvn dependency:tree -Dincludes=org.apache.poi,org.apache.pdfbox ``` 检查输出确保只有一个版本 2. **内存溢出**: ```java // 在 TikaService 中添加 @PostConstruct public void init() { tika.setMaxStringLength(10 * 1024 * 1024); // 10MB } ``` 3. **文件类型检测失败**: ```java // 明确设置文件名帮助检测 metadata.set(Metadata.RESOURCE_NAME_KEY, "document.pdf"); ```
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值