Refactoring with Solr

本文介绍了Apache Solr的安装配置流程及基本操作方法,包括软件下载、部署步骤、schema.xml与solrconfig.xml文件配置详解,并展示了如何利用SolrJ进行文档添加与搜索。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

1.About Solr
Solris the popular, blazing fast, open source NoSQL search platform from the ApacheLucene project. Its major features include powerful full-text search, hithighlighting, faceted search, dynamic clustering, database integration, richdocument (e.g., Word, PDF) handling, and geospatial search. Solr is highlyscalable, providing fault tolerant distributed search and indexing, and powersthe search and navigation features of many of the world's largest internetsites.
SolrFeatures: Solr is a standalone enterprise search server with a REST-like API.You put documents in it (called "indexing") via JSON, XML, CSV orbinary over HTTP. You query it via HTTP GET and receive JSON, XML, CSV orbinary results.
2.Solr Setup

SoftwareDownload
Java:You will need the Java Runtime Environment (JRE) version 1.7 or higher.
Tomcat:Through the server deployment project (May also be other server).
Solr
SetupSteps
Step1
ExtractSolr.zip
Step2
Copysolr \server\webapps\solr.war to tomcat \webapps
Step3
Runtomcat startup.bat (tomcat will automatically unpack solr.war)
Step4
Deletetomcat \webapps\solr.war (if not,tomcat will publish solr every time whenserver start up)
Step5

        
<env-entry>
   
      <env-entry-name>solr/home</env-entry-name>
   
      <env-entry-value>${solrHome }</env-entry-value>
   
      <env-entry-type>java.lang.String</env-entry-type>
   
</env-entry>
    
   
    Changetomcat \webapps\solr\WEB-INF\web.xml.
Addabove code in <web-app /> node.
Step6
Copyall files under solr \example\example-DIH\solr to local path named ${solrHome }
Step7
Copysolr \dist\*.jar to tomcat webapps\solr\WEB-INF\lib
Step8
Starttomcat, to access  http://localhost:8080/solr/
Ifsuccessfully, You will see below page.
3.Schema.xml
Schema.xml is usually thefirst file you configure when setting up a new Solr installation.
Theschema declares:
l Whatkinds of fields there are
l Whichfield should be used as the unique/primary key
l Whichfields are required
l Howto index and search each field

        
<types>
   
     <fieldType name="int"    class="solr.TrieIntField" precisionStep="0"    omitNorms="true" positionIncrementGap="0"/>
   
...
   
</types>
    
   
    TheXML consists of a number of parts.
Field Types

        
<fields>
   
     <field name="id" type="string"    indexed="true" stored="true" required="true"    />
   
     <field name="name" type="textgen"    indexed="true" stored="true"/>
   
...
   
</fields>
    
   
    Theexample Solr schema.xml comes with a number of pre-defined field types, andthey're quite well-documented. You can also use them as templates for creatingnew field types.
Fields
Thedocumentation provides a list of valid attributes:
name: mandatory - the name forthe field
type: mandatory - the name of apreviously defined type from the <types> section
indexed: true if this field shouldbe indexed (searchable or sortable)
stored: true if this field shouldbe retrievable
compressed: [false] if this fieldshould be stored using gzip compression (this will only apply if the field typeis compressable; among the standard field types, only TextField and StrFieldare)
multiValued: true if this field maycontain multiple values per document
omitNorms: (expert) set to true to omitthe norms associated with this field (this disables length normalization andindex-time boosting for the field, and saves some memory). Only full-textfields or fields that need an index-time boost need norms.
termVectors: [false] set to true tostore the term vector for a given field. When using MoreLikeThis, fields usedfor similarity should be stored for best performance.
termPositions: Store position informationwith the term vector. This will increase storage costs.
termOffsets: Store offset informationwith the term vector. This will increase storage costs.
default: a value that should be usedif no value is specified when adding a document.
Misc

         
<uniqueKey>id</uniqueKey>
     
     
      
uniqueKey

         
<defaultSearchField>aggregate_text</defaultSearchField>
     
     
      
Equivalent to the primary keyof the document.
defaultSearchField

         
<solrQueryParser     defaultOperator="OR"/>
     
     
      
solrQueryParser
Usedfor determining if multiple terms are ANDed or ORed together by default.
4.Solrconfig.xml
Solrconfig.xmlis usually the second file you configure when setting up a new Solrinstallation, after schema.xml.

         
<!--
     
Used to specify an alternate directory     to hold all index data
     
other than the default ./data under     the Solr home.
     
If replication is in use, this should     match the replication configuration.
     
-->
     
<dataDir>${solr.data.dir:./solr/data}</dataDir>
     
     
      
The more commonly-usedelements in solrconfig.xml are:
l  data directory location
l cacheparameters
l requesthandlers
Request handlers areresponsble for accepting HTTP requests, performing searches, then returning theresults.

        
<requestHandler name="standard" class="solr.SearchHandler" default="true">
   
      <lst name="defaults">
   
        <str name="echoParams">explicit</str>
   
        <!--
   
       <int name="rows">10</int>
   
       <str name="fl">*</str>
   
       <str name="version">2.1</str>
   
        -->
   
      </lst>
   
</requestHandler>
    
   
    Thedefault request handler that comes configured with the example webapp, alsoknown as the standard request handler, looks like this:
l searchcomponents
Search components extend theabstract class SearchComponent and areresponsible for performing the actual searches.
5.SolrJ
Setting up the classpath
From /dist
              apache-solr-solrj-*.jar
From/dist/solrj-lib
              commons-codec-1.3.jar
              commons-httpclient-3.1.jar
              commons-io-1.4.jar
              jcl-over-slf4j-1.5.5.jar
              slf4j-api-1.5.5.jar
From /lib
              slf4j-jdk14-1.5.5.jar

        
import    org.apache.solr.client.solrj.SolrServerException;
   
import    org.apache.solr.client.solrj.impl.HttpSolrServer;
   
import    org.apache.solr.common.SolrInputDocument;
    
   
import java.io.IOException;
    
   
public class SolrjPopulator {
   
     public static void main(String[] args) throws IOException,    SolrServerException {
   
       HttpSolrServer server = new HttpSolrServer("http://localhost:8983/solr");
   
       for(int i=0;i<1000;++i) {
   
         SolrInputDocument doc = new SolrInputDocument();
   
         doc.addField("cat", "book");
   
         doc.addField("id", "book-" + i);
   
         doc.addField("name", "The Legend of the Hobbit part    " + i);
   
         server.add(doc);
   
         if(i%100==0) server.commit();     // periodically flush
   
       }
   
       server.commit();
   
     }
   
}
    
   
    Add documents using SolrJ

        
import    org.apache.solr.client.solrj.SolrServerException;
   
import    org.apache.solr.client.solrj.impl.HttpSolrServer;
   
import    org.apache.solr.client.solrj.SolrQuery;
   
import    org.apache.solr.client.solrj.response.QueryResponse;
   
import    org.apache.solr.common.SolrDocumentList;
    
   
import java.net.MalformedURLException;
    
   
public class SolrJSearcher {
   
     public static void main(String[] args) throws MalformedURLException,    SolrServerException {
   
       HttpSolrServer solr = new    HttpSolrServer("http://localhost:8983/solr");
    
   
       SolrQuery query = new SolrQuery();
   
       query.setQuery("sony digital camera");
   
       query.addFilterQuery("cat:electronics","store:amazon.com");
   
       query.setFields("id","price","merchant","cat","store");
   
       query.setStart(0);   
   
       query.set("defType", "edismax");
    
   
       QueryResponse response = solr.query(query);
   
       SolrDocumentList results = response.getResults();
   
       for (int i = 0; i < results.size(); ++i) {
   
         System.out.println(results.get(i));
   
       }
   
     }
   
}
    
   
    Search using SolrJ
6.Boosts
7.Next
8.Suggetions
9. References




Boosts
In addition to the scoring factorsmentioned above, the primary method of modifying document scores is byboosting.
There are 2 kinds of boosts. Index-time andQuery-time boosts.
Index-time boosts are applied when addingdocuments, and apply to the entire document or to specific fields.
Query-time boosts are applied whenconstructing a search query, and apply to specific fields.
Query boosts are applied by appending thecaret character ^ followed by a positive number to query clauses.
title:foo OR(title:foo AND title:bar)^2.0 OR title:"foo bar"^10
Negative boosts
Whilst Lucene allows negative boosts, Solrdoes not.
The only way to meaningfully perform anegative boost, is by applying a positive boost to a negative query. Forexample:
(*:*-title:foo)^2.0
This boosts all documents which don't have"foo" in the title by 2.0, thereby effectively applying a down boostto documents which do.
We mainly use Index-time fashion to applyboosts when adding documents.
There are two fields to operate usingSolrj.We can adding boost to the field in solr document or adding boost to solrdocument itself.

There are three people  wrwangwr@cn.ibm, panhm@cn.ibm.com and  yanjuqi@cn.ibm.com. All of them have a title “I can playjava”.Now we add boost 1 to  wrwangwr@cn.ibm.comtitle field,2 to  panhm@cn.ibm.com title field and 3 to  yanjuqi@cn.ibm.com titlefield. After we selected the key word "java",the data will displaylike blow.
基于数据挖掘的音乐推荐系统设计与实现 需要一个代码说明,不需要论文 采用python语言,django框架,mysql数据库开发 编程环境:pycharm,mysql8.0 系统分为前台+后台模式开发 网站前台: 用户注册, 登录 搜索音乐,音乐欣赏(可以在线进行播放) 用户登陆时选择相关感兴趣的音乐风格 音乐收藏 音乐推荐算法:(重点) 本课题需要大量用户行为(如播放记录、收藏列表)、音乐特征(如音频特征、歌曲元数据)等数据 (1)根据用户之间相似性或关联性,给一个用户推荐与其相似或有关联的其他用户所感兴趣的音乐; (2)根据音乐之间的相似性或关联性,给一个用户推荐与其感兴趣的音乐相似或有关联的其他音乐。 基于用户的推荐和基于物品的推荐 其中基于用户的推荐是基于用户的相似度找出相似相似用户,然后向目标用户推荐其相似用户喜欢的东西(和你类似的人也喜欢**东西); 而基于物品的推荐是基于物品的相似度找出相似的物品做推荐(喜欢该音乐的人还喜欢了**音乐); 管理员 管理员信息管理 注册用户管理,审核 音乐爬虫(爬虫方式爬取网站音乐数据) 音乐信息管理(上传歌曲MP3,以便前台播放) 音乐收藏管理 用户 用户资料修改 我的音乐收藏 完整前后端源码,部署后可正常运行! 环境说明 开发语言:python后端 python版本:3.7 数据库:mysql 5.7+ 数据库工具:Navicat11+ 开发软件:pycharm
MPU6050是一款广泛应用在无人机、机器人和运动设备中的六轴姿态传感器,它集成了三轴陀螺仪和三轴加速度计。这款传感器能够实时监测并提供设备的角速度和线性加速度数据,对于理解物体的动态运动状态至关重要。在Arduino平台上,通过特定的库文件可以方便地与MPU6050进行通信,获取并解析传感器数据。 `MPU6050.cpp`和`MPU6050.h`是Arduino库的关键组成部分。`MPU6050.h`是头文件,包含了定义传感器接口和函数声明。它定义了类`MPU6050`,该类包含了初始化传感器、读取数据等方法。例如,`begin()`函数用于设置传感器的工作模式和I2C地址,`getAcceleration()`和`getGyroscope()`则分别用于获取加速度和角速度数据。 在Arduino项目中,首先需要包含`MPU6050.h`头文件,然后创建`MPU6050`对象,并调用`begin()`函数初始化传感器。之后,可以通过循环调用`getAcceleration()`和`getGyroscope()`来不断更新传感器读数。为了处理这些原始数据,通常还需要进行校准和滤波,以消除噪声和漂移。 I2C通信协议是MPU6050与Arduino交互的基础,它是一种低引脚数的串行通信协议,允许多个设备共享一对数据线。Arduino板上的Wire库提供了I2C通信的底层支持,使得用户无需深入了解通信细节,就能方便地与MPU6050交互。 MPU6050传感器的数据包括加速度(X、Y、Z轴)和角速度(同样为X、Y、Z轴)。加速度数据可以用来计算物体的静态位置和动态运动,而角速度数据则能反映物体转动的速度。结合这两个数据,可以进一步计算出物体的姿态(如角度和角速度变化)。 在嵌入式开发领域,特别是使用STM32微控制器时,也可以找到类似的库来驱动MPU6050。STM32通常具有更强大的处理能力和更多的GPIO口,可以实现更复杂的控制算法。然而,基本的传感器操作流程和数据处理原理与Arduino平台相似。 在实际应用中,除了基本的传感器读取,还可能涉及到温度补偿、低功耗模式设置、DMP(数字运动处理器)功能的利用等高级特性。DMP可以帮助处理传感器数据,实现更高级的运动估计,减轻主控制器的计算负担。 MPU6050是一个强大的六轴传感器,广泛应用于各种需要实时运动追踪的项目中。通过 Arduino 或 STM32 的库文件,开发者可以轻松地与传感器交互,获取并处理数据,实现各种创新应用。博客和其他开源资源是学习和解决问题的重要途径,通过这些资源,开发者可以获得关于MPU6050的详细信息和实践指南
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值