Web Information Processing and Applications

    Instructors
    Jin Pei-Quan (金培权)             Xu Lin-Li (徐林莉)
    Email: jpq@ustc.edu.cn           Email: linlixu@ustc.edu.cn

    Teaching Assistants
    Lin Sheng (林盛), Ph.D. student              Yu Yongbo (于永波), Master's student
    Phone: 13485728758                          Phone: 13865979122
    Email: linsh@mail.ustc.edu.cn               Email: yyb2012@mail.ustc.edu.cn
    Room (both): 1610, Science and Technology Experiment Building, West Building (科技实验楼西楼)

    Lectures
    Time: Class periods 6 to 8
    Classroom: 3C221 (West Campus)

    Textbook
    W. Bruce Croft, Donald Metzler, Trevor Strohman, Search Engines: Information Retrieval in Practice, Pearson Press, 2010
        (Chinese edition: translated by Liu Ting et al., 搜索引擎:信息检索实践, China Machine Press, 2012)

    References
    Christopher D. Manning, Prabhakar Raghavan, Hinrich Schütze, Introduction to Information Retrieval, Cambridge University Press, 2008
    (Chinese edition: translated by Wang Bin, 信息检索导论, Posts & Telecom Press, 2010)
    Ricardo Baeza-Yates, Berthier Ribeiro-Neto, Modern Information Retrieval, Addison Wesley Longman Publishing Co. Inc., 1999
    Bing Liu, Web Data Mining (2nd Edition), Springer, 2011
    Some state-of-the-art papers from SIGIR, CIKM, WWW, etc.

    Assignments
    Homework is assigned during the course. POLICY: every assignment must be completed and submitted within one week, i.e. before the beginning of the next class. Late submissions are penalized 20%.

    Examination
    One final test, scheduled to be taken at the end of the course.

    Grading
    Homework: 20%
    Lab: 20% [Lab #1 Description. Lab time: 18:30-21:30, Monday and Tuesday, starting 8 October. Lab site: 517, E3 Building]
    Final: 60%
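
    The weights above combine into a single course grade. The following is a minimal sketch of that arithmetic, assuming each component is scored on a 0-100 scale (the scale and the names WEIGHTS and course_grade are illustrative assumptions, not part of the course rules):

```python
# Weights come from the Grading section; the 0-100 scale is an assumption.
WEIGHTS = {"homework": 0.20, "lab": 0.20, "final": 0.60}


def course_grade(scores):
    """Return the weighted course grade for component scores on a 0-100 scale."""
    missing = set(WEIGHTS) - set(scores)
    if missing:
        raise ValueError(f"missing component scores: {sorted(missing)}")
    # Weighted sum: 20% homework + 20% lab + 60% final.
    return sum(WEIGHTS[name] * scores[name] for name in WEIGHTS)


if __name__ == "__main__":
    # Hypothetical scores, for illustration only.
    print(course_grade({"homework": 90, "lab": 80, "final": 70}))
```

    Because the final carries 60% of the weight, a 10-point change in the final exam score moves the course grade three times as much as the same change in homework or lab.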

    Course Notes

    No.    Date   Contents                                    Homework  Reading (chapters)
    1      9.3    Introduction to Web Information Processing            Chp.1-2
    2      9.10   Web Crawling (updated)                      homework  Chp.3
    3      9.17   Text Processing                             homework  Chp.4
    4      9.24   Indexing & Lab #1 Description               homework  Chp.5
    5      10.1   (National Day) Lab #1
                  Lab time: 18:30-21:30, Monday and Tuesday.
                  Location: 517, E3 Building
    6      10.8   Queries                                     homework  Chp.6
    7      10.15  Ranking                                     homework  Chp.7
    8      10.22  Evaluation                                  homework  Chp.8
    9      10.29  Named Entity Recognition
    10     11.5   Relation Extraction
    11-18         Web Data Mining
    19            Review
    20            Final Exam

### Information Processing Technician Tutorial

An information processing technician plays a crucial role in managing and organizing data within various systems. The responsibilities include inputting, processing, storing, retrieving, and disseminating information using different software applications and hardware configurations[^1].

For those interested in becoming an information processing technician or enhancing skills in this field, several resources are available.

#### Online Courses

Platforms such as Coursera, Udemy, and LinkedIn Learning offer courses on database management, on programming languages such as Python (useful for automating information-handling tasks), and on other relevant subjects that help build proficiency as an information processing technician[^2].

#### Books and Manuals

Books focusing specifically on the duties of an information processing technician may not be abundant; however, literature covering computer science fundamentals, office productivity tools (such as the Microsoft Office suite), and IT service support provides valuable insight into what it takes to excel in this profession[^3].

```python
# Basic file operation useful for technicians: read a text file and print it.
with open('example.txt', 'r') as file:
    content = file.read()
print(content)
```

#### Community Forums and Groups

Joining online communities where professionals discuss the challenges of working with large datasets can also help. Stack Overflow is well suited to technical questions, whereas Reddit has subreddits dedicated to discussing career paths similar to that of an information processing technician[^4].