Text Processing in Pythonby David Mertz -- published by Addison Wesley Publications of David Mertz -- Gnosis Software Home -- Code samples from the book -- Errata |
Thursday 2006-06-07 |
Sob story: I put up the above PayPal button a year or two back. A couple of you make donations each month (out of about a thousand of you reading the text each week). Tragedy of the commons and all that... but if some more of you would donate a few bucks, that would be great support of the author.
If you want to buy the printed version this Amazon link lets me get a small commission on each sale, about equal to the royalty from the publisher, but coming much quicker (and not affecting the eventual royalty; so I get double). Feel free to email me also.
In a community spirit (and with permission of my publisher), I am making my book available to the Python community. Minor corrections can be made to later printings, and at the least errata noted on this website. Email me at <mertz@gnosis.cx> .
A few caveats: (1) This stuff is copyrighted by AW (except the code samples which are released to the public domain). Feel free to use this material personally; but no permission is given for further distribution beyond your personal use. (2) The book is provided in "smart ASCII" format. This is converted to print (and maybe to fancier electronic formats) by automated scripts (txt->LaTeX->PDF for the printed version). As a highly sophisticated "digital rights management" system, those scripts are not themselves made readily available. :-)
acknowledgments.txt | FOLKS WHO HAVE MADE THIS BOOK BETTER | |
intro.txt | INTRODUCTION | |
chap1.txt | PYTHON BASICS | |
chap2.txt | BASIC STRING OPERATIONS | |
chap3.txt | REGULAR EXPRESSIONS | |
chap4.txt | PARSERS AND STATE-MACHINES | |
chap5.txt | INTERNET TOOLS AND TECHNIQUES | |
appendix_a.txt | A SELECTIVE AND IMPRESSIONISTIC SHORT REVIEW OF PYTHON | |
appendix_b.txt | A DATA COMPRESSION PRIMER | |
appendix_c.txt | UNDERSTANDING UNICODE | |
appendix_d.txt | A STATE-MACHINE FOR ADDING MARKUP TO TEXT | |
glossary.txt | GLOSSARY TERMS |
发现一篇好东西,关于Python进行文件处理的,不过好像对字符串的操作没有什么介绍。有一本电子书《Text Processing in Python》,非常好,专攻Python中的文本处理,作者也是Python社区的牛人,在IBM的developerWorks开有Charming Python专栏,可惜更新太慢。
对于做文本处理方面的朋友,Text Processing in Python尤其对口,以前还打算翻译过来了,贡献给IR Lab,可惜如今身在江湖,有心杀敌,无力回天啊 :)
Trackback: http://tb.donews.net/TrackBack.aspx?PostId=540681
Python文本处理指南
本书由Python社区专家David Mertz撰写,全面介绍了使用Python进行文本处理的方法。内容涵盖基础操作、正则表达式、解析器及互联网工具等。适用于从事文本处理工作的技术人员。
511





