Limit search to available items
Book Cover
E-book
Author Mertz, David

Title Text processing in Python / David Mertz
Published Boston : Addison-Wesley, ©2003

Copies

Description 1 online resource (xix, 520 pages)
Contents 1. Python Basics -- 2. Basic String Operations -- 3. Regular Expressions -- 4. Parsers and State Machines -- 5. Internet Tools and Techniques -- App. A. Selective and Impressionistic Short Review of Python -- App. B. Data Compression Primer -- App. C. Understanding Unicode -- App. D.A State Machine for Adding Markup to Text
Summary Text Processing in Python is an example-driven, hands-on tutorial that carefully teaches programmers how to accomplish numerous text processing tasks using the Python language. Filled with concrete examples, this book provides efficient and effective solutions to specific text processing problems and practical strategies for dealing with all types of text processing challenges. Text Processing in Python begins with an introduction to text processing and contains a quick Python tutorial to get you up to speed. It then delves into essential text processing subject areas, including string operations, regular expressions, parsers and state machines, and Internet tools and techniques. Appendixes cover such important topics as data compression and Unicode. A comprehensive index and plentiful cross-referencing offer easy access to available information. In addition, exercises throughout the book provide readers with further opportunity to hone their skills either on their own or in the classroom. A companion Web site (http://gnosis.cx/TPiP) contains source code and examples from the book. Here is some of what you will find in thie book: When do I use formal parsers to process structured and semi-structured data? Page 257 How do I work with full text indexing? Page 199 What patterns in text can be expressed using regular expressions? Page 204 How do I find a URL or an email address in text? Page 228 How do I process a report with a concrete state machine? Page 274 How do I parse, create, and manipulate internet formats? Page 345 How do I handle lossless and lossy compression? Page 454 How do I find codepoints in Unicode? Page 465 0321112547B05022003
Bibliography Includes bibliographical references (pages xvii-xix) and index
Subject Text processing (Computer science)
Python (Computer program language)
Word Processing
Python (Computer program language)
Text processing (Computer science)
Form Electronic book
ISBN 9780321112545
0321112547