Simple method for indexing MS Word documents
Building indexers or spiders that can read binary MS Word (.doc) documents can be difficult, especially on *nix servers, which don't support PHP's COM extension. Solutions usually involve installing binaries on the server, which is often impossible or disallowed. This simple PHP snippet does a reasonably good job of extracting text from an MS Word document for use in a search index. While not pretending to be perfect, it has proved itself useful on thousands of test documents.
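The listing itself does not reproduce the snippet, so the following is only a minimal sketch of one heuristic that fits the description, not the publisher's code. It assumes the document text is stored as 8-bit characters with paragraphs ending in a carriage return (0x0D), and it treats any chunk containing a NUL byte as binary structure rather than text. The function names extractWordText and extractWordTextFromFile are hypothetical.

```php
<?php
// Hypothetical helper: heuristic text extraction from the raw bytes of a .doc file.
// Assumption: readable text sits in 8-bit runs separated by carriage returns (0x0D).
function extractWordText(string $bytes): string
{
    $kept = [];
    foreach (explode("\r", $bytes) as $chunk) {
        // Skip empty chunks and chunks containing NUL bytes, which are
        // assumed to be binary structure (headers, tables, UTF-16 data).
        if ($chunk === '' || strpos($chunk, "\0") !== false) {
            continue;
        }
        $kept[] = $chunk;
    }
    $text = implode(' ', $kept);
    // Replace everything outside printable ASCII with a space.
    $text = preg_replace('/[^\x20-\x7E]+/', ' ', $text);
    // Collapse runs of whitespace so the result suits a search index.
    return trim(preg_replace('/\s+/', ' ', $text));
}

// Hypothetical wrapper: returns an empty string if the file cannot be read.
function extractWordTextFromFile(string $path): string
{
    $bytes = @file_get_contents($path);
    return $bytes === false ? '' : extractWordText($bytes);
}
```

A call such as extractWordTextFromFile('report.doc') (the file name is illustrative) would return a single line of text ready to feed to an indexer. Because the heuristic drops any paragraph containing a NUL byte, documents that store their text as 16-bit characters will lose content, and accented characters are replaced with spaces; this is the kind of imperfection the description above admits to.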
Visit publisher site: Simple method for indexing MS Word documents
Listing Details
- Filed in: Scripts / PHP / Tutorials & Tips / Searching
- Submitted on:
- Last Updated: Apr 30, 2006
- Publisher: The Mouse Whisperer
License & Pricing Information
LICENSE #1
- License Type: Freeware
- Price: $0.00 USD
- Additional Info: