Upload Information Template for Hobbes.nmsu.edu =============================================== Archive Filename: pms_124.zip Short Description: a PM utility for stripping HTML codes Long Description: PMStripper. A fully functional, OS/2 PM utility which strips HTML codes from Web pages leaving some formatting intact. It now translates iso8859-1 character codes (windows codepage) to codepage 437 or codepage 850 character codes. Proposed directory for placement: pub/os2/apps/internet/www/util Your name: Don Hawkinson Email address: dwhawk@southwind.net Program contact name: (same) Program contact email: (same) Program URL: http://www2.southwind.net/~dwhawk Would you like the contact email address included in listings? yes Operating System/Version: OS/2 Warp 3 and later Additional requirements: (none) Replaces: pms_123.zip, pms_122.zip, pms_121.zip, pms_120.zip, This PM utility strips HTML codes from Web pages, leaving only the text and optionally the URLs. While the layout of tables and lists is lost during stripping, data is sorted to separate lines for legibility. Install PMStripper as the HTML source viewer in WebX instead of the OS/2 system editor E, and you can use the Ctrl+V keystroke combination to load the web page source directectly into PMStripper. Since the IBM Web Explorer does not have an option to save the contents of a web page as plain ascii text, using PMStripper makes it easier to save text from web pages. Although Netscape will save a web page as plain ascii text, it also saves the layout of the page. Using PMStripper as the HTML source viewer with Netscape will make it easier obtain just the text of an article posted on a web page. Registered users of PMStripper will have access to two additional executable files that were compiled with the 486 and 586 compiler options. These versions will offer some performance improvement for users with 468 , Pentium, or equivalent CPUs. New in PMStripper 1.24 Added 'Translate iso8859-1 character codes' option under 'Strip Options'. Fixed bug in how html comments are stripped. New in PMStripper 1.23 Added 'Hard code word wrap' menu item under "File" on the menubar. This option adds CR-LF pairs to each line in the display window to make the current word wrap permanent. This should make it easier to print the stripped text. New in PMStripper 1.22 Corrected a bug in handling the "TD" html code. New in PMStripper 1.21 Fixed bug that prevented the font selection made from the menu from being saved as the new default font. Added support for saving drag and drop color and font changes made to the information area below the main window. This should make PMStripper easier to use for OS/2 users with vision problems. Added a "Save Marked Text To File" option under "File" on the menubar. New in PMStripper 1.20 Added support for saving drag & drop color and font changes made to the menubar. This should make the utility easer to use for OS/2 users with vision problems. Fixed a bug in the naming of the profile file when the environment variable in the config.sys file is used to specify the location of the file. New in PMStripper 1.19 This release is a bug fix release. It fixes bugs in the option that allows the user to specify a default directory for the HTML source files. These bugs appear to have been present since version 1.14. This option is disabled in the shareware version, so a new shareware version has not been released. New in PMStripper 1.18 Added current background and foreground color to the information saved when the 'Save Settings' option is selected. Added some additional translations for word processor codes to the 'Translate quotes' option. Updated the PMStripper documentaion to include the 'Use idle time priority' and 'Reset to default colors' options. New in PMStripper 1.17 Fixed a bug that could cause PMStripper it to abort. This could happen if the user had too much text marked when the user pressed ALT+S or selected "Save As" from the menu. With the fix, if there are more than 1024 characters the program does not attempt to use the marked text as a filename. If there are less then 1024 and more than 254 marked characters, the text is truncated and then used as the file name. Added a 'Translate quotes' to the strip options. This option translates the "smart quotes" used on some web pages into the standard ascii values (0x93 and 0x94 are changed to 0x22). The "smart apostrophes" are translated to standard ascii (0x91 and 0x92 are changed to 0x22). The two "special hyphen" characters are translated to standard ascii (0x96 and 0x97 are changed to 0x2d). The 0x85 character is translated into 3 periods (0x2e) to approximate an elipsis character. In addition the 0xA0 and the 0x99 characters are translated to a space. The translation is done before any html character enties are translated, so this option should not effect languages that use those characters as part of their normal text. Added an 'Only http type' option to the URL options. This option limits the URLs to those links containing a http reference. The "normal" URL detection looks for htlm code containing href and will find gopher, ftp, mailto, and relative links to other web pages as well as complete URLs.