EXTRADATA TECHNOLOGIES
UNWWWRAP 1.0 README FILE


Contents
A. What is this thing for?
UnWWWrap! is an interactive tool for extracting data from web pages.
It works as a companion to Internet Explorer.

1. Installation
1.1. Installation problems

You can quickly check whether UnWWWrap! is functional by hot-clicking
in any Internet Explorer window. A hot-click is a click with your right
mouse button while holding down the CTRL key.

If UnWWWrap! is installed properly, once you hot-click you will see
an hourglass, and UnWWWrap! will launch and appear shortly.

If you see a drop-down menu instead of an hourglass, the UnWWWrap!
hook installation has failed. To make sure, close all browser windows
and launch a new Internet Explorer. Repeat the test in this new
window. If it fails again, please contact support.

Note that UnWWWrap! only works when the browser is displaying a WEB PAGE.
If your browser is displaying anything else (e.g. a directory listing),
UnWWWrap! will not pop up. This is normal.

1.2. UnWWWrap and other Internet Explorer helpers

Due to a bug in Internet Explorer 5, UnWWWrap! may not work if any
other Helper is installed. In fact, because of this bug only one Helper
can be active at a time. Therefore, during installation UnWWWrap!
DISABLES the other Helpers.
We are working on a better solution. Please contact support if this
has caused any inconvenience.

2. Other peculiarities

2.1 'Funny Characters'

From time to time you will see 'funny characters' appearing in
captured HTML or other data. This is because many different
international alphabets are used on the Internet. UnWWWrap!
treats them consistently by converting them to the UTF-8 character
encoding, which allows any mix of languages on one page.
However, many editors, such as Notepad (before Windows 2000),
do NOT handle UTF-8. This is why you see 'funny' characters. They
are harmless. To display the data properly on your web page,
make sure to specify UTF-8 as its character set:

<html>
 <head>
  <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
 </head>
 <body>
  your utf-8 encoded data
 </body>
</html>

Note that some web pages use bullets and other special characters, which
may also look 'funny' when converted to UTF-8.
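
The effect described above can be reproduced with a short Python sketch.
It is an illustration only, not part of UnWWWrap!. It assumes the editor
reads the file using the Western European code page cp1252; the sample
text and the function name are made up for the example.

```python
# Illustration only: why UTF-8 data shows up as 'funny characters'
# in an editor that assumes a single-byte code page.

def view_as_code_page(text: str, code_page: str = "cp1252") -> str:
    """Encode text as UTF-8, then misread the bytes as `code_page`.

    cp1252 is an assumption made for this example; a real editor
    uses whatever code page the system is configured with.
    """
    utf8_bytes = text.encode("utf-8")
    return utf8_bytes.decode(code_page)

original = "café • naïve"            # accented letters and a bullet
garbled = view_as_code_page(original)

print(garbled)                       # cafÃ© â€¢ naÃ¯ve

# Nothing is lost: reading the same bytes as UTF-8 restores the text.
restored = original.encode("utf-8").decode("utf-8")
print(restored == original)          # True
```

Each accented letter or bullet becomes two or three unrelated characters,
which is exactly the 'funny' look. The data itself is unchanged.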

2.2 International Alphabets support

UnWWWrap! copies Grids to the clipboard in Unicode format, so the original
language information is preserved when pasting into Excel.

For text and HTML, UnWWWrap! uses UTF-8, which should be supported by your
HTML Editor, but may not appear correctly in some standard editors
(see the 'Funny Characters' section above).
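
If you must open captured text in an editor that does not handle UTF-8,
one possible workaround is to convert the data to the code page that
editor expects. The Python sketch below is an assumption-laden example,
not a feature of UnWWWrap!: the function name is hypothetical, and the
target code page (cp1252 or cp1251 here) depends on your system.
Characters that do not exist in the target code page are replaced
with '?', so this conversion can lose information.

```python
# Sketch: convert UTF-8 data to a legacy single-byte code page.

def reencode_utf8(data: bytes, target_encoding: str) -> bytes:
    """Decode UTF-8 bytes and re-encode them as `target_encoding`.

    Characters the target code page cannot represent become '?'.
    """
    text = data.decode("utf-8")
    return text.encode(target_encoding, errors="replace")

captured = "Grüße".encode("utf-8")        # sample data, made up here

# German text fits in the Western European code page.
print(reencode_utf8(captured, "cp1252"))  # b'Gr\xfc\xdfe'

# Cyrillic text does not fit in cp1252, so every letter is lost...
russian = "Привет".encode("utf-8")
print(reencode_utf8(russian, "cp1252"))   # b'??????'

# ...but it survives in the Cyrillic code page cp1251.
print(reencode_utf8(russian, "cp1251").decode("cp1251") == "Привет")
```

Keeping the original UTF-8 data and converting only a copy avoids
losing characters permanently.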

The UnWWWrap! application is not fully internationalized. It shows
national-language data correctly in the grid and pastes it correctly into
Excel. Other parts of the UnWWWrap! user interface may not display your
language correctly. Please send us feedback on your particular needs, and
we will address them in a subsequent version.

2.3 Copying/Pasting Unicode

When you click the Grid->Excel or Grid->Word button, your data is copied
to the clipboard as Unicode. It can be easily pasted into Word or Excel,
but some applications (such as Notepad) will not accept the data. This is
normal.

3. Usage Hints

3.1 Accessing Password-protected web sites automatically

Note that if the data you are accessing is sensitive, it is best NOT TO
access it in an unattended fashion unless your computer is physically
secure (nobody else has access to it).

You can set bookmarks and extract data from password-protected sites IF
you are willing to trust Internet Explorer with your password. IE offers
to SAVE YOUR PASSWORD. Check that option. UnWWWrap! always clicks OK
on the password dialog, which accepts the password you saved earlier.
4. Contacting support:
Please e-mail support@extradata.com. Your feedback is welcome, and we will
do our best to resolve your problem in a timely manner.

Happy Data Hunting!

-ExtraData Development Team

