It recognizes and grabs links, images, documents, contacts, recurring vocabulary and phrases, rss feeds and converts structured and unstructured data into formatted tables which can be exported to spreadsheets or databases.
The program includes a Mozilla-based browser and a side bar which gives access to a number of views with pre-set extractors.
The application can navigate through series of links and sequences of search engine results pages to extract information elements, organize them in tables and export them to various formats.
[1] Regular expressions can be included in scrapers as well as in other parts of the application to define variable recognition markers.
[2] Although OutWit Hub is presented as a tool for non-technical users, the fact that the application doesn't use the document object model structure for its extractions prevents visual "point & grab" data scraping and forces the user who wants to create custom scrapers to define markers in the source code of the page.