About: OutWit Hub     Goto   Sponge   NotDistinct   Permalink

An Entity of Type : wikidata:Q7397, within Data Space : dbpedia.org associated with source document(s)
QRcode icon
http://dbpedia.org/c/2j5RYLy7fc

OutWit Hub is a Web data extraction software application designed to automatically extract information from online or local resources. It recognizes and grabs links, images, documents, contacts, recurring vocabulary and phrases, rss feeds and converts structured and unstructured data into formatted tables which can be exported to spreadsheets or databases. The first version was released in 2010. Version 9.0 was released in January 2020.

AttributesValues
rdf:type
rdfs:label
  • OutWit Hub (en)
rdfs:comment
  • OutWit Hub is a Web data extraction software application designed to automatically extract information from online or local resources. It recognizes and grabs links, images, documents, contacts, recurring vocabulary and phrases, rss feeds and converts structured and unstructured data into formatted tables which can be exported to spreadsheets or databases. The first version was released in 2010. Version 9.0 was released in January 2020. (en)
foaf:name
  • OutWit Hub (en)
name
  • OutWit Hub (en)
dct:subject
Wikipage page ID
Wikipage revision ID
Link from a Wikipage to another Wikipage
Link from a Wikipage to an external page
sameAs
dbp:wikiPageUsesTemplate
developer
  • OutWit Technologies (en)
genre
license
operating system
has abstract
  • OutWit Hub is a Web data extraction software application designed to automatically extract information from online or local resources. It recognizes and grabs links, images, documents, contacts, recurring vocabulary and phrases, rss feeds and converts structured and unstructured data into formatted tables which can be exported to spreadsheets or databases. The first version was released in 2010. Version 9.0 was released in January 2020. The program includes a Mozilla-based browser and a side bar which gives access to a number of views with pre-set extractors. Web pages and textual documents are broken down into their different constituents, presented as tables in these views. The application can navigate through series of links and sequences of search engine results pages to extract information elements, organize them in tables and export them to various formats. The predefined extractors allow to collect structured tables, lists or feeds. Custom scrapers can also be created to extract data from less structured page elements. Regular expressions can be included in scrapers as well as in other parts of the application to define variable recognition markers. Although OutWit Hub is presented as a tool for non-technical users, the fact that the application doesn't use the document object model structure for its extractions prevents visual "point & grab" data scraping and forces the user who wants to create custom scrapers to define markers in the source code of the page. The advantage of this approach, however, is that it allows a more precise definition of extraction masks than HTML nodes and faster execution, as the document object model tree doesn't need to be rendered by the browser at extraction time. (en)
prov:wasDerivedFrom
page length (characters) of wiki page
genre
license
operating system
foaf:isPrimaryTopicOf
is Link from a Wikipage to another Wikipage of
is Wikipage redirect of
is foaf:primaryTopic of
Faceted Search & Find service v1.17_git147 as of Sep 06 2024


Alternative Linked Data Documents: ODE     Content Formats:   [cxml] [csv]     RDF   [text] [turtle] [ld+json] [rdf+json] [rdf+xml]     ODATA   [atom+xml] [odata+json]     Microdata   [microdata+json] [html]    About   
This material is Open Knowledge   W3C Semantic Web Technology [RDF Data] Valid XHTML + RDFa
OpenLink Virtuoso version 08.03.3331 as of Sep 2 2024, on Linux (x86_64-generic-linux-glibc212), Single-Server Edition (378 GB total memory, 49 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2024 OpenLink Software