Archive for August, 2009

Semantic analysis

Tuesday, August 25th, 2009

I understand it is mean to talk about features that are not implemented in the downloadable versions, but I would like to share my ideas on the purpose behind our experimental semantic features.

The “mechanical” recognition and extraction algorithms used in most views of the Hub are mostly based on a combination of DOM analysis (when dealing with HTML pages) and morphological recognition of objects and strings. These techniques are very efficient for simple scraping of data, but they are not sufficient when we need to discriminately extract data about certain themes or topics. We are currently adding semantic capacities to our extractors (in professional applications only, for now).

At the moment, we are only focusing  on statistical analysis of the words and phrases, without performing any syntactic analysis of the texts. However, the results are very promising and seem to confirm our original ideas.

(more…)

Our mission

Tuesday, August 11th, 2009

At OutWit, we are working on adding intelligence to the Web browser.

The free beta applications that you have been downloading from our site are only parts of what we are developing. They are implementations of some of the recognition and extraction capacities that we are including in the OutWit Kernel. We have been talking about a public API for more than a year now and, although it is definitely still in the pipe, we have been delaying it (as for the complete help and documentation) until we can reach a stable enough version of the kernel and feel confortable with people starting to write code around it.

We are convinced that the future will prove it was a good idea to add semantic intelligence to the browser itself instead of exclusively focusing on the server side.