You are currently visiting an old archive website with limited functionality. If you are looking for the current Berlin Buzzwords website, please visit https://berlinbuzzwords.de

Nutch as a web mining platform - the present and the future

Mon, 2010-06-07 16:30 - 17:15
Speaker: Andrzej Bialecki

The Nutch platform for building large-scale search engines continues to serve as a flagship example of Hadoop-based applications. This talk will start with an overview of the Nutch architecture, present some less typical uses of Nutch as a web mining platform (based on real use cases), and outline a new range of applications expected to result from the ongoing redesign of the platform.

Finite-State Queries in Lucene

Mon, 2010-06-07 11:30 - 12:15
Speaker: Robert Muir

This talk will focus on how an upcoming version of Lucene will support scalable 'inexact' queries, such as pattern-matching and fuzzy queries.

In current versions of Lucene, these queries do not scale well.
