Experiences with the Nutch search engine
Slides
Related content
Report a problem or upload files
If you have found a problem with this lecture or would like to send us extra material, articles, exercises, etc., please use our ticket system to describe your request and upload the data.Enter your e-mail into the 'Cc' field, and we will keep you updated with your request's status.
Description
Nutch is open-source software that implements a web search engine. It has been used in a variety of applications: vertical search engines, archival web search, search engines that incorporate novel metadata, etc. Nutch is itself implemented using Hadoop, an open-source platform for scalable computing. Hadoop facilitates the development and management of applications that run on large numbers of computers and on very large datasets. Hadoop has been demonstrated on clusters with hundreds of computers and is designed to scale to thousands of computers. This talk will present the architecture, capabilities and current status of these two projects.
Link this page
Would you like to put a link to this lecture on your homepage?Go ahead! Copy the HTML snippet !
Reviews and comments:
j'utilise nutch , je l'ai installé sur suze 2.6 mais j'arrive pas encore à compiler la source de nutch sur jbuildre ou ..
Lots of great content here. Go Doug Go!
This video is corrupted. After a few minutes a playing the windows media version, it stops. I've tried RealPlayer and Windows Media Player.
Sorry but he could have skipped on his "ah" "ah" "um" "um", it was annoying as hell
good to have this video file to be watched, very useful !
Write your own review or comment: