Rating: Based on 1 rating
Reviewed: 1 review
Downloads: 278
Released: Mar 8, 2009
Updated: Apr 9, 2009 by EsbenCarlsen
Dev status: Beta

Recommended Download

Source Code
source code, 10181K, uploaded Apr 4, 2009 - 278 downloads

Release Notes

Source code of the initial release of NCrawler:

A simple and very efficient multithreaded web crawler with pipeline-based processing, written in C#. It contains HTML, text, PDF, and IFilter document processors. Steps are easy to add to the pipeline to extract, use, and alter information.

Added language detection using a Google web service.
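
The pipeline-based processing mentioned in the release notes can be illustrated with a minimal sketch. This is not NCrawler's API: it is written in Python rather than C#, and every name in it (Page, extract_title, count_words, run_pipeline) is hypothetical, chosen only to show how independent steps might extract, use, and alter information about a crawled document.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Page:
    """A fetched document flowing through the pipeline (hypothetical type)."""
    url: str
    content: str
    properties: Dict[str, object] = field(default_factory=dict)


# A pipeline step receives a page and may read or alter it in place.
Step = Callable[[Page], None]


def extract_title(page: Page) -> None:
    """Extract step: store the text between <title> tags, if present."""
    lower = page.content.lower()
    start = lower.find("<title>")
    end = lower.find("</title>")
    if start != -1 and end > start:
        title = page.content[start + len("<title>"):end]
        page.properties["title"] = title.strip()


def count_words(page: Page) -> None:
    """Use step: record a simple whitespace-based token count."""
    page.properties["word_count"] = len(page.content.split())


def run_pipeline(page: Page, steps: List[Step]) -> Page:
    """Run each step in order; later steps see what earlier ones stored."""
    for step in steps:
        step(page)
    return page


# Assumed sample input; a real crawler would fetch this over HTTP.
page = run_pipeline(
    Page(url="http://example.com/",
         content="<html><title> Hello </title> body text</html>"),
    [extract_title, count_words],
)
```

In a design like this, adding a new processing step only means writing one more function and appending it to the list, which is the kind of extensibility the release notes describe.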

Reviews for this release

Very simple to use and extend, and seems to perform excellent.
by HRJN on Aug 17, 2009 at 7:52 AM