EMailEntityExtractionProcessor

the property bag doesnt show any property for email when i use the above class please help!

Id #11909 | Release: None | Updated: Jul 2, 2013 at 11:24 PM by basita | Created: Jul 2, 2013 at 11:24 PM by basita

Multiple threads problem

Hello, I've recently tried to use NCrawler from WinService application and faced to strange problem - when I try to crawl sites which have at least 100-200 pages crawler silently stops to work. Ex...

Id #11719 | Release: None | Updated: Apr 19, 2013 at 3:05 AM by mick_astley | Created: Apr 19, 2013 at 3:05 AM by mick_astley

Multithreading problem

Hello, I've recently tried to use NCrawler from WinService application and faced to strange problem - when I try to crawl sites which have at least 100-200 pages crawler silently stops to work. Ex...

Id #11718 | Release: None | Updated: Apr 19, 2013 at 3:04 AM by mick_astley | Created: Apr 19, 2013 at 3:04 AM by mick_astley

Built-in PDF text extractor is terrible

I found that the built-in iTextSharp Pdf Processor quite simply does not work on most PDF documents. So I searched and found this: http://www.squarepdf.net/pdfbox-in-net-download/ I find that thi...

Id #11655 | Release: None | Updated: Mar 20, 2013 at 5:10 PM by bp2008 | Created: Mar 20, 2013 at 5:10 PM by bp2008

RobotService doesn't work, wrong case

In RobotService we have this line: Instruction = lineArray[0].Trim().ToUpperInvariant(); then, when it compares first character of line in switch/case it uses small caps: case 'u' case 'd'...

Id #11485 | Release: None | Updated: Feb 21, 2013 at 10:52 PM by hudo | Created: Jan 9, 2013 at 8:13 PM by hudo

Exception events not fired on 404

First of all this is an excellent, well engineered project, I hope it's not dead! I found out that when a page is not found (error 404) a WebExeption is thrown from WebDownloaderV2.ResponseCallback...

Id #10733 | Release: None | Updated: Feb 21, 2013 at 10:52 PM by Piedone | Created: May 4, 2012 at 9:29 PM by Piedone

how to get the full html?

Hi there, Thanks for your NCrawler. Looks very good.   Can you explain how to obtain the HTML of each crawl?   Thanks Ricardo

Id #10436 | Release: None | Updated: Feb 21, 2013 at 10:52 PM by Piedone | Created: Feb 26, 2012 at 10:02 AM by ricardok1

Crawler.Redis module

Added Crawler.Redis module courtesy of Kamil Janiszewski   what is this module for? Can someone tell me please becouse idn't know if works.

Id #10146 | Release: None | Updated: Feb 21, 2013 at 10:52 PM by youngcoder | Created: Jan 14, 2012 at 8:19 AM by senzacionale

UrlEncode urls to be crawled

Urls found in an html document with spaces or characters like "å ä ö" won't pass the Uri.IsWellFormedUri check and therefore won't be added to a crawl step.

Id #9375 | Release: None | Updated: Feb 21, 2013 at 10:52 PM by ivanlewis | Created: Aug 23, 2011 at 12:20 PM by swemaniac

thread problem on work method

Hi esben!   private void Work(Crawler crawler, PropertyBag propertyBag) { AllocConsole(); Console.Out.WriteLine(); Console.Out.WriteLine("Url: {0}", propertyBag.Step.Uri); Console.Out.WriteLine("C...

Id #9112 | Release: None | Updated: Feb 21, 2013 at 10:52 PM by senzacionale | Created: Jul 19, 2011 at 6:08 AM by senzacionale