My app crawls a couple of webpages, grabs the inner text (not the HTML) of each page, and puts it in textboxes.

My goal is to automate DATE extraction from those texts. I think I will have to code some wicked routines to catch every date that could be found, regardless of the format:

May 30, 2012
May 30 2012
May-30-2012
2012-04-30
12-04-30
12/04/30
(and *SO* many others)

I think that by frequently updating my code to cover new formats, I might end up retrieving most of the dates found in there.
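
To make the idea concrete, here is a minimal sketch of the kind of table-driven routine I have in mind, written in Python purely for illustration. The names `PATTERNS` and `extract_dates` are made up for this example. It only covers the formats listed above, with full English month names, and it assumes that two-digit dates are year-month-day as in my examples, which will not always be true on real pages.

```python
import re
from datetime import date, datetime

# Full English month names only; abbreviations would need their own entry.
MONTH_NAME = (
    r"(?:January|February|March|April|May|June|July|"
    r"August|September|October|November|December)"
)

# Hypothetical pattern table: each row pairs a regex with the strptime format
# used once separators are normalized to single spaces. Supporting a new
# format means adding a row here.
PATTERNS = [
    # May 30, 2012 / May 30 2012 / May-30-2012
    (re.compile(rf"\b{MONTH_NAME}[ -]\d{{1,2}},?[ -]\d{{4}}\b"), "%B %d %Y"),
    # 2012-04-30
    (re.compile(r"\b\d{4}-\d{2}-\d{2}\b"), "%Y %m %d"),
    # 12-04-30 / 12/04/30 (assumed to be year-month-day)
    (re.compile(r"\b\d{2}[-/]\d{2}[-/]\d{2}\b"), "%y %m %d"),
]


def extract_dates(text: str) -> list[tuple[int, str, date]]:
    """Return (position, matched text, parsed date) for each date found."""
    found = []
    for regex, fmt in PATTERNS:
        for match in regex.finditer(text):
            # Turn commas, dashes, slashes and runs of spaces into one space.
            normalized = re.sub(r"[,\-/\s]+", " ", match.group(0))
            try:
                parsed = datetime.strptime(normalized, fmt).date()
            except ValueError:
                # Looked like a date but is not a valid one, e.g. 2012-13-45.
                continue
            found.append((match.start(), match.group(0), parsed))
    # Report matches in the order they appear in the text.
    found.sort(key=lambda item: item[0])
    return found


if __name__ == "__main__":
    sample = "Posted May 30, 2012, updated May-30-2012; due 2012-04-30 or 12/04/30."
    for position, raw, parsed in extract_dates(sample):
        print(position, repr(raw), parsed.isoformat())
```

Keeping the formats in one table is what would let me extend coverage over time without touching the matching loop. Note that `%B` in `strptime` depends on the current locale, so this sketch assumes an English one.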

I already have some good (I think) ideas on how to approach this piece of code, but then I thought: has someone ever done that? If a routine is already available, that'd save some work. I couldn't find anything on the forums.

Any thoughts greatly appreciated!