May 14, 2008

a "mostly" compliant java-based email parser

if you've ever had to deal with email addresses you know that verifying RFC-2822 compliance is sometimes a black, black art. ColdFusion's isValid() function is certainly handy for this but it's not 100% compliant (actually i don't think anything is 100% RFC-2822 compliant, just have a read thru RFC-2822 to see what i mean).
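to give a feel for why this is hard, here's a minimal sketch (not ColdFusion's isValid() and not the parser mentioned in this post) that only checks the simplest addr-spec form in RFC-2822, dot-atom "@" dot-atom. the class name SimpleAddrSpecCheck and the method isDotAtomAddrSpec are made up for illustration. it deliberately ignores quoted strings, comments, folding whitespace, domain literals and the obsolete syntax, all of which the RFC allows, so it rejects some perfectly legal addresses.

```java
import java.util.regex.Pattern;

// hypothetical helper, for illustration only: covers just the dot-atom form
// of an RFC-2822 addr-spec, nothing else in the grammar
public class SimpleAddrSpecCheck {

    // "atext" characters from RFC-2822 section 3.2.4
    private static final String ATEXT = "[A-Za-z0-9!#$%&'*+/=?^_`{|}~-]+";

    // dot-atom: runs of atext separated by single dots
    private static final String DOT_ATOM = ATEXT + "(?:\\." + ATEXT + ")*";

    private static final Pattern ADDR_SPEC =
            Pattern.compile(DOT_ATOM + "@" + DOT_ATOM);

    public static boolean isDotAtomAddrSpec(String s) {
        // matches() requires the whole string to fit the pattern
        return s != null && ADDR_SPEC.matcher(s).matches();
    }

    public static void main(String[] args) {
        String[] samples = {
            "paul@example.com",
            "first.last@example.co.th",
            "a..b@example.com",              // empty atom between the dots
            "\"quoted local\"@example.com"   // legal per the RFC, rejected by this sketch
        };
        for (String s : samples) {
            System.out.println(s + " -> " + isDotAtomAddrSpec(s));
        }
    }
}
```

the last sample is the interesting one: a quoted local part is valid under RFC-2822, but a simple pattern like this one turns it away, which is exactly the kind of gap that makes "100% compliant" so elusive.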

well today on the javamail list someone announced "The only more-or-less-2822-compliant Java-based email address extracter/verifier", with some header verification as well. i did "some" testing on it w/our "crazy" email address suite & it passed, but as for being more-or-less-2822-compliant i guess i'll have to take the author's word for that ;-)

have a look at EmailAddress.java and see for yourself.

May 06, 2008

the death of codepages?

mark davis, via the unicode mailing list, mentioned an official google blog posting showing that unicode "was the most frequent encoding found on web pages" since dec-2007 (unicode, utf-8, is the blue line on the graph in that posting). wow. i guess people really do get it :-)
reference: Moving to Unicode 5.1