Image

Imagegorgonous wrote in Imagelinux

HTML Cleaner

Anybody know of any good HTML cleaner Perl modules (Or scripts) that will add newlines and indentations with tables? I want to parse some data that I will be retrieving from a site, but it's all in one gigantic HTML blob that will be a serious pain in the ass to parse unless it's cleaned up a bit. (I'm sure they did it this way for that very reason)

Update: I couldn't find one quick enough for my attention span, so I just rolled my own that does just enough to get by.