

the bad thing oabout regex parsing html (or xml) in general is that how often it just works. like 90% of times, it works 100%, it is just the last 10% where shit breaks. I in most of my scripts use regex or grep, or in language with string methods, use find, and the amount of times it works is just so appealing to implement because all xml parsing libraries suck, and their bindings suck and it is just way to much work when grep ‘title’ gets you 90% there. I feel this.









if i am not wrong, comments are also lazy loaded