Parse Meta Tags In Java
I have a collection of HTML documents for which I need to parse the contents of the <meta> tags in the <head> section. These are the only HTML tags whose values I'm int
Solution 1:
You can likely use the Jericho HTML Parser. In particular, have a look at this to see how you can go about finding specific tags.
Solution 2:
If it suits your application you can use Tidy to convert HTML to valid XML, and then use as much XPath as you like!
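As a rough sketch of the XPath half of that approach, the snippet below uses only the JDK's built-in XML and XPath classes. It assumes the HTML has already been converted to well-formed XML (the Tidy step is not shown), and that the metadata lives in `<meta name="..." content="...">` elements under `/html/head`. The class name `MetaTagExtractor`, the method `extractMetaTags`, and the sample markup are made up for illustration. The sample also leaves out the DOCTYPE and `xmlns` attribute that converted output may carry, so real input may need extra parser configuration.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: collects name/content pairs from <meta> elements.
public class MetaTagExtractor {

    // Expects well-formed XML, e.g. the output of an HTML-to-XML cleanup step.
    public static Map<String, String> extractMetaTags(String xhtml) throws Exception {
        // Parse the markup into a DOM tree (namespace-unaware by default).
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new InputSource(new StringReader(xhtml)));

        // Select every <meta> in the head that has a name attribute.
        XPath xpath = XPathFactory.newInstance().newXPath();
        NodeList nodes = (NodeList) xpath.evaluate(
                "/html/head/meta[@name]", doc, XPathConstants.NODESET);

        // Keep document order; a repeated name overwrites the earlier value.
        Map<String, String> result = new LinkedHashMap<>();
        for (int i = 0; i < nodes.getLength(); i++) {
            Element meta = (Element) nodes.item(i);
            result.put(meta.getAttribute("name"), meta.getAttribute("content"));
        }
        return result;
    }

    public static void main(String[] args) throws Exception {
        // Made-up sample document, already well-formed.
        String xhtml =
                "<html><head>"
              + "<title>Sample</title>"
              + "<meta name=\"description\" content=\"Sample page\"/>"
              + "<meta name=\"keywords\" content=\"java, html\"/>"
              + "<meta http-equiv=\"Content-Type\" content=\"text/html\"/>"
              + "</head><body><p>Hello</p></body></html>";

        extractMetaTags(xhtml).forEach(
                (name, content) -> System.out.println(name + " = " + content));
    }
}
```

The `[@name]` predicate skips `http-equiv` entries; drop it or adjust the expression if those values matter too.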