User:Jeff.Creamer/Sandbox

jsoup is an open-source Java library of methods designed to extract and manipulate data stored in HTML documents.

jsoup was written in 2009 by Jonathan Hedley, a software development manager for Amazon Seattle. He has distributed it under the MIT License, a permissive free software license similar to the Creative Commons attribution license.

Hedley's avowed intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."

Projects powered by jsoup
jsoup is used in a number of current projects, including Google's OpenRefine data-wrangling tool.