jsoup

From Wikipedia, the free encyclopedia
Jump to navigation Jump to search
jsoup Java HTML Parser
DeveloperJonathan Hedley
Stable release
1.21.2 / August 25, 2025; 7 months ago (2025-08-25)[1]
Repository
  • {{URL|example.com|optional display text}}Lua error in Module:EditAtWikidata at line 29: attempt to index field 'wikibase' (a nil value).
Written inJava
Engine
    Lua error in Module:EditAtWikidata at line 29: attempt to index field 'wikibase' (a nil value).
    Operating systemCross-platform
    PlatformJava (JVM)
    TypeHTML parser
    LicenseMIT license
    Websitejsoup.org

    jsoup is an open-source Java library designed to parse, extract, and manipulate data stored in HTML documents.

    History

    [edit | edit source]

    jsoup was created in 2009 by Jonathan Hedley. It is distributed it under the MIT License, a permissive free software license similar to the Creative Commons attribution license.

    Hedley's avowed intention in writing jsoup was "to deal with all varieties of HTML found in the wild; from pristine and validating, to invalid tag-soup."

    Projects powered by jsoup

    [edit | edit source]

    jsoup is used in a number of current projects,[2] including Google's OpenRefine data-wrangling tool.

    See also

    [edit | edit source]

    References

    [edit | edit source]
    1. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    2. ^ Lua error in Module:Citation/CS1/Configuration at line 2172: attempt to index field '?' (a nil value).
    [edit | edit source]