I've been using Mozilla's Readability library (https://github.com/mozilla/readability) for a while now to extract plain text from our company's old wiki. The quality is OK, but I'm wondering whether there are any alternatives out there, preferably better ones.
It doesn't have to be written in any specific language, but it should be standalone so that it can be integrated into an existing application.