https://github.com/cantino/ruby-readability I've been told by @oncletom that it works and does the job fine in lots of cases. There may be test cases to copy (Apache2 licence like this repo)