Skip to content
Lejeune Gaël edited this page Dec 6, 2016 · 3 revisions

Welcome to the daniel wiki!

Daniel relies on the rstr_max lib which has been implemented by Romain Brixtel, thanks to him. It is integrated in this repository

Daniel works much better with a boilerplate removal tool such as justext. You can clone it from : https://github.com/miso-belica/jusText Or install it from : http://corpus.tools/wiki/Justext

If Justext is not installed or you do not want to install it, you can provide already cleaned HTML files as well. Please remember that the markup (paragraph and titles) is needed.

Clone this wiki locally