-
Notifications
You must be signed in to change notification settings - Fork 1
Home
Lejeune Gaël edited this page Dec 6, 2016
·
3 revisions
Welcome to the daniel wiki!
Daniel relies on the rstr_max lib which has been implemented by Romain Brixtel, thanks to him. It is integrated in this repository
Daniel works much better with a boilerplate removal tool such as justext. You can clone it from : https://github.com/miso-belica/jusText Or install it from : http://corpus.tools/wiki/Justext
If Justext is not installed or you do not want to install it, you can provide already cleaned HTML files as well. Please remember that the markup (paragraph and titles) is needed.