|
this.clean(articleContent, "button") |
This line of code seems to be the one responsible for removing the side-by-side images of this Android Authority article, which are core to the content:
https://www.androidauthority.com/zerocam-ai-3498885/
The HTML tag structure behind the images is the following:
button > picture > source > img
Could there be some exception where the tags are not removed when they contain an image? Although the logic would probably have to be a bit more sophisticated to prevent button icons from making it into articles.
Readability4J/src/main/kotlin/net/dankito/readability4j/processor/ArticleGrabber.kt
Line 791 in 170d052
This line of code seems to be the one responsible for removing the side-by-side images of this Android Authority article, which are core to the content:
https://www.androidauthority.com/zerocam-ai-3498885/
The HTML tag structure behind the images is the following:
button > picture > source > img
Could there be some exception where the tags are not removed when they contain an image? Although the logic would probably have to be a bit more sophisticated to prevent button icons from making it into articles.