Skip to content

[BUG] Regresion: getting messages with deletion even the content still seems to be there #104

@mhalano

Description

@mhalano

NOTE: IF YOU'RE UNSURE, PLEASE LOOK AT OR OPEN A DISCUSSION INSTEAD! https://github.com/mborsetti/webchanges/discussions

Describe the bug.
A clear and concise description of what the bug is. If it's related to an error, please also attach the traceback
of the exception obtained by running webchanges with -v or -vv.

Using version 3.31.0 I started receiving many emails that said the content was deleted. I set additions_only: true and I still get the emails because it says more than 75% of the content was changed (removed, in my case).

Steps to reproduce.
Include relevant snippets of anything required (e.g. jobs and configuration files) for anyone to fully reproduce the
bug.
I have this job that shows problem:

---
name: New NVIDIA driver series available on graphics-drivers PPA
url: http://ppa.launchpad.net/graphics-drivers/ppa/ubuntu/pool/main/n/
additions_only: true
filters:
  - html2text: strip_tags
  - re.findall:
      pattern: 'nvidia-graphics-drivers-(.*?)\/'
      repl: \1
  - re.findall: '(?m)(^.*$)(?!\n.+)'
  - striplines:

Expected behavior.
A clear and concise description of what you expected to happen.

Should not receive any emails without change.

Screen scrape/screenshots.
If applicable, add screen scrape or screenshots to help explain the bug; please use -v or -vv to capture
logging.

Image Image

Version info.
Please run webchanges --detailed-versions and paste the output here.

➜ ~/webchanges/bin/webchanges --detailed-version
Software:
• webchanges: 3.31.0
• CPython: 3.13.3 ('main', 'Jun 16 2025 18:15:32') GCC 14.2.0
• SQLite: 3.46.1

Installed PyPi dependencies:
• cssselect: 1.3.0
• h2: 4.2.0
• html2text: 2025.4.15
• httpx: 0.28.1
  - anyio: 4.9.0
  - certifi: 2025.7.14
  - idna: 3.10
• lxml: 6.0.0
• markdown2: 2.5.4
• msgpack: 1.1.1
• platformdirs: 4.3.8
• pyyaml: 6.0.2
• zstandard: 0.23.0

Additional context.
Add any other context about the problem.

I put in my crontab to run two instances of webchanges:

*/30 * * * * ~/.local/bin/webchanges > /dev/null 2>&1
*/30 * * * * ~/webchanges/bin/webchanges --database ~/webchanges/database.db --cache ~/webchanges/cache.db > /dev/null 2>&1

The version installed in the system using pip is 3.30. The version under ~/webchanges (I used virtualenv to do that) is 3.31.0.

I can check what version sent the email because the version is in the footer, and always is 3.31.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingpythonPull requests that update Python code

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions