Duplicate Content · User Defined Rules #58
Notifications
Due Date
No due date set.
Blocks
#14 Duplicate Content · User Defined Rules
corbz/PYRSS-Bot
Reference: corbz/PYRSS-Website#58
Loading…
x
Reference in New Issue
Block a user
No description provided.
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
It's clear now that the ability to uniquely identify articles needs to become a user problem, as hard-coding it will not work due to the varying implementations of uniqueness from differing RSS sources.
Example: one tested feed produced the same article under two different guids and urls, one using
/local/
in the path, and the other using/town-name/
instead, making it impossible to rely on guids or urls for uniqueness.Another example: The BBC publishes updates to their existing articles using updated guids, usually ending in
#<revision-number-here>
, meaning guids alone arent good enough, but they also release the article under multiple categories with different urls, causing the same issue as before.Task:
I've added various options into the webui for how to identify duplicate articles, being:
The user can select one or many per subscription.