special support for selfhosted/federated services #50
Labels
No Label
bug
confirmed
critical
discussion
documentation
enhancement
fix-extractor
help-wanted
new-extractor
suggestion
support
wontfix
No Milestone
No project
No Assignees
1 Participants
Notifications
Due Date
No due date set.
Dependencies
No dependencies set.
Reference: laudom/haruhi-dl#50
Loading…
Reference in New Issue
There is no content yet.
Delete Branch "%!s(<nil>)"
Deleting a branch is permanent. Although the deleted branch may exist for a short time before cleaning up, in most cases it CANNOT be undone. Continue?
Typical extraction flow (for context):
The problem with this flow on federated/services services is that they have a shitload of domains, and they can't ever be all listed. fediverse.network lists over 2500 running instances. There could even (at least teoretically) be cases of selfhosted servers in local networks (as an alternative to Facebook Workplace or something).
The selfhosted services cannot just skip the domain part, as the URL scheme may (and does) overlap with other services.
As an example: Gab Social's https://gab.com/ACT1TV/posts/104450493441154721 overlaps with Facebook's https://www.facebook.com/aniaainagrodzka/posts/10222580971026685.
For this reason, we should implement a separate loader for selfhosted services. The extraction flow should work like this:
Existing selfhosted extractors (PeerTube, and any others if they exist) should be then migrated to use this.
closed
mentioned in commit
889005bab3
mentioned in issue #17
mentioned in issue #11
changed the description