this post was submitted on 13 Apr 2024
15 points (100.0% liked)

technology

23212 readers
218 users here now

On the road to fully automated luxury gay space communism.

Spreading Linux propaganda since 2020

Rules:

founded 4 years ago
MODERATORS
 

I found a way to retrieve the text of comments deleted by user or mod. I don't think it is an issue of federated instances not respecting the deletion.

Is it a bug?

Or is "deleting" always really just "hiding"?

you are viewing a single comment's thread
view the rest of the comments
[–] glans@hexbear.net 5 points 5 months ago (2 children)

A long time ago I heard that the most effective way to delete reddit comments was to edit the comment to have no real content, then wait a while (days/weeks/months) so that any reddit mirroring/scraping service would have time to pick it up. Then delete it.

The premise being that such services would only retain the most up to date version of the comment.

With federation in the mix I don't know what would be expected.

[–] ElGosso@hexbear.net 5 points 5 months ago

Reddit itself used to not archive previous edits of comments, which is why this was recommended. Whether scraping or mirroring services don't depends on the service I would assume.

[–] chickentendrils@hexbear.net 4 points 5 months ago (1 children)

That probably worked for some, I pushshift's API returned just the latest revision and a timestamp it was edited at. For forums I archive, I store every event. If it's a small enough edit (using difflib I think) then I store the deltas. If it overwrote most of the comment I store the latest non-overwritten one and mark it as having been overwritten with the last event time on the comment.

Text is tiny, and with federation it's trivial to scrape but even centralized forums barely impede data archival.

[–] glans@hexbear.net 1 points 5 months ago

Ya but it's unusual to be doing that.

In tests I was able to retrieve the text of comments deleted a year ago, when I had never even participated in lemmy. Certainly I have no archive of anything and if I did it wouldn't extend so far back.

I think it's fair to allow people to get rid of the low-hanging fruit if they want. Even though the internet is forever. Depending on the threat model, it might be good enough.