this post was submitted on 22 Nov 2024
36 points (100.0% liked)

Not asking for any specific purpose, just that it'd suck if all that was lost because reddit-logo threw a tantrum.

top 12 comments
[–] Awoo@hexbear.net 27 points 1 month ago* (last edited 1 month ago) (2 children)

You can obtain an archive of /r/chapotraphouse here: https://the-eye.eu/redarcs/

Type in chapo, you'll find moretankiechapo there too. And countless others. The entirety of the top 20,000 subreddits was archived before they shut down the API and that API still allowed you to pull content from banned subs.

Posts and comments archive separately so you'll have to get them separately. There's probably some way to rebuild the entire thing using them as the basis of new databases but don't ask me how.
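For anyone curious, here's a rough sketch of how that rebuild might go. It assumes the dumps are zstd-compressed newline-delimited JSON (one object per line) and that the field names used below (`id`, `title`, `selftext`, `link_id`, `parent_id`, `body`, `author`, `created_utc`) match what's actually in the files, so check a few lines first. `zstandard` is a third-party package, and the table layout and function names are made up for illustration.

```python
import io
import json
import sqlite3


def read_zst_lines(path):
    """Yield text lines from a .zst file (needs `pip install zstandard`)."""
    import zstandard  # third-party; imported lazily so the rest works without it

    with open(path, "rb") as fh:
        # Assumption: the dumps were compressed with a large window,
        # so the decompressor's default limit has to be raised.
        dctx = zstandard.ZstdDecompressor(max_window_size=2**31)
        reader = dctx.stream_reader(fh)
        yield from io.TextIOWrapper(reader, encoding="utf-8", errors="replace")


def build_db(conn, post_lines, comment_lines):
    """Load posts and comments (iterables of JSON lines) into two linked tables."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS posts ("
        "id TEXT PRIMARY KEY, author TEXT, created_utc INTEGER, "
        "title TEXT, selftext TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS comments ("
        "id TEXT PRIMARY KEY, post_id TEXT, parent_id TEXT, author TEXT, "
        "created_utc INTEGER, body TEXT)"
    )
    for line in post_lines:
        if not line.strip():
            continue
        p = json.loads(line)
        conn.execute(
            "INSERT OR REPLACE INTO posts VALUES (?, ?, ?, ?, ?)",
            (p.get("id"), p.get("author"), int(p.get("created_utc") or 0),
             p.get("title"), p.get("selftext")),
        )
    for line in comment_lines:
        if not line.strip():
            continue
        c = json.loads(line)
        link_id = c.get("link_id") or ""
        # Assumption: link_id looks like "t3_<post id>"; strip the prefix
        # so comments can be joined back to their post.
        post_id = link_id[3:] if link_id.startswith("t3_") else link_id
        conn.execute(
            "INSERT OR REPLACE INTO comments VALUES (?, ?, ?, ?, ?, ?)",
            (c.get("id"), post_id, c.get("parent_id"), c.get("author"),
             int(c.get("created_utc") or 0), c.get("body")),
        )
    conn.commit()
```

Usage would be something like `build_db(sqlite3.connect("chapo.db"), read_zst_lines("posts.zst"), read_zst_lines("comments.zst"))`, where both file names are placeholders for whatever the downloads are actually called. After that, a `JOIN` on `comments.post_id = posts.id` gets you each thread back.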

[–] Shinhoshi@lemmygrad.ml 14 points 1 month ago (1 children)

I wonder if y'all could upload the archive to Hexbear like we did with !genzhouarchive@lemmygrad.ml

[–] Awoo@hexbear.net 7 points 1 month ago

With someone who knows how that was done, yeah.

[–] Tabitha@hexbear.net 10 points 1 month ago (2 children)

r/ChapoTrapHouse Posts 177M .zst, Comments 845M .zst

that's pretty neat, I think someone could make an archive page/utility out of the subreddit based on that.

[–] Tabitha@hexbear.net 8 points 1 month ago* (last edited 1 month ago) (1 children)

I guess there's no overlap between people who know what to do with a raw data dump (probably me?) and people who know what they want from it lol (not me lol), because this could have been done 4 years ago.

[–] ChaosMaterialist@hexbear.net 1 points 1 month ago

I have thought of fishing out some of the effortposts and comments. There are lots of gems in there. But yeah I'm not sure what to do with it.
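One way to start fishing, as a very rough sketch: scan the comments dump for long, well-upvoted comments. This assumes each line is a JSON object with `id`, `body` and `score` fields, and the thresholds are arbitrary guesses to tune by hand; the function name is made up.

```python
import heapq
import json


def find_effortposts(comment_lines, min_chars=1500, min_score=20, limit=100):
    """Return up to `limit` long, high-scoring comments, best score first."""
    candidates = []
    for line in comment_lines:
        if not line.strip():
            continue
        c = json.loads(line)
        body = c.get("body") or ""
        # Skip placeholders left behind by deleted or removed comments.
        if body in ("[deleted]", "[removed]"):
            continue
        score = int(c.get("score") or 0)
        if len(body) >= min_chars and score >= min_score:
            candidates.append({"id": c.get("id"), "score": score, "body": body})
    # Rank by score first, then by length as a tie-breaker.
    return heapq.nlargest(
        limit, candidates, key=lambda c: (c["score"], len(c["body"]))
    )
```

`comment_lines` can be any iterable of text lines, such as an open file handle on the decompressed comments dump.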

[–] Awoo@hexbear.net 7 points 1 month ago* (last edited 1 month ago)

shutdownbythecia.com ?

[–] propter_hog@hexbear.net 14 points 1 month ago

probably, because social media companies almost never delete anything, but it'll never see the light of day again.

[–] Tabitha@hexbear.net 7 points 1 month ago (2 children)

you could have used pushshift to get the raw JSON before reddit cut off API access a year ago (pretty much all the image URLs still work for banned subs AFAIK).

I don't know how easy it is to use pushshift for free since then.

[–] Tabitha@hexbear.net 6 points 1 month ago

> pretty much all the image URLs still work for banned subs AFAIK

there is a caveat that galleries and videos might not work if a subreddit mod removed them, but I don't know if that applies to banned subreddits.

[–] sovietknuckles@hexbear.net 2 points 1 month ago

The sites that @Awoo@hexbear.net and I posted are both powered by pushshift

[–] sovietknuckles@hexbear.net 2 points 1 month ago* (last edited 1 month ago)

If you're searching for something instead of browsing, https://ihsoyct.github.io/index.html is good