this post was submitted on 27 Jan 2025
101 points (98.1% liked)

Slop.

337 readers
615 users here now

For posting all the anonymous reactionary bullshit that you can't post anywhere else.

Rule 1: All posts must include links to the subject matter, and no identifying information should be redacted.

Rule 2: If your source is a reactionary website, please use archive.is instead of linking directly.

Rule 3: No sectarianism.

Rule 4: TERF/SWERFs Not Welcome

Rule 5: No bigotry of any kind, including ironic bigotry.

Rule 6: Do not post fellow hexbears.

Rule 7: Do not individually target other instances' admins or moderators.

Rule 8: Do not post public figures, these should be posted to c/gossip

founded 2 months ago
MODERATORS
 
you are viewing a single comment's thread
view the rest of the comments
[–] trinicorn@hexbear.net 11 points 5 days ago (1 children)

maybe it goes harder in mandarin

I imagine the training material on the subject would be much better in mandarin

[–] peppersky@hexbear.net 5 points 5 days ago (2 children)

Pretty sure there's loads of communist theory in all different flavors in english (and dozens of other languages) available for free on the Internet. Kinda makes me wonder: there are some ai image models where you can look at a part of their datasets, is there actually some way to check whether the dataset of an LLM contains any amount of narcist theory?

[–] keepcarrot@hexbear.net 3 points 5 days ago (1 children)

I thought this could run off-line? Doesn't that mean we could just dump prolewiki into it or something? (Or is it already compiled? Idk)

[–] trinicorn@hexbear.net 1 points 5 days ago

it is already compiled/trained. it's open source so you could re-train it but they did spend in the low millions on training so fully retraining from scratch is impractical for an individual. Maybe there's a way to do supplemental/reinforcement training on the released model but I have no idea.

[–] trinicorn@hexbear.net 1 points 5 days ago

sure, there's theory, but in terms of raw amount of stuff in the training material there's going to be a lot less high quality english discussion of marxism I'd bet and a lot of psuedo-marxist junk mixed in there, probably much less of that in Chinese. Some models publish their training datasets I believe but not all