this post was submitted on 02 Feb 2025
18 points (95.0% liked)

Data Is Beautiful

  1. These data are publicly available; I just "rearranged" them in a database I created.
  2. Treat these data as trends, not reality: there are billions of erotic comics/hentai out there, and my database only holds 43 893 of them.
  3. I am not a data analyst. These values should not be taken as definitive, and I may have made errors.

I have a LOT more data I can share, such as the most popular characters/parodies/tags, etc.

I do NOT encourage any of you to read erotic comics/hentai. I personally don't, but I thought these data might be interesting.

top 6 comments
[–] corvus@lemmy.ml 2 points 1 day ago (1 children)

What do you mean by "average number of pages"? Average over what?

[–] Llufollis@sh.itjust.works 3 points 1 day ago* (last edited 1 day ago)

Each entry in the database contains a language and a number of pages. I grouped all the entries by language and took the average number of pages for each language. But this also exposes a major weakness: the languages don't all have the same number of entries. Some have thousands, others fewer than a hundred. I should have "normalized" the number of entries for each language and excluded languages that don't have enough entries.
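
A minimal sketch of that per-language average with a minimum-entries cutoff, assuming each entry is a `(language, pages)` pair. The function name, the `min_entries` threshold, and the sample data are all hypothetical; this is not the code actually used for the chart.

```python
from collections import defaultdict


def average_pages_by_language(entries, min_entries=100):
    """Return {language: mean page count}, skipping languages with too few entries."""
    totals = defaultdict(lambda: [0, 0])  # language -> [page sum, entry count]
    for language, pages in entries:
        totals[language][0] += pages
        totals[language][1] += 1
    return {
        language: page_sum / count
        for language, (page_sum, count) in totals.items()
        if count >= min_entries  # drop languages below the cutoff
    }


# Hypothetical sample data, not taken from the real database.
sample = [("english", 20), ("english", 30), ("japanese", 40), ("catalan", 8)]
averages = average_pages_by_language(sample, min_entries=2)
print(averages)
```

With a cutoff like this, a language with only a handful of entries no longer gets a bar that looks as trustworthy as one backed by thousands; the right threshold is a judgment call.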

[–] fuckwit_mcbumcrumble@lemmy.dbzer0.com 4 points 2 days ago (1 children)

What’s the one with the missing label?

[–] Llufollis@sh.itjust.works 3 points 2 days ago

The language hasn't been defined for those entries. I just took the language defined in the metadata of each book without processing it, so the typos and odd values are there because the website owner, or the website it takes the books from, uses different naming conventions or made typos (e.g. Japanese, Javanese).
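
A rough sketch of the kind of cleanup that could be applied to the raw metadata labels before grouping, assuming they are plain strings that may be empty or missing. The function name, the `"unknown"` placeholder, and the alias table are hypothetical; in particular, whether "Javanese" really is a typo for "Japanese" would have to be checked by hand, since Javanese is also a real language.

```python
def normalize_language(raw, aliases):
    """Trim and lowercase a raw label; empty or missing labels become 'unknown'."""
    label = (raw or "").strip().lower()
    if not label:
        return "unknown"
    # Map hand-verified typos or alternate spellings to one canonical name.
    return aliases.get(label, label)


# Hypothetical alias table, to be filled in after reviewing the labels manually.
ALIASES = {"javanese": "japanese"}

raw_labels = ["Japanese", " japanese ", "Javanese", "", None]
cleaned = [normalize_language(r, ALIASES) for r in raw_labels]
print(cleaned)
```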

[–] fallowseed@lemmy.world 4 points 2 days ago* (last edited 2 days ago) (1 children)

"i only look at them for the analysis" ;) i'm curious what it is about the catalan/bulgarian/dutch-- is it because there are simply less releases in those countries so more entry-level stuff, does it have to do with how their language condenses?

[–] ImplyingImplications@lemmy.ca 1 points 2 days ago

It's probably just that people in those countries prefer their erotic comics to be short and to the point. The chart even has a bar for "speechless" so I don't think language density has much to do with page count. There's also a bar for "translated" which seems to show translators pick shorter comics, which is interesting.