this post was submitted on 02 Feb 2025
18 points (95.0% liked)
Data Is Beautiful
7097 readers
13 users here now
A place to share and discuss data visualizations. #dataviz
(under new moderation as of 2024-01, please let me know if there are any changes you want to see!)
founded 4 years ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
Each entry in the database contains a language and a number of pages. I sorted all the entries by language and took the average number of pages for each of them. But it also display a major weakness, each language don't have the same number of entries, some have thousands, others less than a hundred. I should have "normalized" the number of entry for each language and exclude languages which don't have enough entries.