29
submitted 5 months ago by comfydecal@infosec.pub to c/linux@lemmy.ml

Ubuntu seems like it has the best compatibility, but any other suggestions for data wrangling, data analysis, data visualization, and machine learning in Julia, Python and R?

top 19 comments
sorted by: hot top controversial new old
[-] bizdelnick@lemmy.ml 49 points 5 months ago

Any distro you are comfortable with.

[-] Nibodhika@lemmy.world 36 points 5 months ago

Which is the correct answer to 99% of questions that start with "Which distro..."

[-] thejevans@lemmy.ml 23 points 5 months ago

I do scientific computing and I've used Ubuntu, Debian, Fedora, Arch, and NixOS for work.

Any and all of these can do what you need. Hell, you could probably throw your whole development environment into a docker container and use it anywhere. Pick one and go with it.

That said, here are my preferences:

Right now, I really like NixOS and Nix for development environments, but it's a lot to learn, so I wouldn't recommend it unless you were really excited to try it.

Before NixOS, I used Arch on my laptop, and it was soooo nice to be able to build my own desktop environment just the way I wanted it from the ground up, which is possible on any distribution, but the Arch documentation makes this much more approachable. If you are happy with KDE Plasma or Gnome, and you're using well-supported hardware, then I wouldn't say Arch is really worth the time (unless you're excited to play with it).

Fedora and Nobara (a Fedora-based distribution with a lot of gaming-focused presets) have been a breath of fresh air coming off the heels of painstakingly setting up Arch and then NixOS. Fedora is pretty nice out of the box and Nobara has been the best experience of going from zero to gaming even when compared to Windows.

Debian (especially Debian 12) has been fantastic for servers and for machines that don't need to use the newest hardware. It's still my go-to for lots of things.

Ubuntu is fine, but Canonical, the company that makes it, has made some unfortunate choices lately, and with Debian 12 being as good as it is, I don't think I'll ever have a reason to go back.

Side note: One thing to look out for in the near future is System76's COSMIC desktop environment, which seems to be doing a lot of things right. There is already active development to get it working on NixOS, and I'm sure it will be available on Pop!OS from the start. I would also bet that it would be ready to go on Arch not long after. It will likely eventually be easy to install on all distributions, but if you want to try that out as soon as it's ready, one of those three would be a good option.

[-] genie@lemmy.world 4 points 5 months ago* (last edited 5 months ago)

That's awesome! I hadn't heard about COSMIC DE.

Well put. The one thing I would add is using the Nix package manager on a distro other than NixOS! I'm daily driving Fedora 39 + Nix (home-manager) with zero problems. My pick would either be Fedora or Debian.

Tons of good documentation either way. Flatpak the packages you, no kidding, need to be easy / consistent to debug. Non-root podman for containers. Nix for more up to date packages than are available in the native repos (especially useful with Debian) + the other benefits like nix-shell.

[-] beta_tester@lemmy.ml 10 points 5 months ago

Did you know you can use distrobox and use any distros package manager? There's no such thing as "distro x" has the most packages or is supported the best.

Use any distro you want. Atomic distros are the latest thing. Fedora silverblue or opensuse aeon

[-] steersman2484@sh.itjust.works 8 points 5 months ago

Just use any distro you like and install the packages you need. Done.

[-] darkl1nk@lemmy.ml 5 points 5 months ago* (last edited 5 months ago)

This is going to be unpopular, but you can easily compile both Python and R and configure them to your liking. For Python you can even use Anaconda3 and forget about installing most packages by yourself.

As for Julia, I usually just install the precompiled binary package.

So, any distribution you feel comfortable with will do.

[-] Ramin_HAL9001@lemmy.ml 4 points 5 months ago* (last edited 5 months ago)

For data science, it depends on what GPU you plan to use. If it's an Nvidia brand GPU, go with Ubuntu or Fedora. I say from personal experience that it is easier to get Nvidia drivers working on Ubuntu or Fedora than on most other distros I have tried. If it is a Radeon GPU, it will work fine on pretty much any distro at all since Radeon does a good job following Linux standard APIs for graphics card drivers, so for Radeon products I would also recommend Debian or Mint (along side Fedora and Ubuntu).

[-] Kata1yst@kbin.social 3 points 5 months ago

Learn docker on the distro you're most comfortable with.

[-] SmoochyPit@beehaw.org 3 points 5 months ago

I recently installed R on my Arch desktop to play with. Any Linux distro could work well if you install the right things, the distro mostly influences how they get installed afaik.

Bottles and docker could be helpful depending on software supports and your needs.

[-] slacktoid@lemmy.ml 3 points 5 months ago

Stick to ubuntu or red hat alts. They are whats gonna be in any cloud or HPC cluster.

[-] qjkxbmwvz@lemmy.sdf.org 2 points 5 months ago

Debian got me through grad school.

Not the latest and greatest (if you run stable), but if you need the latest e.g. Julia, it's not too bad to compile it.

[-] digdilem@lemmy.ml 2 points 5 months ago

We use EL (Specifically Rocky, a rebuild of Redhat) for this, but I strongly suspect that any of the main distros will be absolutely fine provided they have modern enough versions of the software you need.

[-] Aties@lemmy.world 1 points 5 months ago

Have you considered something like jupyterhub?

[-] Guenther_Amanita@feddit.de 1 points 5 months ago

Use anything you want. All distros should support those packages, use what you're the most comfortable with.

I personally would recommend Fedora Silverblue/ it's other atomic variants or uBlue especially.
It's pretty much unbreakable, modern and supports ALL distros' package managers through Distrobox. It's also pretty simple in my opinion, since you pretty much don't have to worry about traditional package management.

I think you're searching something reliable and simple, so this would be a solid choice.

Mint would be great too

[-] BiggestBulb@kbin.run 0 points 5 months ago

R-ch (pronounced Arch btw)

[-] filister@lemmy.world 0 points 5 months ago

I would say Nix but I presume you are a beginner so perhaps stick with Pop OS, Mint or Ubuntu.

[-] atzanteol@sh.itjust.works -2 points 5 months ago

I've been running Linux since the 90s and use popos. Nothing "beginner" about it.

[-] filister@lemmy.world 2 points 5 months ago

It is beginner friendly, that was my point. At the end of the day it is Linux and you can do everything you can do on a normal Linux too.

this post was submitted on 06 Jan 2024
29 points (87.2% liked)

Linux

45457 readers
2164 users here now

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

founded 5 years ago
MODERATORS