this post was submitted on 13 Feb 2024
21 points (70.6% liked)

Linux

48200 readers
1179 users here now

From Wikipedia, the free encyclopedia

Linux is a family of open source Unix-like operating systems based on the Linux kernel, an operating system kernel first released on September 17, 1991 by Linus Torvalds. Linux is typically packaged in a Linux distribution (or distro for short).

Distributions include the Linux kernel and supporting system software and libraries, many of which are provided by the GNU Project. Many Linux distributions use the word "Linux" in their name, but the Free Software Foundation uses the name GNU/Linux to emphasize the importance of GNU software, causing some controversy.

Rules

Related Communities

Community icon by Alpár-Etele Méder, licensed under CC BY 3.0

founded 5 years ago
MODERATORS
 

got told to crosspost over here to reach more people:

https://kbin.social/m/linuxquestions/p/4631784

I don't know if and how crossposting functions in kbin/lemmy, so hopefully it'll work that way

you are viewing a single comment's thread
view the rest of the comments
[–] rufus@discuss.tchncs.de 1 points 9 months ago* (last edited 9 months ago) (1 children)

Alright. You should also play with the options for the framebuffer, drm and video modes or forcefully enable some outputs. (I forgot how to do all of that.)

  • nvidia-drm.modeset ...
  • video="vesafb"
  • ...

I'd skip the general acpi=off since that only causes more issues and isn't feasable in the long run anyways. You need to find the option that specifically fixes only the issue with that one component that isn't working correctly.

Another idea: Maybe you can find the error message. Can you perhaps login via SSH from another machine? This would allow you to run dmesg while your screen is black. Maybe the error shows up in the dmesg kernel messages and you can take it from there. (Some installers even allow login from remote, that is a bit tricky but should be documented somewhere with the distro.)

[–] Varen@kbin.social 2 points 9 months ago* (last edited 9 months ago) (1 children)

@rufus

I will try and report back if I can get the dmesg output. But since I guess that the USB gets cut off right before something happens I guess there simply isn‘t.

Will try with the installed Nobara and see, if dmesg catches anything up at all

[–] rufus@discuss.tchncs.de 1 points 9 months ago* (last edited 9 months ago) (1 children)

But once you're trying to extract the log, you need to start it with acpi on. I mean we want to see that error happening.

Maybe paste the whole log somewhere on a pastebin service... If you manage to do it... Sometimes it's not an obvious error message. (...But something like the HDMI port being turned off...)

[–] Varen@kbin.social 2 points 9 months ago (1 children)

@rufus

I‘m afraid that it‘s where it will fail, cause I can‘t get it to work with acpi on

I thought of getting the „bad installed“ (acpi=off) OS, let it start with acpi on and afterwards grab the logs again with acpi off, but don‘t know if that will work…

[–] rufus@discuss.tchncs.de 1 points 9 months ago* (last edited 9 months ago) (1 children)

I don't know what gets written to disk on which distro and which logs are just kept in memory. dmesg alone just shows the current boot. I think if you're doing it that way journalctl --dmesg --boot=-1 would be the correct command. That should do it.

[–] Varen@kbin.social 2 points 9 months ago (1 children)

@rufus
Alright, will try later and report back the outcome / pastebin if I can grab it

[–] rufus@discuss.tchncs.de 1 points 9 months ago* (last edited 9 months ago) (2 children)

Good luck! 😀

(FYI: You can skip mentioning names that way, a direct reply will show up on Lemmy. And if you want to mention someone, you'd need to add the instance name for it to have an effect. i.e. @rufus@discuss.tchncs.de )

[–] Varen@kbin.social 2 points 9 months ago

Yeah thank you, I really hope we‘ll get some progress.

(Not doing it on purpose, it‘s just how kbin behaves 😅 really thinking about dropping it and give lemmy a try. Originally decided for kbin because I wanted both worlds but since the behaviour is so strange …yeah 😉)

[–] Varen@kbin.social 2 points 9 months ago* (last edited 9 months ago) (1 children)

Unfortunately nothing. Did install - reboot with acpi=off, reboot with no acpi parameter, reboot with acpi=off

Dmesg shows for the boot=-1 the first boot after install with acpi=off

No log for the try without acpi parameter 😢😢

Booting with acpi=off shows many logs with „IRQ not found for nvidia …“ (in the meaning, not wordly).

Edit: can‘t find an irq for your nvidia card

Edit 2: found a boot.log file. When trying to boot without acpi=off then no log is written, the bootprocess doesn‘t even start. From this point of view I‘d guess a Problem with UEFI. Still no idea what and where, but it‘s not graphics related if the bootprocess doesn‘t start at all… what d‘you think?

[–] rufus@discuss.tchncs.de 1 points 9 months ago (2 children)

I think you can safely ignore all the errors that happen while acpi=off. That will switch all kinds of things around and the operating system can't set up the hardware properly without it, so it is to be expected that half the things crap out and throw error messages. Could be a red herring anyways.

Are you sure Secure Boot is switched to "Other OS"? (see https://www.asus.com/support/faq/1049829/ ) You could verify that with the 'msinfo32' in the guide.

And I'm really not sure if it's the UEFI. From your description it seems you're getting to the boot loader and something happens after... Maybe try not messing with the acpi, but removing the "quiet" and "splash" if they're there and adding "nomodeset" instead. After you hit Enter (or Ctrl-X with Grub) the early kernel messages should pop up. Something with loading and initrd or like that. What happens then? Does it load the kernel? Do additional log messages with a boot process appear? (If it's too fast, you can try a video recording of your screen with your phone.)

[–] Varen@kbin.social 2 points 9 months ago (3 children)

took the video
https://youtu.be/855QTzZlhWk
as you can see, absolutely nothing happens or shows up. (still uploading rn, should be available shortly)

[–] PipedLinkBot@feddit.rocks 1 points 9 months ago

Here is an alternative Piped link(s):

https://piped.video/855QTzZlhWk

Piped is a privacy-respecting open-source alternative frontend to YouTube.

I'm open-source; check me out at GitHub.

[–] rufus@discuss.tchncs.de 1 points 9 months ago* (last edited 9 months ago) (1 children)

Really weird. Honesty, I currently have no idea how to proceed. Maybe I can google a bit later/tomorrow. But it's not looking good. There doesn't seem to be any good information out there concerning that mainboard an Linux. And since we don't get an error message there isn't a clear thing to begin with.

I would have expected at least something happening on the video. Maybe a few lines of text and then the screen flickering and going to black... But nothing?

[–] Varen@kbin.social 2 points 9 months ago (1 children)

yeah, that's the reason I am so desperate ... I understand, so maybe about time to look around for a new mobo?
anyways, I wanted to thank you very much for your time and effort in figuring out and trying to help me, I really do appreciate it very much!

[–] rufus@discuss.tchncs.de 1 points 9 months ago (1 children)

No problem. I wish I could have provided you with a solution. If you have something like a 14 days free return policy, I'd consider returning it. Otherwise, wait a few days before you start thinking about a new motherboard. Maybe someone with a better idea still replies to you.

But I can empathize with your situation. I also used to have hardware that was a mess or didn't work at all. It's just annoying.

[–] Varen@kbin.social 2 points 9 months ago (1 children)

No worries, at least you tried and I‘m grateful for it.

One last thing: so you would also go for mobo replacement? In my opinion its uefi/bios and a new mobo should help, but I like having more opinions…

Thank you again for trying

[–] rufus@discuss.tchncs.de 1 points 9 months ago

It's likely the mobo. A Ryzen processor and a nvidia 3070 isn't so uncommon. And you already said you updated the BIOS and put quite an amount of effort in.

If you can return it or have a friend in need of a mobo... I'd swap it... However, there is still a possibility that someone reads your post on the weekend and happens to know something more. Or you use Linux in a VM from within Windows. That may work, I don't know you, I wouldn't do it since I use Linux 99% of the time.

Only thing I would advise against is hoping the issue will someday get fixed. In many cases it never happens.

[–] rufus@discuss.tchncs.de 1 points 9 months ago* (last edited 9 months ago) (1 children)

I have another kernel option for you to try: "earlyprintk is useful when the kernel crashes before the normal console is initialized. It is not enabled by default because it has some cosmetic problems."

  • earlyprintk=vga,keep debug
  • earlyprintk=efi,keep or earlyprintk=bios,keep

I'm not sure if it gets you anywhere, but it could make the fist kernel messages show up.

And you could try replacing the acpi=off with acpi=noirq. If it's something with the interrputs, there are extra options like apic=irqfixup or nolapic (mind the difference apic <-> acpi). (Taken from this document)

I had the time to google a bit and I was right, I am about to run out of ideas. There is a good general guide in the arch wiki on how to approach issues:

https://wiki.archlinux.org/title/General_troubleshooting#Boot_problems

Maybe also read that, but that's pretty much it.

[–] Varen@kbin.social 2 points 9 months ago (1 children)

Wow thank you so much for keeping up on it!

I tried the earlyprintk options, but unfortunately none of them did work. Neither did the orher acpi options show up something, I do still have the exact same behaviour :(

I will work through the troubleshoot link you pasted, thank you as well for that.

Appreciate your help, I guess I‘ll start thinking about replacing the mobo

[–] rufus@discuss.tchncs.de 1 points 9 months ago (2 children)

I guess I‘ll start thinking about replacing the mobo

Yeah, I'm slowly getting to the same conclusion. You could try and rip out all other non-essential components to rule them out. If there are any. And go through the BIOS options once more, switch everything to "Other OS" and try the "legacy" modes for ACPI, boot etc. But at this point I somehow doubt any of this will make any difference. Just make sure the next mobo is alright 😆

[–] Varen@kbin.social 2 points 9 months ago (1 children)

Yeah I already got one in mind and looked it up on linux-hardware.org - any other option I‘d have to make sure the next one is alright? (Besides socket and compatibility with my other hw)

[–] rufus@discuss.tchncs.de 1 points 9 months ago (1 children)

Sorry I have no idea. It's been ages since I last bought a mainboard and that one was recommended in a computer magazine... I'd google it. See if other people have issues with it. And I't trust blogs, Reddit and forums more than the traditional compatibility charts.

[–] Varen@kbin.social 2 points 9 months ago (1 children)

Alright, no worries! You invested so much time and effort trying to help me out, I can‘t thank you enough for that, really really appreciate it much!

[–] rufus@discuss.tchncs.de 1 points 9 months ago* (last edited 9 months ago) (1 children)

Hehe, no worries. I think the community needs to stick together. I used to do (voluntary) computer support once a week at university before I moved, now I occasionally do it here. It's always nice being able to help people ...Or in your case at least trying... To me, it's way more fun than discussing politics or the latest news, anyways. And I mean I'm no exception. I also sometimes need support, ask questions about a new distro or get my itches scratched by people who solve my issues on Github.

[–] Varen@kbin.social 2 points 9 months ago

Oh I absolutely can relate haha
Maybe one day I‘ll be here asking dumb questions again, watch out 😁

[–] Varen@kbin.social 2 points 9 months ago* (last edited 9 months ago) (1 children)

got some news. I don't think, that it might change something, but who knows.
I added in grub the option "insmod progress" (which I found by googling somewhere). It should show, if kernel and initrd do load or not and now I can see, that the vmlinuz and initrd are loading to 100% and after that it hangs. So it looks like the kernel loads but then stucks.
As said before, I don't think that this might change something in regards to further tests with my actual mobo, but I didn't want to left that out ...

[–] rufus@discuss.tchncs.de 1 points 9 months ago

Ah nice. At least something. But I don't think it'll change anything since it's still grub outputting that, and not a life sign from the kernel.

[–] Varen@kbin.social 2 points 9 months ago* (last edited 9 months ago)

Yes, secure boot is set to Other OS - if set otherwise there’s a message after grub that secure boot is active, so pretty sure about that.

Unfortunately it doesn‘t show anything, even with no parameters at all. The only thing that shows up when changing the grub parameters is the „booting a command line“ message which stays there forever, nothing happens.
I‘ll record it with my phone when I get home later on today with the nomodeset parameter.