this post was submitted on 04 Dec 2023
699 points (92.7% liked)
Technology
59128 readers
2407 users here now
This is a most excellent place for technology news and articles.
Our Rules
- Follow the lemmy.world rules.
- Only tech related content.
- Be excellent to each another!
- Mod approved content bots can post up to 10 articles per day.
- Threads asking for personal tech support may be deleted.
- Politics threads may be removed.
- No memes allowed as posts, OK to post as comments.
- Only approved bots from the list below, to ask if your bot can be added please contact us.
- Check for duplicates before posting, duplicates may be removed
Approved Bots
founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
view the rest of the comments
They did no such thing. LLMs are probabilistic, not deterministic, and it can generate meaningful responses (to us) that the engineers neither predicted nor designed for.
I get what you're trying to say, but they are absolutely deterministic. All traditional (i.e., non quantum) computers and their programs are deterministic. Computation would be otherwise impossible. LLMs use a "random" seed value when generating their responses in order to "randomize" their responses, but it's all perfectly deterministic. The same input plus the same seed results in the exact same response.
Computers are just a series of binary switches, and programs and data are a bunch of instructions on how to initially set those switches before running a cycle of the CPU. It's deterministic at every step.
I put "random" in quotes because random number generators in software are also deterministic. They also use seed values (like the current time and the MAC address of the PC's network interface) to generate numbers that only seem random. When true randomness is needed, a physical source of entropy must be used like an atmospheric sampler.
The quirks of behavior you're talking about have nothing to do with randomness vs determinism. Their behavior comes from the fact that their data sources are extremely large, and the neural network that it runs on was not designed by a human with specific behaviors like most algorithms are. The weights of the nodes in the neural network were generated by training and not by programmers, and it's extremely complex, so no one can predict its output before running it.
Of course, this is true of even basic algorithms a lot of the time.
For purposes of this discussion pseudo random with weights is probabilistic, or so close to it that this distinction is irrelevant.