yogthos

joined 5 years ago
MODERATOR OF
 

Instead of just generating the next response, it simulates entire conversation trees to find paths that achieve long-term goals.

How it works:

  • Generates multiple response candidates at each conversation state
  • Simulates how conversations might unfold down each branch (using the LLM to predict user responses)
  • Scores each trajectory on metrics like empathy, goal achievement, coherence
  • Uses MCTS with UCB1 to efficiently explore the most promising paths
  • Selects the response that leads to the best expected outcome

Limitations:

  • Scoring is done by the same LLM that generates responses
  • Branch pruning is naive - just threshold-based instead of something smarter like progressive widening
  • Memory usage grows with tree size, there currently no node recycling
 

Instead of just generating the next response, it simulates entire conversation trees to find paths that achieve long-term goals.

How it works:

  • Generates multiple response candidates at each conversation state
  • Simulates how conversations might unfold down each branch (using the LLM to predict user responses)
  • Scores each trajectory on metrics like empathy, goal achievement, coherence
  • Uses MCTS with UCB1 to efficiently explore the most promising paths
  • Selects the response that leads to the best expected outcome

Limitations:

  • Scoring is done by the same LLM that generates responses
  • Branch pruning is naive - just threshold-based instead of something smarter like progressive widening
  • Memory usage grows with tree size, there currently no node recycling
[–] yogthos@lemmygrad.ml 6 points 2 weeks ago

he kinda looks like an action figure of a Bond villain

[–] yogthos@lemmygrad.ml 3 points 2 weeks ago (1 children)

Happy bday comrade!

[–] yogthos@lemmygrad.ml 4 points 2 weeks ago
[–] yogthos@lemmygrad.ml 6 points 2 weeks ago

That's my biggest worry as well, we already see just how unhinged the US is. It's very possible that these psychos will start a nuclear holocaust.

[–] yogthos@lemmygrad.ml 41 points 2 weeks ago (2 children)

Yeah, the math does not work here at all. The US very clearly cannot sustain any sort of war of attrition with an industrially advanced adversary at this point.

[–] yogthos@lemmygrad.ml 2 points 2 weeks ago

life finds a way :)

[–] yogthos@lemmygrad.ml 2 points 2 weeks ago

Thanks, and if I try doing something like that will def hit you up. :)

[–] yogthos@lemmygrad.ml 2 points 2 weeks ago (2 children)

lol don't think I have the wit to do skewers that good :)

view more: ‹ prev next ›