llm

[[2024 NeurIPS#[NeurIPS Tutorial Beyond Decoding Meta-Generation Algorithms for Large Language Models](https //neurips.cc/virtual/2024/tutorial/99522)]]

Greedy

  • pick top result at each steps

Top K Sampling

  • sample from top K

Top P Sampling / Nucleus Sampling

  • sample from the top tokens that add up to P

Beam Search

Speculative

Structured

Structured Generation with LLMs

MCTS