mirror of
https://github.com/ggml-org/llama.cpp.git
synced 2025-08-09 02:12:45 -04:00
* Fix top-p sampling to match the standard definition (smallest set that has probability mass at least p, not largest set with probability mass less than p) * top-p: correct gt to gte * add test for correct top-p behavior