llm-recipes

llama.cpp

Reference
- https://github.com/ggerganov/llama.cpp/blob/master/docs/build.md
- https://developer.nvidia.com/cuda-downloads
- https://github.com/ggerganov/llama.cpp/blob/master/examples/quantize/README.md