How was this quantized?
#1
by jlinux - opened
Can you share how this model was quantized? I'm unable to convert it with llama.cpp's convert.py and then load it successfully with either the BPE or SPM vocab. Your insights are appreciated :).
Closing: llama.cpp has a pending merge request adding support for this model, which successfully generates a GGUF.
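For reference, the usual llama.cpp flow (once that support is merged) looks roughly like the sketch below. Paths, the output filename, and the quantization type are placeholders, not taken from this thread:

```shell
# Sketch of the typical llama.cpp convert + quantize workflow.
# Requires a llama.cpp checkout built from source that includes
# support for this architecture; paths here are placeholders.

# 1. Convert the Hugging Face model directory to an f16 GGUF:
python3 convert.py /path/to/hf-model --outtype f16 --outfile model-f16.gguf

# 2. Quantize the f16 GGUF to a smaller format (e.g. Q4_K_M):
./quantize model-f16.gguf model-Q4_K_M.gguf Q4_K_M
```

If convert.py still fails on the vocab, the pending merge request is the likely fix, since vocab handling is where unsupported architectures usually break.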
jlinux changed discussion status to closed