Experimental AMD-focused FP4 quantization and backend work for llama.cpp, developed on Framework Desktop with AMD Strix Halo 395+ and 128 GB unified RAM. ROCmFP4 adds new GGUF tensor formats, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results