HyperFoldUK

Popular repositories

  1. sparse-ternary-fma (Public)

    High-performance ternary arithmetic kernel with 2-bit encoding and AVX-512 SIMD acceleration for FHE and AI applications

    C 1 1

  2. BitNet (Public)

    Forked from microsoft/BitNet

    Official inference framework for 1-bit LLMs

    C++ 1 1

  3. 2bit-ternary-bandwidth (Public)

    Surgical proof that 2-bit packed ternary encoding solves the memory bandwidth bottleneck in neural network inference.

    C 1

  4. fused-vs-unpacked-bench (Public)

    Fused computation on packed ternary data is fundamentally more efficient than decode-then-compute approaches. This is not about a specific FHE implementation or BitNet optimization. This is a compu…

    C 1

  5. llama.cpp (Public)

    Forked from ggml-org/llama.cpp

    LLM inference in C/C++

    C++
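
Several of the descriptions above mention 2-bit packed ternary encoding and fused computation on packed data. As a rough illustration of the general idea only, here is a minimal scalar sketch in C. The bit assignment (00 for 0, 01 for +1, 10 for -1) and the function names `trit_encode`, `pack_ternary`, and `dot_packed` are assumptions made for this example; they are not taken from the repositories, which may use a different layout and SIMD kernels.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed 2-bit codes (hypothetical, chosen for this sketch):
 * 00 -> 0, 01 -> +1, 10 -> -1, 11 -> unused. */
static uint8_t trit_encode(int8_t t) {
    assert(t >= -1 && t <= 1);
    return t == 0 ? 0u : (t > 0 ? 1u : 2u);
}

/* Pack n ternary weights into (n + 3) / 4 bytes, four weights per byte,
 * with weight i stored at bit offset 2 * (i % 4) of byte i / 4. */
static void pack_ternary(const int8_t *w, size_t n, uint8_t *out) {
    for (size_t i = 0; i < (n + 3) / 4; ++i) {
        out[i] = 0;
    }
    for (size_t i = 0; i < n; ++i) {
        out[i / 4] |= (uint8_t)(trit_encode(w[i]) << (2 * (i % 4)));
    }
}

/* Fused dot product: reads each 2-bit code straight from the packed
 * buffer and adds or subtracts the activation, without first decoding
 * the weights into a separate unpacked array. */
static int32_t dot_packed(const uint8_t *packed, const int8_t *x, size_t n) {
    int32_t acc = 0;
    for (size_t i = 0; i < n; ++i) {
        unsigned code = (packed[i / 4] >> (2 * (i % 4))) & 3u;
        if (code == 1u) {
            acc += x[i];
        } else if (code == 2u) {
            acc -= x[i];
        }
    }
    return acc;
}
```

With this layout, each weight occupies 2 bits instead of the 8 bits of an `int8_t`, so the weight buffer is a quarter of the size. A decode-then-compute variant would first expand `packed` back into an `int8_t` array and then run an ordinary dot product over it.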