    Repositories list

    • gpustack

      Public
      Manage GPU clusters for running LLMs
      Python
      Apache License 2.0
      Updated Oct 28, 2024
    • llama-box

      Public
      LLM inference server implementation based on llama.cpp.
      C++
      MIT License
      Updated Oct 26, 2024
    • HTML
      Updated Oct 25, 2024
    • TypeScript
      Updated Oct 25, 2024
    • fastfetch

      Public
      Like neofetch, but much faster because it is written mostly in C.
      C
      MIT License
      Updated Oct 24, 2024
    • Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
      Go
      MIT License
      Updated Oct 24, 2024
    • Deliver LLMs of GGUF format via Dockerfile.
      Go
      MIT License
      Updated Oct 24, 2024
    • .github

      Public
      Meta-GitHub repository for all GPUStack repositories.
      Apache License 2.0
      Updated Oct 24, 2024