GPUStack

8 repositories

gpustack
Public
Manage GPU clusters for running LLMs
Python • Apache License 2.0 • 38 • 448 • 59 • 2 • Updated Oct 28, 2024
llama-box
Public
LLM inference server implementation based on llama.cpp.
cpp llama gguf openai-compatible-api
C++ • MIT License • 1 • 15 • 0 • 0 • Updated Oct 26, 2024
gpustack.github.io
Public
HTML • 0 • 0 • 0 • 0 • Updated Oct 25, 2024
gpustack-ui
Public
TypeScript • 5 • 1 • 0 • 0 • Updated Oct 25, 2024
fastfetch
Public
Like neofetch, but much faster because written mostly in C.
C • MIT License • 404 • 0 • 0 • 0 • Updated Oct 24, 2024
gguf-parser-go
Public
Review/Check GGUF files and estimate the memory usage and maximum tokens per second.
go llama gguf
Go • MIT License • 4 • 36 • 0 • 0 • Updated Oct 24, 2024
gguf-packer-go
Public
Deliver LLMs of GGUF format via Dockerfile.
go llama gguf
Go • MIT License • 1 • 4 • 0 • 0 • Updated Oct 24, 2024
.github
Public
Meta-GitHub repository for all GPUStack repositories.
github-metadata
Apache License 2.0 • 0 • 0 • 0 • 0 • Updated Oct 24, 2024