tpu

2 repos

vllm

vllm-project

A high-throughput and memory-efficient inference and serving engine for LLMs

Featured

amd blackwell cuda

Python 80.8K 559 17.1K 1h ago

skypilot

skypilot-org

Run, manage, and scale AI workloads on any AI infrastructure. Use one system to access & manage all AI compute (Kubernetes, Slurm, 20+ clouds, on-prem).

Featured

cloud-computing cloud-management cost-optimization

Python 10.0K 32 1.1K 1h ago