model-serving

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

deep-learninggptllama

Python4.1K83308h ago

FedML

FedML-AI

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

ai-agentdeep-learningdistributed-training

Python4.0K17676mo ago

AI-Infra-from-Zero-to-Hero

HuaizhengZhang

🚀 Awesome System for Machine Learning ⚡️ AI System Papers and Industry Practice. ⚡️ System for Machine Learning, LLM (Large Language Model), GenAI (Generative AI). 🍻 OSDI, NSDI, SIGCOMM, SoCC, MLSys, etc. 🗃️ Llama3, Mistral, etc. 🧑‍💻 Video Tutorials.

ai-infragenailarge-language-models

4.0K4339010mo ago

lorax

predibase

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

fine-tuninggptllama

Python3.8K331410d ago