🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,439 1,044 Updated Mar 7, 2025

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,273 511 Updated May 3, 2024

QwenLM / Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,274 1,432 Updated Feb 25, 2025

LlamaFamily / Llama-Chinese

Llama中文社区，Llama3在线体验和微调模型已开放，实时汇总最新Llama3学习资料，已将所有代码更新适配Llama3，构建最好的中文Llama大模型，完全开源可商用

Python 14,465 1,292 Updated Sep 5, 2024

ahmetb / kubectl-cond

kubectl plugin to print Kubernetes resource conditions

Go 81 6 Updated Feb 13, 2025

kubernetes-sigs / jobset

JobSet: a k8s native API for distributed ML training and HPC workloads

Python 193 64 Updated Mar 6, 2025

volcano-sh / volcano

A Cloud Native Batch System (Project under CNCF)

Go 4,478 1,039 Updated Mar 7, 2025

lilianweng / lilianweng.github.io

My personal page

HTML 585 93 Updated Dec 26, 2024

ray-project / kuberay

A toolkit to run Ray applications on Kubernetes

Go 1,538 480 Updated Mar 7, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

32,257 1,743 Updated Aug 1, 2024

lapp0 / lm-inference-engines

Comparison of Language Model Inference Engines

207 8 Updated Dec 16, 2024

substratusai / kubeai

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Go 816 67 Updated Mar 7, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 81,638 9,818 Updated Mar 7, 2025

NVIDIA / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,265 2,721 Updated Mar 7, 2025

NVIDIA / NeMo-Framework-Launcher

Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.

Python 493 145 Updated Mar 6, 2025

openai / transformer-debugger

Python 4,066 243 Updated Jun 4, 2024

intel / ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 7,405 1,320 Updated Mar 7, 2025

mlflow / mlflow

Open source platform for the machine learning lifecycle

Python 19,693 4,379 Updated Mar 7, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

yu lin Syulin7

Achievements

Achievements

Block or report Syulin7

Starred repositories

kubernetes-sigs / lws

cloudwego / eino

ray-project / ray

iaping / go-okx

unslothai / unsloth

S-Lab-System-Group / Awesome-DL-Scheduling-Papers

kubernetes-sigs / kwok

NVIDIA / cuda-checkpoint

kubernetes-sigs / gateway-api-inference-extension

Byaidu / PDFMathTranslate

tkestack / vcuda-controller

Project-HAMi / HAMi

huggingface / accelerate