Syulin7
🎯 Focusing
  • Shanghai, China

Starred repositories

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 315 52 Updated Mar 7, 2025

The ultimate LLM/AI application development framework in Golang.

Go 1,960 127 Updated Mar 7, 2025

Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 35,849 6,082 Updated Mar 7, 2025

Golang SDK for the OKX v5 API

Go 40 17 Updated Aug 23, 2024

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! 🦥

Python 33,782 2,398 Updated Mar 7, 2025

Kubernetes WithOut Kubelet - Simulates thousands of Nodes and Clusters.

Go 2,726 216 Updated Mar 6, 2025

CUDA checkpoint and restore utility

C 298 15 Updated Jan 27, 2025

Gateway API Inference Extension

Jupyter Notebook 173 44 Updated Mar 6, 2025

PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents with the layout fully preserved; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/Docker/Zotero

Python 18,257 1,498 Updated Mar 6, 2025

Heterogeneous AI Computing Virtualization Middleware

Go 1,350 270 Updated Mar 7, 2025

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,439 1,044 Updated Mar 7, 2025

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,273 511 Updated May 3, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 17,274 1,432 Updated Feb 25, 2025

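Llama Chinese community: the Llama3 online demo and fine-tuned models are now available, the latest Llama3 learning resources are compiled in real time, and all code has been updated for Llama3. Building the best Chinese Llama large model, fully open source and available for commercial use.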

Python 14,465 1,292 Updated Sep 5, 2024

kubectl plugin to print Kubernetes resource conditions

Go 81 6 Updated Feb 13, 2025

JobSet: a k8s native API for distributed ML training and HPC workloads

Python 193 64 Updated Mar 6, 2025

A Cloud Native Batch System (Project under CNCF)

Go 4,478 1,039 Updated Mar 7, 2025

My personal page

HTML 585 93 Updated Dec 26, 2024

A toolkit to run Ray applications on Kubernetes

Go 1,538 480 Updated Mar 7, 2025

LLM101n: Let's build a Storyteller

32,257 1,743 Updated Aug 1, 2024

Comparison of Language Model Inference Engines

207 8 Updated Dec 16, 2024

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

Go 816 67 Updated Mar 7, 2025

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 81,638 9,818 Updated Mar 7, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 13,265 2,721 Updated Mar 7, 2025

Provides end-to-end model development pipelines for LLMs and Multimodal models that can be launched on-prem or cloud-native.

Python 493 145 Updated Mar 6, 2025

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discr…

Python 7,405 1,320 Updated Mar 7, 2025

Open source platform for the machine learning lifecycle

Python 19,693 4,379 Updated Mar 7, 2025