sgl-project / sglang Public

Pull requests: sgl-project/sglang

55 Open 2,110 Closed

Pull requests list

[Docs] add quantization docs

#3253 opened Feb 1, 2025 by yinfan98

4 tasks

[ROCm] Enable Fused MLA Triton kernel for DeepSeekV3

#3237 opened Jan 31, 2025 by lcskrishna • Draft

docs(references/deepseek): current section links

#3225 opened Jan 31, 2025 by guspan-tanadi

4 tasks

Add support for nvidia modelopt fp8 kv cache

#3223 opened Jan 30, 2025 by Edwardf0t1

1 of 4 tasks

Online serving benchmarks of real datasets for hierarchical KV caching

#3211 opened Jan 30, 2025 by PanJason

5 tasks

Specify torch==2.5.1 in pyproject.toml

#3210 opened Jan 30, 2025 by merrymercy

add support instructions for AMD GPUs

#3208 opened Jan 29, 2025 by Beichen-Ma

2 of 4 tasks

Fix min_p sampling crash when using flashinfer backend (label: flashinfer)

#3207 opened Jan 29, 2025 by zifeitong

3 of 4 tasks

Add a Doc about guide on nvidia jetson #3182 (label: documentation)

#3205 opened Jan 29, 2025 by lycanlancelot

2 of 4 tasks

feat: Support Janus-pro

#3203 opened Jan 29, 2025 by mickqian

3 of 4 tasks

Draft support for reasoning_content in API

#3202 opened Jan 29, 2025 by tot0

4 tasks

Fixing a typo engine.py

#3193 opened Jan 28, 2025 by didier-durand

Add deepseek_v3 fused gate

#3191 opened Jan 28, 2025 by NovTi

Add logit bias into the SGLang interface.

#3187 opened Jan 27, 2025 by cinjon

4 tasks

[Feature] Rewrite Sampling Parameter #3165

#3185 opened Jan 27, 2025 by tongyu0924

Initial Enablement of CI on MI300

#3168 opened Jan 27, 2025 by saienduri

[Feature] Define backends and add Triton backend for Lora

#3161 opened Jan 27, 2025 by Fridge003

4 tasks done

Apply sgl w8a8 fp8 kernel (label: high priority)

#3148 opened Jan 26, 2025 by HandH1998

[MOE] Try to optimize moe align block size multiblocks cuda kernel

#3137 opened Jan 26, 2025 by yiakwy-xpu-ml-framework-team • Draft

8 tasks

fix: Fix deprecated max_tokens param in openai ChatCompletionRequest

#3122 opened Jan 25, 2025 by mickqian

3 of 4 tasks

Add EngineFragment

#3120 opened Jan 25, 2025 by fzyzcjy

4 tasks

Split communication logic from computation logic into orchestrator

#3118 opened Jan 25, 2025 by fzyzcjy

4 tasks

Let DetokenizerManager use TypeBasedDispatcher

#3117 opened Jan 25, 2025 by fzyzcjy

4 tasks

Rename TokenizerManager to StdOrchestrator

#3116 opened Jan 25, 2025 by fzyzcjy

4 tasks

Extract generation_manager from tokenizer_manager

#3115 opened Jan 25, 2025 by fzyzcjy

4 tasks
