HaiShaw

HAI HaiShaw

39 followers · 8 following

San Francisco

Sponsoring

Achievements

x3 x2

Achievements

x3 x2

Pinned Loading

sgl-project/sglang Public

SGLang is a fast serving framework for large language models and vision language models.

Python 11.7k 1.2k
vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 41k 6.2k
microxcaling Public

Forked from microsoft/microxcaling

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python
pytorch/FBGEMM Public

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1.3k 549
ROCm/pytorch Public

Forked from pytorch/pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 222 62
cs231x/super-resolution-detection Public

End-to-End Super Resolution Object Detection Networks

Jupyter Notebook 11 2

240 contributions in the last year

Learn how we count contributions

Less

Activity overview

Contributed to sgl-project/sglang, vllm-project/vllm, HaiShaw/sglang and 10 other repositories

Contribution activity

March 2025

Created 4 commits in 1 repository

sgl-project/sglang 4 commits

Created a pull request in sgl-project/sglang that received 6 comments

Mar 7

ROCm: Flex Attention Enablement with custom backends

Credits: @poyenc , @amd-hhashemi , @linsun12 , @tenpercent , @carlushuang , @HaiShaw Motivation Add ROCm custom attention backends to enable Flex …

+1,435 −36 lines changed • 6 comments

Opened 4 other pull requests in 1 repository

sgl-project/sglang 1 open 3 merged

[ROCm/Draft/No-Merge]: Flex Attention Enablement
This contribution was made on Mar 7
ROCm: enable trillion-parameter MoE models with INT4-FP8 single node
This contribution was made on Mar 6
Fix the moe padding conditional logic
This contribution was made on Mar 5
ROCm: update aiter and its usage to fused moe (bloat16, fp8, fp8 block-quant)
This contribution was made on Mar 4

Reviewed 10 pull requests in 1 repository

sgl-project/sglang 10 pull requests

Update amd ci docker image to v0.4.3.post4-rocm630.
This contribution was made on Mar 7
ROCm: enable trillion-parameter MoE models with INT4-FP8 single node
This contribution was made on Mar 6
AMD/ROCm: update base image string
This contribution was made on Mar 6
Add tag suffix to nightly docker builds.
This contribution was made on Mar 6
Create release-docker-amd-nightly.yml
This contribution was made on Mar 5
ROCM: AITER BLOCK GEMM
This contribution was made on Mar 5
Fix breakage problem when using custom_ar
This contribution was made on Mar 4
Fix assert options.num_stages != 0 error in the latest ROCm build image
This contribution was made on Mar 4
ROCM support tree_speculative_sampling_target_only
This contribution was made on Mar 3
Enable custom AR for AMD GPUs and maintain it in sgl-kernel
This contribution was made on Mar 1

Created an issue in sgl-project/sglang that received 1 comment

Mar 4

[Feature] Add e4m3fnuz support to MoE-EP in FP8

Checklist 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussi…

2 tasks done

• 1 comment

	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar
Sun
Mon
Tue
Wed
Thu
Fri
Sat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HAI HaiShaw

Sponsoring

Achievements

Achievements

Block or report HaiShaw

Pinned Loading

240 contributions in the last year

Activity overview

Contribution activity

March 2025

Created a pull request in sgl-project/sglang that received 6 comments

ROCm: Flex Attention Enablement with custom backends

Created an issue in sgl-project/sglang that received 1 comment

[Feature] Add e4m3fnuz support to MoE-EP in FP8

	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar
Sun
Mon
Tue
Wed
Thu
Fri
Sat

	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar
Sun
Mon
Tue
Wed
Thu
Fri
Sat

	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct	Nov	Dec	Jan	Feb	Mar
Sun
Mon
Tue
Wed
Thu
Fri
Sat