Stars
Asterinas is a secure, fast, and general-purpose OS kernel, written in Rust and providing a Linux-compatible ABI.
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
Large Language Model (LLM) Systems Paper List
Reverse Engineering: Decompiling Binary Code with Large Language Models
CPU inference for the DeepSeek family of large language models in pure C++
FlashInfer: Kernel Library for LLM Serving
Curated collection of papers in MoE model inference
JAX bindings for the flash-attention3 kernels
Fast and memory-efficient exact attention
Custom Linux scheduler for concurrency fuzzing, written in Java with hello-ebpf
FlagGems is an operator library for large language models implemented in Triton Language.
My learning notes and code for ML systems (ML SYS).
Perceptual video quality assessment based on multi-method fusion.
Mirage: Automatically Generating Fast GPU Kernels without Programming in Triton/CUDA
Course materials for MIT6.5940: TinyML and Efficient Deep Learning Computing
GitHub page for "Large Language Model-Brained GUI Agents: A Survey"
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Yet Another Language Model: LLM inference in C++/CUDA, no libraries except for I/O
Port of OpenAI's Whisper model in C/C++
Building blocks for foundation models.
A visualized debugging framework to aid in understanding the Linux kernel.