ROCR-Runtime Public
Forked from ROCm/ROCR-Runtime
ROCm Platform Runtime: ROCr a HPC market enhanced HSA based runtime
C++ · Other · Updated Feb 9, 2025
awesome-cpp Public
Forked from fffaraz/awesome-cpp
A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.
MIT License · Updated Jan 10, 2025
iree Public
Forked from iree-org/iree
A retargetable MLIR-based machine learning compiler and runtime toolkit.
C++ · Apache License 2.0 · Updated Jan 10, 2025
awesome-tensor-compilers Public
Forked from merrymercy/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
Updated Jan 10, 2025
Triton-Compiler Public
Forked from gfvvz/triton-learning-materials
Triton Compiler related materials.
MIT License · Updated Jan 10, 2025
DeepLearningSystem Public
Forked from chenzomi12/AISystem
Deep Learning System core principles introduction.
Jupyter Notebook · Apache License 2.0 · Updated Jan 10, 2025
AI-System Public
Forked from microsoft/AI-System
System for AI Education Resource.
Python · Creative Commons Attribution 4.0 International · Updated Jan 10, 2025
HIP: C++ Heterogeneous-Compute Interface for Portability
C++ · MIT License · Updated Sep 30, 2024
cluster-data Public
Forked from google/cluster-data
Borg cluster traces from Google
TeX · Updated Sep 30, 2024
Awesome-GPU Public
Forked from Jokeren/Awesome-GPU
Awesome resources for GPUs
BSD 3-Clause "New" or "Revised" License · Updated Nov 15, 2023
circle Public
Forked from seanbaxter/circle
The compiler is available for download. Get it!
C++ · Updated Aug 19, 2023
triton Public
Forked from triton-lang/triton
Development repository for the Triton language and compiler
C++ · MIT License · Updated Jul 31, 2023
xla Public
Forked from openxla/xla
A machine learning compiler for GPUs, CPUs, and ML accelerators
C++ · Apache License 2.0 · Updated Jul 20, 2023
openxla-pjrt-plugin Public
Forked from rsuderman/openxla-pjrt-plugin
PJRT plugin for interfacing the OpenXLA compiler to Jax, PyTorch/XLA and TensorFlow
C++ · Apache License 2.0 · Updated Jun 23, 2023
llvm-project Public
Forked from llvm/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at…
Other · Updated Dec 23, 2022
openmlsys-zh Public
Forked from openmlsys/openmlsys-zh
《Machine Learning Systems: Design and Implementation》- Chinese Version
TeX · Updated Nov 3, 2022
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Python · Apache License 2.0 · Updated Oct 22, 2022
flax Public
Forked from google/flax
Flax is a neural network library for JAX that is designed for flexibility.
Python · Apache License 2.0 · Updated Oct 14, 2022
tensorflow-upstream Public
Forked from ROCm/tensorflow-upstream
TensorFlow ROCm port
C++ · Apache License 2.0 · Updated Sep 22, 2022
AwesomePerfCpp Public
Forked from fenbf/AwesomePerfCpp
A curated list of awesome C/C++ performance optimization resources: talks, articles, books, libraries, tools, sites, blogs. Inspired by awesome.
CSS · Updated Sep 22, 2022
pytorch Public
Forked from ROCm/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
C++ · Other · Updated Sep 8, 2022
recommenders Public archive
Forked from recommenders-team/recommenders
Best Practices on Recommendation Systems
Python · MIT License · Updated Sep 7, 2022
GPU-Puzzles Public
Forked from srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
Jupyter Notebook · MIT License · Updated Aug 7, 2022
moderngpu Public
Forked from moderngpu/moderngpu
Patterns and behaviors for GPU computing
C++ · Other · Updated Jun 26, 2022